LLMs
First time here? Go to our setup guide
Features
- Multiple model integrations: OpenAI, transformers, llama.cpp, exllama2, mamba
- Simple and powerful prompting primitives based on the Jinja templating engine
- Multiple choices, type constraints and dynamic stopping
- Fast regex-structured generation
- Fast JSON generation following a JSON schema
outlines-dev • GitHub - outlines-dev/outlines: Neuro Symbolic Text Generation
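To make the "multiple choices" bullet concrete, here is a minimal sketch of the general idea behind constrained generation: at each step, only tokens that keep the output on a path toward one of the allowed answers may be picked. This is a toy character-level illustration, not the outlines API or its implementation; `allowed_next`, `constrained_choice`, `toy_score`, and the `EOS` marker are hypothetical names, and `toy_score` merely stands in for real model logits.

```python
EOS = ""  # hypothetical end-of-sequence marker for this toy example


def allowed_next(prefix, choices):
    """Return the characters (or EOS) that keep `prefix` consistent with some choice."""
    allowed = set()
    for choice in choices:
        if choice == prefix:
            allowed.add(EOS)  # the prefix is already a complete answer
        elif choice.startswith(prefix):
            allowed.add(choice[len(prefix)])  # next character of a matching choice
    return allowed


def constrained_choice(score, choices):
    """Greedy decoding where disallowed tokens are masked out at every step."""
    if not choices:
        raise ValueError("choices must not be empty")
    prefix = ""
    while True:
        allowed = allowed_next(prefix, choices)
        # Pick the best-scoring token among the allowed ones only.
        best = max(sorted(allowed), key=lambda tok: score(prefix, tok))
        if best == EOS:
            return prefix
        prefix += best


def toy_score(prefix, token):
    """Stand-in for model logits: prefers higher code points, EOS scores lowest."""
    return ord(token) if token else -1


labels = ["Positive", "Negative", "Neutral"]
print(constrained_choice(toy_score, labels))
```

Whatever the scoring function prefers, the result is always one of the listed labels, which is the property a choice constraint is meant to guarantee.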
The quality of the dataset is 95% of everything. The remaining 5% is not ruining it with bad parameters.
After 500+ LoRAs made, here is the secret
Unlike consumers, enterprises want control over how their data is used and shared with companies, including the providers of AI software. Enterprises have spent a lot of effort consolidating data from different sources and bringing them in-house (this article Partner integrations + System of Intelligence: Today's deepest Moat by fellow Medium...
AI Startup Trends: Insights from Y Combinator's Latest Batch
- You have access to a proprietary asset (like data) that others don't have easy access to. In our "write job postings" example, perhaps you have a corpus of thousands of job postings including some outcome scores (as to how well they did). You could use this data to create better job postings. Others don't have ready access to this data. Note: The
Dharmesh Shah • How To Build a Defensible A.I. Startup
Protecting LLM products:
(1) Is hard to bootstrap. This already hints at existing customers, or you need to get a group of your customers to co-develop (the insurance model: companies pooling their data to solve a problem they all have). This runs into several issues: the competitive drive of the companies, data privacy, and security.
(2) Reserved for existing companies. This is the co-pilot model.
(3) This might be the most sustainable one, but it is also the hardest one. I have not seen anything in that direction yet besides OpenAI.
What's the best way for an end user to organize and explore millions of latent space features?
I've found tens of thousands of interpretable features in my experiments, and frontier labs have demonstrated results with a thousand times more features in production-scale models. No doubt, as interpretability techniques advance, we'll see feature maps...
Top considerations when choosing foundation models
Accuracy
Cost
Latency
Privacy
Top challenges when deploying production AI
Serving cost
Evaluation
Infra reliability
Model quality
Notion – The all-in-one workspace for your notes, tasks, wikis, and databases.
Overview
MaxText is a high performance, highly scalable, open-source LLM written in pure Python/Jax and targeting Google Cloud TPUs and GPUs for training and inference. MaxText achieves high MFUs and scales from single host to very large clusters while staying simple and "optimization-free" thanks to the power of Jax and the XLA compiler.
MaxText...
google • GitHub - google/maxtext: A simple, performant and scalable Jax LLM!
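For readers unfamiliar with the MFU figure mentioned above: model FLOPs utilization is the ratio of the FLOPs a training run actually performs per second to the hardware's theoretical peak. A minimal sketch follows, assuming the common approximation of about 6 FLOPs per parameter per token for a dense transformer's forward and backward pass (attention FLOPs ignored). The function name and all numbers are hypothetical and are not MaxText measurements.

```python
def model_flops_utilization(n_params, tokens_per_second, peak_flops_per_device, n_devices):
    """Estimate MFU as achieved training FLOPs/s divided by peak hardware FLOPs/s."""
    # Assumption: ~6 FLOPs per parameter per token (forward + backward), attention ignored.
    achieved_flops_per_second = 6.0 * n_params * tokens_per_second
    peak_flops_per_second = peak_flops_per_device * n_devices
    return achieved_flops_per_second / peak_flops_per_second


# Made-up illustrative numbers: a 1B-parameter model processing 100k tokens/s
# on one device with a 1e15 FLOPs/s peak.
mfu = model_flops_utilization(
    n_params=1e9,
    tokens_per_second=1e5,
    peak_flops_per_device=1e15,
    n_devices=1,
)
print(f"MFU: {mfu:.2f}")
```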
Why is Discord such a good GTM for AI applications?
Text interface. Most users are just generating images, videos, and audio in these Discord servers. Prompts are easily expressible in simple text commands. It's why we've seen image generation strategies like Midjourney (all-in-one) flourish in Discord while more raw diffusion models haven't grown...