LLMs
First time here? Go to our setup guide
Features
Features
- š¤ Multiple model integrations: OpenAI, transformers, llama.cpp, exllama2, mamba
- šļø Simple and powerful prompting primitives based on the Jinja templating engine
- š Multiple choices, type constraints and dynamic stopping
- ā” Fast regex-structured generation
- š„ Fast JSON generation following a JSON schema
outlines-dev ⢠GitHub - outlines-dev/outlines: Neuro Symbolic Text Generation
Setting up the necessary machine learning infrastructure to run these big models is another challenge. We need a dedicated model server for running model inference (using frameworks like Triton oder vLLM), powerful GPUs to run everything robustly, and configurability in our servers to make sure they're high throughput and low latency. Tuning the... See more
Developing Rapidly with Generative AI
š¤ Cutting-edge framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
- Why CrewAI
- Getting Started
- Key Features
- Examples
- Local Open Source Models
- CrewAI x AutoGen x ChatDev
- Contribution
- š¬ CrewAI Discord Community
- Hire Consulting
- License
joaomdmoura ⢠GitHub - joaomdmoura/crewAI: Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Whatās the best way for an end user to organize and explore millions of latent space features?
Iāve found tens of thousands of interpretable features in my experiments, and frontier labs have demonstrated results with a thousand times more features in production-scale models. No doubt, as interpretability techniques advance, weāll see feature maps... See more
Iāve found tens of thousands of interpretable features in my experiments, and frontier labs have demonstrated results with a thousand times more features in production-scale models. No doubt, as interpretability techniques advance, weāll see feature maps... See more
Shortwave ā rajhesh.panchanadhan@gmail.com [Gmail alternative]
weāre in a capability overhang - the AI tech that already exists has huge potential impact, whether you engage or not, so get ahead by exploring
the appropriate approach is pathfinding which uses experiments to learn and, critically, artefacts to tell the organisation what to do next.
the appropriate approach is pathfinding which uses experiments to learn and, critically, artefacts to tell the organisation what to do next.
Shortwave ā rajhesh.panchanadhan@gmail.com [Gmail alternative]
The need for better AI or LLM-specific infrastructure, along with the host of problems that come with non-deterministic of LLMs, means that thereās more software work ahead of us, not less. Abstraction layers like LLMs create more possibilities and thus, more work.
Is this a good thing or a bad thing? Iām not sure.
A great example of this is frontend... See more
Is this a good thing or a bad thing? Iām not sure.
A great example of this is frontend... See more
Shortwave ā rajhesh.panchanadhan@gmail.com [Gmail alternative]
- Multiple indices. Splitting the document corpus up into multiple indices and then routing queries based on some criteria. This means that the search is over a much smaller set of documents rather than the entire dataset. Again, it is not always useful, but it can be helpful for certain datasets. The same approach works with the LLMs themselves.
Matt Rickard ⢠Improving RAG: Strategies
Disruptive innovation comes in two flavors: (1) New-market disruption, where the company creates and claims a new segment in an existing market by catering to an underserved customer base, or (2) Low-end disruption, in which a company uses a low-cost business model to enter at the bottom of an existing market and claim a segment.
Copilots donāt... See more
Copilots donāt... See more
