LLMs

Matt Rickard Improving RAG: Strategies

microsoft DeepSpeed-FastGen

Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]

GitHub - kingjulio8238/memary: Longterm Memory for Autonomous Agents.

Mistral 7B is 187x cheaper compared to GPT-4

PromptIDE

Understanding the Cost of Generative AI Models in Production

Ask HN: What are some actual use cases of AI Agents right now? | Hacker News