LLMs

Understanding the Cost of Generative AI Models in Production

Announcing Together Inference Engine – the fastest inference available

PromptIDE

outlines-dev GitHub - outlines-dev/outlines: Neuro Symbolic Text Generation

Nathan Lambert The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data

Discord - A New Way to Chat with Friends & Communities

Adam Huda The Transformative Power of Generative AI in Software Development: Lessons from Uber's Tech-Wide Hackathon

New models and developer products announced at DevDay

GitHub - SeldonIO/MLServer: An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more