LLMs

Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]

Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]

r/LLMDevs - Reddit

Understanding the Cost of Generative AI Models in Production

Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]

GitHub - SeldonIO/MLServer: An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

swyx Tweet

GitHub - sqrkl/lm-evaluation-harness: A framework for few-shot evaluation of language models.

Understanding the Cost of Generative AI Models in Production