LLMs

GitHub - SeldonIO/MLServer: An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

GitHub - kingjulio8238/memary: Longterm Memory for Autonomous Agents.

Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]

Filimoa GitHub - Filimoa/open-parse: Improved file parsing for LLM’s

Developing Rapidly with Generative AI

thesephist.com Navigate, don't search

Context caching guide | Google AI for Developers | Google for Developers

Announcing Together Inference Engine – the fastest inference available