LLMs

microsoft DeepSpeed-FastGen

Sean Sheng Scaling AI Models Like You Mean It

GitHub - confident-ai/deepeval: The LLM Evaluation Framework

r/MachineLearning - Reddit

Dharmesh Shah How To Build a Defensible A.I. Startup

Ethan Mollick Almost an Agent: What GPTs can do

Context caching guide | Google AI for Developers | Google for Developers

Matei Zaharia, Omar Khattab, Lingjiao Chen, et al. The Shift From Models to Compound AI Systems