GitHub - confident-ai/deepeval: The LLM Evaluation Framework
github.com
Related
Take a look at our official page for user documentation and examples:
langtest.org
Key Features
Generate and execute more than 50 distinct types of tests with only 1 line of code (see the sketch below this card)
Test all aspects of model quality: robustness, bias, representation, fairness and accuracy.
Automatically augment training data based on test results (for select models)
...
GitHub - BrunoScaglione/langtest: Deliver safe & effective language models
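The "1 line of code" feature above refers to langtest's chained Harness workflow. The following is a minimal, hedged sketch assuming the Harness API described in the langtest README; the task, model name, and hub values here are illustrative assumptions, not taken from this page.

    from langtest import Harness  # assumes the langtest package is installed

    # Assumed example: wrap a Hugging Face NER model in a test Harness.
    harness = Harness(
        task="ner",
        model={"model": "dslim/bert-base-NER", "hub": "huggingface"},
    )

    # Generate test cases, run them, and print a pass/fail report
    # in one chained call, matching the one-liner feature above.
    harness.generate().run().report()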
Open LLM Leaderboard - a Hugging Face Space by open-llm-leaderboard
huggingface.co
Laminar
lmnr.ai
AgentBench: Evaluating LLMs as Agents
Evaluating Large Language Models (LLMs) as agents in interactive environments, highlighting the performance gap between API-based and open-source models, and introducing the AgentBench benchmark.
arxiv.org
LLM evaluation framework https://t.co/2KZiZyMz9I
Tom Dörr
x.com