GitHub - confident-ai/deepeval: The LLM Evaluation Framework...

GitHub - confident-ai/deepeval: The LLM Evaluation Framework

github.com

RelatedHighlights

LangChain

langchain.com

Rivet

rivet.ironcladapp.com

GitHub - arthur-ai/bench: A tool for evaluating LLMs

DeepEval — It’s a tool for easy and efficient LLM testing. Deepeval aims to make writing tests for LLM applications (such as RAG) as easy as writing Python unit tests.

Testing framework for LLM Part

Welcome to prompttools created by Hegel AI! This repo offers a set of open-source, self-hostable tools for experimenting with, testing, and evaluating LLMs, vector databases, and prompts. The core idea is to enable developers to evaluate using familiar interfaces like code, notebooks, and a local playground.

In just a few lines of codes, you can t