• Large variety of ready-to-use LLM evaluation metrics (all with explanations) powered by ANY LLM of your choice, statistical methods, or NLP models that run locally on your machine:

    • G-Eval

    • Summarization

    • Answer Relevancy

    • Faithfulness

    • Contextual Recall

    • Contextual Precision

    • RAGAS

    • Hallucination

    • Toxicity

    • Bias

    • etc.

from GitHub - confident-ai/deepeval: The LLM Evaluation Framework

Nicolay Gerold added 4mo ago
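
The DeepEval excerpt above mentions metrics that come "with explanations" and can be powered by statistical methods rather than an LLM. As a rough illustration of that idea only, here is a minimal sketch of a token-overlap recall score that returns both a number and a reason. It is not DeepEval's API or its Contextual Recall implementation; the names `MetricResult` and `toy_contextual_recall`, the `threshold` parameter, and its default of 0.6 are all assumptions made for this example.

```python
import re
from dataclasses import dataclass
from typing import List, Set


@dataclass
class MetricResult:
    score: float  # fraction of expected-output sentences covered by the context
    reason: str   # short human-readable explanation of the score


def _tokens(text: str) -> Set[str]:
    # Lowercased alphanumeric tokens; punctuation is dropped.
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def toy_contextual_recall(
    expected_output: str,
    retrieval_context: List[str],
    threshold: float = 0.6,  # hypothetical cutoff, chosen only for illustration
) -> MetricResult:
    # Pool the tokens of every retrieved chunk.
    context_tokens: Set[str] = set()
    for chunk in retrieval_context:
        context_tokens |= _tokens(chunk)

    # Naive sentence split on terminal punctuation followed by whitespace.
    sentences = [
        s.strip()
        for s in re.split(r"(?<=[.!?])\s+", expected_output)
        if s.strip()
    ]
    if not sentences:
        return MetricResult(0.0, "Expected output contains no sentences.")

    # A sentence counts as covered when enough of its tokens appear in the context.
    hits = 0
    for sentence in sentences:
        toks = _tokens(sentence)
        overlap = len(toks & context_tokens) / len(toks) if toks else 0.0
        if overlap >= threshold:
            hits += 1

    score = hits / len(sentences)
    reason = (
        f"{hits} of {len(sentences)} expected sentences are covered "
        "by the retrieval context."
    )
    return MetricResult(score, reason)
```

Calling `toy_contextual_recall("Paris is the capital of France. The Eiffel Tower opened in 1889.", ["Paris is the capital and largest city of France."])` scores only the first sentence as covered. An LLM-judged metric would make that attribution decision with a model instead of raw token overlap, which is cheaper here but blind to paraphrase.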

  • from LanceDB - LanceDB

    Nicolay Gerold added 8mo ago

  • from GitHub - dstackai/dstack: dstack is an open-source toolkit for running GPU workloads on any cloud. It works seamlessly with any cloud GPU providers. Discord: https://discord.gg/u8SmfwPpMd by dstackai

    Nicolay Gerold added 8mo ago

  • from GitHub - Portkey-AI/gateway: A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API. by Portkey-AI

    Nicolay Gerold added 9mo ago

  • from GitHub - skypilot-org/skypilot: SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface. by skypilot-org

    Nicolay Gerold added 10mo ago

  • from GitHub - Oxen-AI/oxen-release: Official repository for docs and releases of the Oxen CLI by Oxen-AI

    Nicolay Gerold added 10mo ago

  • from GitHub - VikParuchuri/marker: Convert PDF to markdown quickly with high accuracy by VikParuchuri

    Nicolay Gerold added 10mo ago

  • from Monitoring

    Nicolay Gerold added 10mo ago

  • from GitHub - lastmile-ai/aiconfig: aiconfig -- config-driven, source control friendly AI application development by lastmile-ai

    Nicolay Gerold added 10mo ago