Nicolay Gerold
@nicolaygerold
Nicolay Gerold
@nicolaygerold
promptfoo is a tool for testing and evaluating LLM output quality.... See more
With promptfoo, you can:
Systematically test prompts & models against predefined test cases
Evaluate quality and catch regressions by comparing LLM outputs side-by-side
Speed up evaluations with caching and concurrency
Score outputs automatically by defining test cases
Use as a
Second, TA teaches us to never overlook the simplest solutions. There are a lot of complicated methods to automatically find augmentation policies, but the simplest method was so-far overlooked , even though it performs comparably or better
They will start to support autoscaling in March. You can configure multiple clouds and they deploy to the cheapest one.