Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
License
openai/evals
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
About
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.