OpenAI Evals
Visit ToolOpenAI Evals is an open-source framework for evaluating large language models (LLMs) and LLM systems. It provides a registry of benchmarks and allows users to create custom evaluations for specific use cases.
At a glance
Trending