LiveBench
LiveBench is an open-source benchmark tool that evaluates large language models (LLMs) with a focus on contamination-free assessment. It provides a challenging set of tasks and objective ground-truth answers to accurately score models.
At a glance