AgentBench
Visit ToolAgentBench is an AI Agents & Automation tool that provides a comprehensive benchmark for evaluating Large Language Models (LLMs) as agents. It includes diverse environments and tasks to assess LLM performance in various scenarios.
At a glance
Trending
Also listed in