ContextualBench-Leaderboard
ContextualBench-Leaderboard is a benchmarking tool that provides a leaderboard for Large Language Model (LLM) evaluations. It allows users to view and submit model outputs for evaluation.
About
ContextualBench-Leaderboard, developed by Salesforce, is a platform for evaluating and comparing Large Language Models (LLMs). Its leaderboard tracks how various LLMs perform on benchmark evaluations, and users can submit their own model outputs to be evaluated against the same established benchmarks. This gives AI researchers and developers a standardized way to assess the accuracy and efficiency of their models. The live website currently shows a runtime error, however, so the platform may not be fully operational at this time.
Pricing & Plans
Likely Free
Free