Instruct-Eval
Instruct-Eval is a tool for quantitatively evaluating instruction-tuned large language models (LLMs). It allows researchers to benchmark models on held-out tasks and supports safety re-alignment.
At a glance
Pricing: not specified
Free tier: not specified
API: not specified
Skill level: Technical