Bloom By Safety Research
Visit ToolBloom by Safety Research generates evaluation suites to probe LLMs for specific behaviors like sycophancy, self-preservation, and political bias. It creates diverse test scenarios, runs conversations, and scores results to assess model behavior.
At a glance
Trending