GPTFuzz
Visit ToolGPTFuzz is an AI Agents & Automation tool that red teams large language models. It automatically generates jailbreak prompts to identify vulnerabilities and improve model security.
At a glance
Trending
GPTFuzz is an AI Agents & Automation tool that red teams large language models. It automatically generates jailbreak prompts to identify vulnerabilities and improve model security.
Trending
About
GPTFuzz is an open-source tool designed for red teaming large language models (LLMs) by automatically generating jailbreak prompts. This process helps identify vulnerabilities and weaknesses in AI models, ultimately enhancing their robustness and security. The repository provides the official codebase for "GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts." It includes datasets for harmful questions and human-written templates, along with a finetuned RoBERTa-large model for judgment. Researchers can use GPTFuzz to generate their own adversarial templates and contribute to building a general black-box fuzzing framework for LLMs.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending