Gpt-2-Output-Dataset
Visit Toolgpt-2-output-dataset is an Open Source dataset of GPT-2 outputs for research. It provides 250K documents from WebText and generated samples for various GPT-2 models.
At a glance
Trending
gpt-2-output-dataset is an Open Source dataset of GPT-2 outputs for research. It provides 250K documents from WebText and generated samples for various GPT-2 models.
Trending
About
gpt-2-output-dataset is an Open Source project by OpenAI providing a comprehensive dataset of GPT-2 outputs. It includes 250,000 documents from the WebText test set, alongside 250,000 random samples and 250,000 samples generated with Top-K 40 truncation for each GPT-2 model (small-117M, medium-345M, large-762M, xl-1542M). This dataset is specifically designed for research in areas such as detection of AI-generated text, biases, and more. It also offers samples from a GPT-2 model finetuned to output Amazon reviews, encouraging research into finetuned model detection. The project provides detectability baselines and a script for easy download of the data.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending