HomeDatasetslmsys/chatbot_arena_conversations
C

lmsys/chatbot_arena_conversations

Conversational · lmsys· 2.4K
cc 3.5 GB license:ccsize_categories:10K<n<100Kformat:parquetmodality:tabularmodality:text

Chatbot Arena Conversations Dataset This dataset contains 33K cleaned conversations with pairwise human preferences. It is collected from 13K unique IP addresses on the Chatbot Arena from April to June 2023. Each sample includes a question ID, two model names, their full conversation text in OpenAI API JSON format, the user vote, the anonymized user ID, the detected language tag, the OpenAI moderation API tag, the additional toxic tag, and the timestamp. To ensure the safe release… See the full description on the dataset page: https://huggingface.co/datasets/lmsys/chatbot_arena_conversations.

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull lmsys/chatbot_arena_conversations

Dataset details

Task
Conversational
License
cc
Size
3.5 GB
Rows / images
33.0K
Classes
13
Creator
lmsys
Downloads
2.4K
Source
huggingface_datasets
Updated
2023-09-30