HomeDatasetsllamafactory/demo_data
D

llamafactory/demo_data

Text Generation · llamafactory· 51.0K
apache-2.0 13 MB task_categories:text-generationlanguage:enlanguage:zhlicense:apache-2.0size_categories:1K<n<10K

- 1,000 examples from https://huggingface.co/datasets/llamafactory/alpacagpt4en - 1,000 examples from https://huggingface.co/datasets/llamafactory/alpacagpt4zh - 300 examples from https://huggingface.co/datasets/llamafactory/glaivetoolcallen - 300 examples from https://huggingface.co/datasets/llamafactory/glaivetoolcallzh - 91 examples for identity learning - 300 examples from https://huggingface.

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull llamafactory/demo_data

Dataset details

Task
Text Generation
Language
en
License
apache-2.0
Size
13 MB
Rows / images
4.2K
Creator
llamafactory
Downloads
51.0K
Source
huggingface_datasets
Updated
2024-07-18

About llamafactory/demo_data

- 1,000 examples from https://huggingface.co/datasets/llamafactory/alpacagpt4en - 1,000 examples from https://huggingface.co/datasets/llamafactory/alpacagpt4zh - 300 examples from https://huggingface.co/datasets/llamafactory/glaivetoolcallen - 300 examples from https://huggingface.co/datasets/llamafactory/glaivetoolcallzh - 91 examples for identity learning - 300 examples from https://huggingface.co/datasets/cognitivecomputations/SystemChat-2.0 - 6 examples for multimodal supervised fine-tuning - 300(en)+300(zh) examples from https://huggingface.co/datasets/hiyouga/DPO-En-Zh-20k - 300 examples from https://huggingface.co/datasets/argilla/kto-mix-15k - 300 examples from https://huggingface.co/datasets/allenai/c4 - 30 examples from https://huggingface.co/datasets/wikipedia