HomeDatasetsopenbmb/UltraData-SFT-2605
U

openbmb/UltraData-SFT-2605

Text Generation · openbmb· 42.0K
apache-2.0 335 GB task_categories:text-generationtask_categories:question-answeringlanguage:enlanguage:zhlicense:apache-2.0

UltraData-SFT-2605 📦 UltraData Collection | 🌐 UltraData | 🤗 MiniCPM5 Series English | 中文 📚 Introduction UltraData-SFT-2605 is the full set of core-domain SFT data used in the post-training of MiniCPM5-1B-SFT within the MiniCPM5-1B series, and a key representative of L3 refined data in the UltraData L0-L4 tiered data management framework. It covers math, code, knowledge, instruction following, and other core domains, containing over 15 million Deep… See the full description on the dataset page: https://huggingface.co/datasets/openbmb/UltraData-SFT-2605.

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull openbmb/UltraData-SFT-2605

Dataset details

Task
Text Generation
Language
en
License
apache-2.0
Size
335 GB
Creator
openbmb
Downloads
42.0K
Source
huggingface_datasets
Updated
2026-05-28