HomeDatasetstokyotech-llm/swallow-code-v2
S

tokyotech-llm/swallow-code-v2

Text Generation · tokyotech-llm· 16.4K
apache-2.0 4.7 TB task_categories:text-generationlanguage:enlicense:apache-2.0size_categories:100M<n<1Bformat:json

- 📑 arXiv: Read our paper for detailed methodology and results at arXiv:2505.02881. - 🤗 Sister Dataset: Discover SwallowMath-v2, our companion dataset for mathematical reasoning.

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull tokyotech-llm/swallow-code-v2

Dataset details

Task
Text Generation
Language
en
License
apache-2.0
Size
4.7 TB
Rows / images
1.7M
Creator
tokyotech-llm
Downloads
16.4K
Source
huggingface_datasets
Updated
2025-11-08

About tokyotech-llm/swallow-code-v2

- 📑 arXiv: Read our paper for detailed methodology and results at arXiv:2505.02881. - 🤗 Sister Dataset: Discover SwallowMath-v2, our companion dataset for mathematical reasoning.