C
codeparrot/codeparrot-clean
General · codeparrot
· 46.5K
Unknown
32 GB
size_categories:1M<n<10Mformat:jsonmodality:tabularmodality:textlibrary:datasets
A dataset of Python files from Github. This is the deduplicated version of the codeparrot.
# download instantly
mlforge datasets pull codeparrot/codeparrot-clean
Dataset details
Source
huggingface_datasets
About codeparrot/codeparrot-clean
A dataset of Python files from Github. This is the deduplicated version of the codeparrot.