HomeDatasetscodeparrot/codeparrot-clean
C

codeparrot/codeparrot-clean

General · codeparrot· 46.5K
Unknown 32 GB size_categories:1M<n<10Mformat:jsonmodality:tabularmodality:textlibrary:datasets

A dataset of Python files from Github. This is the deduplicated version of the codeparrot.

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull codeparrot/codeparrot-clean

Dataset details

Task
General
License
Unknown
Size
32 GB
Rows / images
477.0K
Creator
codeparrot
Downloads
46.5K
Source
huggingface_datasets
Updated
2022-10-10

About codeparrot/codeparrot-clean

A dataset of Python files from Github. This is the deduplicated version of the codeparrot.