HomeDatasetsopen-index/open-github
O

open-index/open-github

Text Generation · open-index· 94.0K
odc-by 50 GB task_categories:text-generationtask_categories:text-classificationtask_categories:feature-extractionlanguage:enlanguage:mul

This dataset contains every public event on GitHub: every push, pull request, issue, star, fork, code review, release, and discussion across all public repositories. GitHub is the world's largest software development platform, home to over 200 million repositories and the daily work of tens of millions of developers, from individual open-source contributors to the engineering teams behind the most

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull open-index/open-github

Dataset details

Task
Text Generation
Language
en
License
odc-by
Size
50 GB
Rows / images
368.2K
Creator
open-index
Downloads
94.0K
Source
huggingface_datasets
Updated
2026-04-09

About open-index/open-github

This dataset contains every public event on GitHub: every push, pull request, issue, star, fork, code review, release, and discussion across all public repositories. GitHub is the world's largest software development platform, home to over 200 million repositories and the daily work of tens of millions of developers, from individual open-source contributors to the engineering teams behind the most widely used software on earth.