HomeDatasetsnebius/SWE-rebench-V2
S

nebius/SWE-rebench-V2

Text Generation · nebius· 15.7K
cc-by-4.0 1.2 GB task_categories:text-generationlanguage:enlicense:cc-by-4.0size_categories:10K<n<100Kformat:parquet

SWE-rebench-V2 is a curated dataset of software-engineering tasks derived from real GitHub issues and pull requests. The dataset contains 32,079 samples covering Python, Go, TypeScript, JavaScript, Rust, Java, PHP, Kotlin, Julia, Elixir, Scala, Swift, Dart, C, C++, C, R, Clojure, OCaml, and Lua. For log parser functions, base Dockerfiles, and the prompts used, please see https://github.com/SWE-reb

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull nebius/SWE-rebench-V2

Dataset details

Task
Text Generation
Language
en
License
cc-by-4.0
Size
1.2 GB
Rows / images
32.1K
Classes
16
Creator
nebius
Downloads
15.7K
Source
huggingface_datasets
Updated
2026-05-12

About nebius/SWE-rebench-V2

SWE-rebench-V2 is a curated dataset of software-engineering tasks derived from real GitHub issues and pull requests. The dataset contains 32,079 samples covering Python, Go, TypeScript, JavaScript, Rust, Java, PHP, Kotlin, Julia, Elixir, Scala, Swift, Dart, C, C++, C, R, Clojure, OCaml, and Lua. For log parser functions, base Dockerfiles, and the prompts used, please see https://github.com/SWE-rebench/SWE-rebench-V2 The detailed technical report is available at “SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale”.