1 Star 1 Fork 0

Hugging Face 数据集镜像/SWE-bench

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
该仓库未声明开源许可证文件(LICENSE),使用请关注具体项目描述及其代码上游依赖。
克隆/下载
贡献代码
同步代码
取消
提示: 由于 Git 不支持空文件夾,创建文件夹后会生成空的 .keep 文件
Loading...
README
dataset_infoconfigs
featuressplitsdownload_sizedataset_size
namedtype
repostring
namedtype
instance_idstring
namedtype
base_commitstring
namedtype
patchstring
namedtype
test_patchstring
namedtype
problem_statementstring
namedtype
hints_textstring
namedtype
created_atstring
namedtype
versionstring
namedtype
FAIL_TO_PASSstring
namedtype
PASS_TO_PASSstring
namedtype
environment_setup_commitstring
namenum_bytesnum_examples
dev4783179225
namenum_bytesnum_examples
test441429262294
namenum_bytesnum_examples
train36761037719008
120092029416536482
config_namedata_files
default
splitpath
devdata/dev-*
splitpath
testdata/test-*
splitpath
traindata/train-*

Dataset Summary

SWE-bench is a dataset that tests systems’ ability to solve GitHub issues automatically. The dataset collects 2,294 Issue-Pull Request pairs from 12 popular Python. Evaluation is performed by unit test verification using post-PR behavior as the reference solution.

The dataset was released as part of SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Want to run inference now?

This dataset only contains the problem_statement (i.e. issue text) and the base_commit which can represents the state of the codebase before the issue has been resolved. If you want to run inference using the "Oracle" or BM25 retrieval settings mentioned in the paper, consider the following datasets.

princeton-nlp/SWE-bench_oracle

princeton-nlp/SWE-bench_bm25_13K

princeton-nlp/SWE-bench_bm25_27K

princeton-nlp/SWE-bench_bm25_40K

princeton-nlp/SWE-bench_bm25_50k_llama

Supported Tasks and Leaderboards

SWE-bench proposes a new task: issue resolution provided a full repository and GitHub issue. The leaderboard can be found at www.swebench.com

Languages

The text of the dataset is primarily English, but we make no effort to filter or otherwise clean based on language type.

Dataset Structure

Data Instances

An example of a SWE-bench datum is as follows:

instance_id: (str) - A formatted instance identifier, usually as repo_owner__repo_name-PR-number.
patch: (str) - The gold patch, the patch generated by the PR (minus test-related code), that resolved the issue.
repo: (str) - The repository owner/name identifier from GitHub.
base_commit: (str) - The commit hash of the repository representing the HEAD of the repository before the solution PR is applied.
hints_text: (str) - Comments made on the issue prior to the creation of the solution PR’s first commit creation date.
created_at: (str) - The creation date of the pull request.
test_patch: (str) - A test-file patch that was contributed by the solution PR.
problem_statement: (str) - The issue title and body.
version: (str) - Installation version to use for running evaluation.
environment_setup_commit: (str) - commit hash to use for environment setup and installation.
FAIL_TO_PASS: (str) - A json list of strings that represent the set of tests resolved by the PR and tied to the issue resolution.
PASS_TO_PASS: (str) - A json list of strings that represent tests that should pass before and after the PR application.

More Information needed

空文件

简介

Mirror of https://huggingface.co/datasets/princeton-nlp/SWE-bench 展开 收起
取消

发行版

暂无发行版

贡献者 (2)

全部

近期动态

Pushed new commit to main branch 1 year ago, 18fdf0d...283547a
Pushed a new main branch over 1 year ago
Created project over 1 year ago
不能加载更多了
马建仓 AI 助手
尝试更多
代码解读
代码找茬
代码优化
1
https://gitee.com/hf-datasets/SWE-bench.git
git@gitee.com:hf-datasets/SWE-bench.git
hf-datasets
SWE-bench
SWE-bench
main

搜索帮助