Multi-SWE-bench · GitHub
@multi-swe-bench

Multi-SWE-bench

A Multilingual Benchmark for Issue Resolving
👋 Hi, everyone!
We are ByteDance Seed team.

You can get to know us better through the following channels👇

Multi-SWE-bench

This organization contains the source code for Multi-SWE-bench, a multilingual benchmark for evaluating LLMs in real-world code issue resolution. Unlike existing Python-centric benchmarks (e.g., SWE-bench), the benchmark spans 7 languages (Java, TypeScript, JavaScript, Go, Rust, C, and C++) with 1,632 high-quality instances, curated from 2,456 candidates by 68 expert annotators for reliability.

Use the repositories in this organization to...

Multi-SWE-RL Dataset Overview

The Multi-SWE-RL Community is an open-source initiative focused on collaborative dataset creation for software engineering and reinforcement learning research. To foster active participation and recognize contributors, we introduce this Contribution Incentive Plan. By contributing high-quality data, you directly support advancements in AI research and earn recognition within the community.

Incentive Tiers:

  1. Be a Contributor: Get listed in the Contribution Progress Sheet
  2. Report Authorship: Become an author in future technical reports

Full details: Contribution Incentive Plan

Get Started in 2 Steps:

  1. Learn: Quick-Start Guide
  2. Try: Follow our Contribution Demo

Popular repositories

  1. multi-swe-bench (Public)

    Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

    Python · 190 stars · 19 forks

  2. MagentLess (Public)

    Python · 8 stars

  3. MSWE-agent (Public)

    Python · 8 stars · 2 forks

  4. experiments (Public)

    Shell · 7 stars · 1 fork

  5. MopenHands (Public)

    Python · 6 stars

  6. multi-swe-bench.github.io (Public)

    Vue · 1 star

Repositories

Showing 8 of 8 repositories

