10000 zfj1998 (fengji.zhang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zfj1998's full-sized avatar
🎯
Focusing
🎯
Focusing
  • City University of Hong Kong
  • Hong Kong, China
  • 06:57 (UTC +08:00)
  • X @FengjiZhang98

Highlights

  • Pro

Block or report zfj1998

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning

22 Updated May 27, 2025

Skywork-R1V2:Multimodal Hybrid Reinforcement Learning for Reasoning

Python 2,610 251 Updated Jun 5, 2025

Sampling profiler for Python programs

Rust 13,755 457 Updated Jun 5, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 422 32 Updated Apr 13, 2025

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Python 183 14 Updated Jun 8, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

854 38 Updated Jun 3, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 2,378 207 Updated Jun 8, 2025

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 903 61 Updated May 16, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,531 183 Updated Jun 6, 2025

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Python 5,757 738 Updated May 19, 2025

Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

Python 537 39 Updated Mar 16, 2025

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Python 1,403 130 Updated May 16, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 56,222 6,724 Updated May 16, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 51,830 6,271 Updated Jun 7, 2025

Inference code of Lingma SWE-GPT

Python 222 14 Updated Dec 2, 2024

Multimodal Large Language Models for Code Generation under Multimodal Scenarios

76 2 Updated Jun 6, 2025

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 2,001 76 Updated May 13, 2025

Codev-Bench (Code Development Benchmark), a fine-grained, real-world, repository-level, and developer-centric evaluation framework. Codev-Bench assesses whether a code completion tool can accuratel…

Python 42 1 Updated Nov 6, 2024

MapCoder: Multi-Agent Code Generation for Competitive Problem Solving

Python 147 28 Updated Feb 12, 2025

Enhancing AI Software Engineering with Repository-level Code Graph

Python 183 23 Updated Apr 1, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,744 376 Updated Jun 5, 2025

A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks

Python 12 Updated Feb 25, 2025

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 1,042 85 Updated Jan 22, 2025

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Python 677 104 Updated Dec 23, 2024

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,127 1,134 Updated Jun 5, 2025

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

5,826 897 Updated Sep 24, 2024

Baselines for all tasks from Long Code Arena benchmarks 🏟️

Python 30 5 Updated Mar 30, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,490 383 Updated Jun 8, 2025
Next
0