AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 343 65 Updated Jun 10, 2025

TheAgentCompany / TheAgentCompany

An agent benchmark with tasks in a simulated software company.

Python 392 54 Updated May 17, 2025

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,546 538 Updated May 3, 2024

opendilab / awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

1,188 61 Updated Feb 15, 2025

codefuse-ai / Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,591 167 Updated Jun 5, 2025

GAIR-NLP / O1-Journey

O1 Replication Journey

1,992 65 Updated Jan 14, 2025

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,135 1,137 Updated Jun 9, 2025

wshi83 / MedAdapter

[EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning

Python 33 Updated Dec 26, 2024

Adamdad / kat

[ICLR2025] Kolmogorov-Arnold Transformer

Python 783 49 Updated Mar 23, 2025

DeqingFu / transformers-icl-second-order

Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models.

Python 16 2 Updated Nov 19, 2024

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 57,760 6,587 Updated Jun 10, 2025

zlzGithub-0801 / GuardAgent-code

Forked from guardagent/code

Python 5 2 Updated Jan 8, 2025

guardagent / code

Python 11 7 Updated Jan 12, 2025

OpenMatch / NeuScraper

[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".

Python 226 19 Updated Aug 28, 2024

Lingkai-Kong / RE-Control

Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective

Python 32 4 Updated Jan 31, 2025

ritaranx / RAM-EHR

[ACL 2024] This is the code for our paper ”RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records“.

Python 37 3 Updated Sep 19, 2024

zoom-wang112358 / MOLLEO

Source code of MOLLEO

Python 44 3 Updated Apr 11, 2025

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 14,761 2,931 Updated Jun 10, 2025

JieyuZ2 / TaskMeAnything

[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.

Python 71 4 Updated Nov 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yuchen Zhuang night-chen

Achievements

Achievements

Highlights

Block or report night-chen

Stars

MLE-Dojo / MLE-Dojo

willccbb / verifiers

TIGER-AI-Lab / verl-tool

agentica-project / rllm

ritaranx / Collab-RAG

TsinghuaC3I / Awesome-RL-Reasoning-Recipes

punkpeye / awesome-mcp-servers

0russwest0 / Agent-R1

lmgame-org / GamingAgent

Jiayi-Pan / TinyZero

ServiceNow / BrowserGym

ServiceNow / AgentLab