10000 Mor-Li (Mo Li) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Mor-Li's full-sized avatar
  • Tsinghua University & Shanghai AI Lab
  • Shanghai
  • 22:07 (UTC +08:00)

Highlights

  • Pro

Block or report Mor-Li

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,656 188 Updated Jan 30, 2025

Code and Data for an expanded Tau-Bench with training and test sets in a variety of domains

Python 3 Updated Jan 25, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 53,644 6,570 Updated Jul 6, 2025
Python 154 7 Updated Jun 27, 2025

Muon is an optimizer for hidden layers in neural networks

Python 982 50 Updated Jul 4, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,773 1,438 Updated Jun 30, 2025

🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.

TypeScript 29,557 2,581 Updated Jul 6, 2025

Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with TensorFlow 2.0 and Keras.

Python 361 83 Updated Feb 6, 2024

⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.

Jupyter Notebook 38 5 Updated Jul 3, 2025

Port of OpenAI's Whisper model in C/C++

C++ 41,293 4,419 Updated Jul 2, 2025

The Triton TensorRT-LLM Backend

Shell 858 125 Updated Jul 2, 2025

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,145 445 Updated May 21, 2025

(ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation

Python 9 3 Updated May 21, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,776 208 Updated Jun 20, 2025

A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.

Python 98 13 Updated Jul 6, 2025

Implementation for OAgents: An Empirical Study of Building Effective Agents

Python 77 2 Updated Jul 1, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 15,696 1,977 Updated Jul 6, 2025

Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!

Python 6,397 534 Updated Jul 4, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,409 59 Updated May 11, 2025

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 74 4 Updated Jun 26, 2025

open-source coding LLM for software engineering tasks

Python 676 84 Updated Jun 27, 2025

[NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents

Python 111 8 Updated Mar 28, 2025

The data and code for paper: "SOPBench: Evaluating Language Agents at Following Standard Operating Procedures and Constraints"

Python 3 Updated Jun 4, 2025

EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in challenging tasks.

Python 275 10 Updated Jul 5, 2025

Evaluating Conversational Agents in a Dual-Control Environment

Python 40 1 Updated Jun 18, 2025

Code and Data for Tau-Bench

Python 651 94 Updated Jan 22, 2025

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,319 310 Updated Jul 3, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

PowerShell 17,657 986 Updated Jul 3, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 14,790 1,805 Updated Jul 5, 2025
Next
0