Tsinghua University & Shanghai AI Lab · Shanghai
Stars
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Ephibbs / big-tau
Forked from sierra-research/tau-bench. Code and data for an expanded Tau-Bench with training and test sets in a variety of domains.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Muon is an optimizer for hidden layers in neural networks
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
🍒 Cherry Studio is a desktop client that supports multiple LLM providers.
Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with TensorFlow 2.0 and Keras.
⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.
The Triton TensorRT-LLM Backend
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
(ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation
Search-R1: An efficient, scalable RL training framework for LLMs that interleave reasoning with search-engine calls, built on veRL.
A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.
Implementation for OAgents: An Empirical Study of Building Effective Agents
The official Python SDK for Model Context Protocol servers and clients
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!
An Open-source RL System from ByteDance Seed and Tsinghua AIR
A MemAgent framework that can extrapolate to 3.5M tokens, along with a framework for RL training of any agent workflow.
An open-source coding LLM for software engineering tasks.
[NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents
The data and code for paper: "SOPBench: Evaluating Language Agents at Following Standard Operating Procedures and Constraints"
EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in challenging tasks.
Evaluating Conversational Agents in a Dual-Control Environment
Holistic Evaluation of Language Models (HELM) is an open-source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible, and transparent evaluation of language models.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows.
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.