Alibaba
- China
Stars
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
A lightweight, powerful framework for multi-agent workflows
ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
verl: Volcano Engine Reinforcement Learning for LLMs
Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"
Collection of leaked system prompts
A Survey of Attributions for Large Language Models
Evaluating LLMs with fewer examples
🆔 A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.
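LLM API management and distribution system supporting OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot, iFLYTEK Spark, Tongyi Qianwen, 360 Zhinao, Tencent Hunyuan, and other mainstream models. It provides unified API adaptation and can be used for key management and redistribution. Ships as a single executable with a Docker image for one-click, out-of-the-box deployment. LLM API management & k…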
CDQA: Chinese Dynamic Question Answering Benchmark
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Evaluating tool-augmented LLMs in conversation settings
[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
A lightweight framework for building LLM-based agents
fanqiwan / KCA
Forked from 18907305772/KCA
EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
A high-throughput and memory-efficient inference and serving engine for LLMs
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
🦜🔗 Build context-aware reasoning applications
A fast, clean, responsive Hugo theme.
Retrieval and Retrieval-augmented LLMs
An Autonomous LLM Agent for Complex Task Solving
Evaluate the accuracy of LLM generated outputs
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding