-
Huazhong University of Science and Technology
- Wuhan, China
- https://jianyue.tech
Starred repositories
Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
A Datacenter Scale Distributed Inference Serving Framework
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
verl: Volcano Engine Reinforcement Learning for LLMs
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.
[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…
10 Lessons to Get Started Building AI Agents
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.
Thunderbird add-on to minimize Thunderbird using the window's close button
Train your AI self, amplify you, bridge the world
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Official Implementation of "KBLaM: Knowledge Base augmented Language Model"
A lightweight, powerful framework for multi-agent workflows
Official PyTorch implementation for "Large Language Diffusion Models"
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
DSPy: The framework for programming—not prompting—language models
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/pAbnFJrkgZ
No fortress, purely open ground. OpenManus is Coming.