Stars
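AIInfra (AI infrastructure) covers the AI system stack, from underlying hardware such as chips up to the upper software stack, that supports training and inference of large AI models.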
Official Implementation of "Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning" at EMNLP 2024 Main Conference
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
A bibliography and survey of the papers surrounding o1
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
My learning notes/codes for ML SYS.
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
[EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
Data related to the investigation of realtime censorship
A high-quality, one-stop, open-source data extraction tool for converting PDF to Markdown and JSON.
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS 2025]
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
The official GitHub repo for the open online course "Dive into LLMs".
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…
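MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus that aims to match the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.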
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)
TigerBot: A multi-language multi-task LLM
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
DSPy: The framework for programming—not prompting—language models
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
WIP: a project for engineering automated bots (mainly chatbots)
A collection of open-source datasets for training instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
Official Code for Paper: RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).