-
Worked at Kuaishou, Baidu, Meituan
- Beijing
- https://ageliss.github.io/gqjiang/
-
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
Python Apache License 2.0 UpdatedMay 23, 2025 -
NeMo-RL Public
Forked from NVIDIA/NeMo-RLScalable toolkit for efficient model reinforcement
Python Apache License 2.0 UpdatedMay 22, 2025 -
Awesome-LLM-Inference Public
Forked from xlite-dev/Awesome-LLM-Inference📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.
Python GNU General Public License v3.0 UpdatedApr 17, 2025 -
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedApr 16, 2025 -
Awesome-LLM-Compression Public
Forked from HuangOwen/Awesome-LLM-CompressionAwesome LLM compression research papers and tools.
MIT License UpdatedDec 24, 2024 -
-
llm_interview_note Public
Forked from wdndev/llm_interview_note主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
HTML UpdatedOct 22, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedAug 7, 2024 -
SpeculativeDecodingPapers Public
Forked from hemingkx/SpeculativeDecodingPapers📰 Must-read papers and blogs on Speculative Decoding ⚡️
Apache License 2.0 UpdatedJul 24, 2024 -
lmdeploy Public
Forked from InternLM/lmdeployLMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Python Apache License 2.0 UpdatedJun 25, 2024 -
EAGLE Public
Forked from SafeAILab/EAGLE[ICML'24] EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Python Apache License 2.0 UpdatedMay 26, 2024 -
llama.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedMar 5, 2024 -
Medusa Public
Forked from FasterDecoding/MedusaMedusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Jupyter Notebook Apache License 2.0 UpdatedFeb 27, 2024 -
NeMo Public
Forked from NVIDIA/NeMoNeMo: a framework for generative AI
Python Apache License 2.0 UpdatedFeb 17, 2024 -
rtp-llm Public
Forked from alibaba/rtp-llmRTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
C++ Apache License 2.0 UpdatedFeb 5, 2024 -
LAVIS Public
Forked from salesforce/LAVISLAVIS - A One-stop Library for Language-Vision Intelligence
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedJan 31, 2024 -
CLIP Public
Forked from openai/CLIPCLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Jupyter Notebook MIT License UpdatedJan 11, 2024 -
trlx Public
Forked from CarperAI/trlxA repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Python MIT License UpdatedJan 8, 2024 -
peft Public
Forked from huggingface/peft🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python Apache License 2.0 UpdatedNov 3, 2023 -
MS-AMP Public
Forked from Azure/MS-AMPMicrosoft Automatic Mixed Precision Library
Python MIT License UpdatedOct 30, 2023 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedOct 30, 2023 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Python Apache License 2.0 UpdatedOct 17, 2023 -
Megatron-DeepSpeed Public
Forked from deepspeedai/Megatron-DeepSpeedOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedSep 22, 2023 -
DeepSpeedExamples Public
Forked from deepspeedai/DeepSpeedExamplesExample models using DeepSpeed
Python Apache License 2.0 UpdatedSep 12, 2023 -
Awesome-Deep-Neural-Network-Compression Public
Forked from csyhhu/Awesome-Deep-Neural-Network-CompressionSummary, Code for Deep Neural Network Quantization
Python UpdatedAug 25, 2023 -
Chinese-LLaMA-Alpaca Public
Forked from ymcui/Chinese-LLaMA-Alpaca中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Python Apache License 2.0 UpdatedJul 23, 2023 -
FastChat Public
Forked from lm-sys/FastChatAn open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
Python Apache License 2.0 UpdatedJun 7, 2023 -
BELLE Public
Forked from LianjiaTech/BELLEBELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
HTML Apache License 2.0 UpdatedJun 3, 2023 -
lightseq Public
Forked from bytedance/lightseqLightSeq: A High Performance Library for Sequence Processing and Generation
C++ Other UpdatedMay 16, 2023 -
langchain Public
Forked from langchain-ai/langchain⚡ Building applications with LLMs through composability ⚡
Python MIT License UpdatedMay 12, 2023