University of California, San Diego · La Jolla, California
Stars
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
Cuckoo: A Series of IE Free Riders Using LLM's Resources to Scale up Themselves.
A bibliography and survey of the papers surrounding o1
The road to hack SysML and become a system expert
A framework for the evaluation of autoregressive code generation language models.
Tools for merging pretrained large language models.
Large Language Model Text Generation Inference
GPU programming related news and material links
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM. Best practice for training LLaMA models in Megatron-LM.
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
A curated list of papers and applications on tool learning.
A repo listing papers related to LLM-based agents
Train transformer language models with reinforcement learning.
A curated list of awesome resources dedicated to Scaling Laws for LLMs
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Chinese NLP solutions (large models, data, models, training, inference)
The nanoGPT-style implementation of RWKV Language Model - an RNN with GPT-level LLM performance.
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model, a low-resource Chinese llama+lora scheme whose structure follows alpaca
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
QLoRA: Efficient Finetuning of Quantized LLMs
Reading list for instruction tuning. A trend starting from Natural-Instructions (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).