8000 yaof20 (Feng Yao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yaof20's full-sized avatar
😶
I may be slow to respond
😶
I may be slow to respond
  • University of California, San Diego
  • La Jolla, California
  • 22:55 (UTC -07:00)

Highlights

  • Pro

Organizations

@thunlp

Block or report yaof20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,494 161 Updated May 11, 2025

Cuckoo: A Series of IE Free Riders Using LLM's Resources to Scale up Themselves.

Python 7 Updated Mar 7, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,192 50 Updated Nov 16, 2024

GRadient-INformed MoE

262 15 Updated Sep 25, 2024

The road to hack SysML and become an system expert

Emacs Lisp 483 59 Updated Sep 25, 2024

A framework for the evaluation of autoregressive code generation language models.

Python 944 244 Updated Oct 31, 2024

Tools for merging pretrained large language models.

Python 5,712 545 Updated May 14, 2025

Large Language Model Text Generation Inference

Python 10,118 1,194 Updated May 15, 2025

GPU programming related news and material links

1,506 88 Updated Jan 6, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 7,515 1,227 Updated May 15, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,305 2,476 Updated May 14, 2025

Best practice for training LLaMA models in Megatron-LM

Python 651 57 Updated Jan 2, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,010 182 Updated Mar 26, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,388 4,369 Updated May 16, 2025

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,175 3,691 Updated Jul 4, 2024

A curated list of papers and applications on tool learning.

119 4 Updated Dec 27, 2023

A repo lists papers related to LLM based agent

Python 1,638 90 Updated May 9, 2025

Train transformer language models with reinforcement learning.

Python 13,747 1,882 Updated May 16, 2025

A curated list of awesome resources dedicated to Scaling Laws for LLMs

71 5 Updated Apr 10, 2023

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 55,605 6,616 Updated Mar 31, 2025

Must-read Papers on LLM Agents.

2,380 141 Updated May 13, 2025

ChatGPT, GenerativeAI and LLMs Timeline

952 58 Updated May 19, 2024

FlashAttention2.0 with Lora

Python 9 Updated Jul 31, 2023

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,445 406 Updated Feb 12, 2025

The nanoGPT-style implementation of RWKV Language Model - an RNN with GPT-level LLM performance.

Python 184 12 Updated Nov 9, 2023
Python 24 1 Updated Nov 22, 2023

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

C 4,153 417 Updated Apr 18, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 18,413 1,871 Updated May 12, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,430 851 Updated Jun 10, 2024

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

769 24 Updated Jul 20, 2023
Next
0