henryzhongsc

🦈

正在摸鱼

Shaochen (Henry) Zhong henryzhongsc

🦈

正在摸鱼

CS PhD@Rice

38 followers · 39 following

Achievements

x2 x2

Achievements

x2 x2

Stars

Jingyu6 / hamburger

Python 13 Updated Jun 10, 2025

LeanModels / DFloat11

DFloat11: Lossless LLM Compression for Efficient GPU Inference

Python 425 27 Updated May 23, 2025

Eclipsess / Awesome-Efficient-Reasoning-LLMs

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

456 14 Updated Jun 16, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,826 278 Updated May 15, 2025

LiamMa / GRIT

This is an official implementation for "GRIT: Graph Inductive Biases in Transformers without Message Passing".

Python 125 14 Updated Dec 8, 2024

johnnyhwu / Awesome-LLM-Tabular

Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data

398 30 Updated Dec 22, 2024

sycny / RAE

[CIKM2024] Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering

Python 36 5 Updated Jan 12, 2025

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 5,701 620 Updated Jun 19, 2025

guanchuwang / Taylor-Unswift

Python 22 2 Updated Oct 3, 2024

BlinkDL / RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,722 916 Updated Jun 17, 2025

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 9,967 736 Updated Jun 4, 2025

microsoft / MInference

[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…

Python 1,056 53 Updated Jun 17, 2025

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,749 196 Updated Jun 20, 2025

daochenzha / data-centric-AI

A curated, but incomplete, list of data-centric AI resources.

1,111 78 Updated Jun 26, 2024

datamllab / ltsm

Understanding Different Design Choices in Training Large Time Series Models

Python 95 15 Updated Apr 14, 2025

datamllab / LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Python 653 60 Updated Jun 1, 2024

NirDiamant / RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 17,829 1,783 Updated Jun 17, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,238 354 Updated Jun 20, 2025

henryzhongsc / gnn_editing

Official implementation for Zhong & Le et al., GNNs Also Deserve Editing, and They Need It More Than Once. ICML 2024

Python 9 2 Updated Aug 5, 2024

opengear-project / GEAR

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM

Python 83F2 163 16 Updated Jul 12, 2024

hemingkx / SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

800 46 Updated Jun 18, 2025

hemingkx / Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 281 39 Updated Apr 22, 2025

datamllab / labnews

5 Updated Feb 12, 2024

jy-yuan / KIVI

[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 303 31 Updated Jan 19, 2025

NVIDIA-Merlin / Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Python 1,186 150 Updated Oct 8, 2024

yuxwind / CBS

Official Code of The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks[ICML2022]

Python 15 1 Updated Sep 20, 2022

ndanielsen / Same-Size-K-Means

A k-means variation that produces clusters of the same size utilizing the scikit-learn API and related utilities

Python 96 62 Updated Aug 23, 2022

DGraphXinye / 2022_finvcup_baseline

Python 100 27 Updated Jun 3, 2022

Harry24k / adversarial-attacks-pytorch

PyTorch implementation of adversarial attacks [torchattacks]

Python 2,039 362 Updated Jun 29, 2024

HiddenStrawberry / Crawler_Illegal_Cases_In_China

Collection of China illegal cases about web crawler 本项目用来整理所有中国大陆爬虫开发者涉诉与违规相关的新闻、资料与法律法规。致力于帮助在中国大陆工作的爬虫行业从业者了解我国相关法律，避免触碰数据合规红线。 [AD]企业租显卡算力部署AI请选Novagrid

HTML 4,149 303 Updated Mar 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shaochen (Henry) Zhong henryzhongsc

Achievements

Achievements

Block or report henryzhongsc

Stars

Jingyu6 / hamburger

LeanModels / DFloat11

Eclipsess / Awesome-Efficient-Reasoning-LLMs

deepseek-ai / open-infra-index

LiamMa / GRIT

johnnyhwu / Awesome-LLM-Tabular

sycny / RAE

allenai / OLMo

guanchuwang / Taylor-Unswift

BlinkDL / RWKV-LM

FlagOpen / FlagEmbedding

microsoft / MInference

fla-org / flash-linear-attention

daochenzha / data-centric-AI

datamllab / ltsm

datamllab / LongLM

NirDiamant / RAG_Techniques

linkedin / Liger-Kernel

henryzhongsc / gnn_editing

opengear-project / GEAR

hemingkx / SpeculativeDecodingPapers

hemingkx / Spec-Bench

datamllab / labnews

jy-yuan / KIVI

NVIDIA-Merlin / Transformers4Rec

yuxwind / CBS

ndanielsen / Same-Size-K-Means

DGraphXinye / 2022_finvcup_baseline

Harry24k / adversarial-attacks-pytorch

HiddenStrawberry / Crawler_Illegal_Cases_In_China