8000 henryzhongsc (Shaochen (Henry) Zhong) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View henryzhongsc's full-sized avatar
🦈
正在摸鱼
🦈
正在摸鱼

Block or report henryzhongsc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 13 Updated Jun 10, 2025

DFloat11: Lossless LLM Compression for Efficient GPU Inference

Python 425 27 Updated May 23, 2025

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

456 14 Updated Jun 16, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,826 278 Updated May 15, 2025

This is an official implementation for "GRIT: Graph Inductive Biases in Transformers without Message Passing".

Python 125 14 Updated Dec 8, 2024

Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data

398 30 Updated Dec 22, 2024

[CIKM2024] Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering

Python 36 5 Updated Jan 12, 2025

Modeling, training, eval, and inference code for OLMo

Python 5,701 620 Updated Jun 19, 2025
Python 22 2 Updated Oct 3, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,722 916 Updated Jun 17, 2025

Retrieval and Retrieval-augmented LLMs

Python 9,967 736 Updated Jun 4, 2025

[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…

Python 1,056 53 Updated Jun 17, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,749 196 Updated Jun 20, 2025

A curated, but incomplete, list of data-centric AI resources.

1,111 78 Updated Jun 26, 2024

Understanding Different Design Choices in Training Large Time Series Models

Python 95 15 Updated Apr 14, 2025

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Python 653 60 Updated Jun 1, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 17,829 1,783 Updated Jun 17, 2025

Efficient Triton Kernels for LLM Training

Python 5,238 354 Updated Jun 20, 2025

Official implementation for Zhong & Le et al., GNNs Also Deserve Editing, and They Need It More Than Once. ICML 2024

Python 9 2 Updated Aug 5, 2024

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM

Python 83F2 163 16 Updated Jul 12, 2024

📰 Must-read papers and blogs on Speculative Decoding ⚡️

800 46 Updated Jun 18, 2025

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 281 39 Updated Apr 22, 2025
5 Updated Feb 12, 2024

[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 303 31 Updated Jan 19, 2025

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Python 1,186 150 Updated Oct 8, 2024

Official Code of The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks[ICML2022]

Python 15 1 Updated Sep 20, 2022

A k-means variation that produces clusters of the same size utilizing the scikit-learn API and related utilities

Python 96 62 Updated Aug 23, 2022

PyTorch implementation of adversarial attacks [torchattacks]

Python 2,039 362 Updated Jun 29, 2024

Collection of China illegal cases about web crawler 本项目用来整理所有中国大陆爬虫开发者涉诉与违规相关的新闻、资料与法律法规。致力于帮助在中国大陆工作的爬虫行业从业者了解我国相关法律,避免触碰数据合规红线。 [AD]企业租显卡算力部署AI请选Novagrid

HTML 4,149 303 Updated Mar 24, 2025
Next
0