8000 Arist12 (Yikai Zhang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Arist12's full-sized avatar
:octocat:
Exploring the Unknown!
:octocat:
Exploring the Unknown!

Highlights

  • Pro

Block or report Arist12

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

My learning notes/codes for ML SYS.

Python 2,187 132 Updated May 14, 2025

A final sanity checklist to help your CS paper get accepted, not desk rejected.

914 102 Updated May 7, 2025

Revisiting Mid-training in the Era of RL Scaling

Jupyter Notebook 41 1 Updated Apr 24, 2025

A series of math-specific large language models of our Qwen2 series.

Python 929 133 Updated Jan 11, 2025

Simple RL training for reasoning

Python 3,563 265 Updated Apr 10, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,775 105 Updated Apr 3, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 41,258 6,829 Updated Dec 9, 2024

Paper list for Efficient Reasoning.

435 14 Updated May 14, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 54,513 1,529 Updated May 17, 2025

Development repository for the Triton language and compiler

MLIR 15,577 1,982 Updated May 17, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,223 254 Updated May 16, 2025

Awesome RL-based LLM Reasoning

490 25 Updated May 4, 2025

Puzzles for learning Triton

Jupyter Notebook 1,628 131 Updated Nov 18, 2024

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 4,244 453 Updated May 12, 2025

Collection of Summer 2025 tech internships!

37,708 2,907 Updated May 17, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 2,964 307 Updated May 16, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.

Python 4,003 9D93 277 Updated May 15, 2025

Material for gpu-mode lectures

Jupyter Notebook 4,441 448 Updated Feb 9, 2025

Machine Learning Engineering Open Book

Python 13,730 828 Updated May 8, 2025

Ongoing research training transformer models at scale

Python 12,361 2,766 Updated May 16, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,405 1,769 Updated May 17, 2025

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 599 106 Updated May 12, 2025

A tool for extracting plain text from Wikipedia dumps

Python 3,857 979 Updated May 23, 2024

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 38,810 3,038 Updated May 17, 2025

 Now we have become very big, Different from the original idea. Collect premium software in various categories.

JavaScript 82,935 6,519 Updated May 13, 2025

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 843 91 Updated May 3, 2024

LLM training in simple, raw C/CUDA

Cuda 26,606 3,058 Updated May 10, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,806 206 Updated Feb 25, 2025

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 461 30 Updated Mar 19, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,485 531 Updated May 3, 2024
Next
0