Stars
Using Tree-of-Thought Prompting to boost ChatGPT's reasoning
A dummy's guide to setting up (and using) HPC clusters on Ubuntu 22.04 LTS using Slurm and Munge. Created by the Quant Club @ UIowa.
FlashInfer: Kernel Library for LLM Serving
Running large language models on a single GPU for throughput-oriented scenarios.
A low-latency & high-throughput serving engine for LLMs
Disaggregated serving system for Large Language Models (LLMs).
A collection of benchmarks and datasets for evaluating LLMs.
Large Language Model (LLM) Systems Paper List
Triton implementation of FlashAttention2 that adds Custom Masks.
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
Doing simple retrieval from LLMs at various context lengths to measure accuracy
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David …
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
✨✨Latest Advances on Multimodal Large Language Models
Reading list for research topics in multimodal machine learning
A curated list for Efficient Large Language Models
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)
High-speed Large Language Model Serving for Local Deployment
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding