Optimizing-MLSys-Performance: Notes About How To Accelerate ML Systems
In this series of blog posts, I am going to recap what I learned about performance optimization in my previous work, especially for machine learning systems. I also want to introduce some of the newest techniques for accelerating LLMs.

  1. Introduction
  2. Where You Are: How To Measure and Profile A Program
  3. Where The Peak Is: How To Calculate The Theoretical Performance Upper Bound
    1. Roofline Model
    2. Get Your Own Benchmark For Hardware
  4. What You Can Do
    1. Maximize Hardware Utilization
      1. Cache Efficiency
      2. Multiprocessing
      3. Asynchronous Execution
      4. Pipelining
    2. Add or Upgrade Hardware
      1. Heterogeneous Computing: GPU, DSP, FPGA
      2. Distributed Computing
      3. NVLink and RDMA
    3. Less Work
      1. Quantization
    4. Beyond Von Neumann
      1. Quantum computing
      2. Computing with Memory
  5. Trending Applications
    1. Fast Attention
    2. Distributed Training and Inference
  6. My Previous Work
    1. Implementing GridSample on Tensilica Vision DSP
    2. Implementing Swin Transformer on CUDA
    3. Building Self-Driving Data Platform
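As a preview of the "Roofline Model" section above: the attainable throughput of a kernel is capped by either the hardware's peak compute rate or its memory bandwidth multiplied by the kernel's arithmetic intensity (FLOPs per byte of traffic). A minimal sketch, with illustrative (not measured) hardware numbers:

```python
# Roofline model sketch. PEAK_FLOPS and PEAK_BW are placeholder
# figures for a hypothetical GPU, not real benchmark results.
PEAK_FLOPS = 19.5e12  # peak compute, FLOP/s
PEAK_BW = 1.55e12     # memory bandwidth, bytes/s

def roofline_bound(flops: float, bytes_moved: float) -> float:
    """Upper bound on attainable FLOP/s for a kernel that performs
    `flops` operations while moving `bytes_moved` bytes to/from memory."""
    intensity = flops / bytes_moved  # arithmetic intensity, FLOP/byte
    return min(PEAK_FLOPS, PEAK_BW * intensity)

# Example: SAXPY (y = a*x + y) on n float32 elements does 2n FLOPs and
# moves 12n bytes (read x, read y, write y), so its intensity is
# 2/12 ≈ 0.17 FLOP/byte and it is memory-bound on this hardware.
n = 1 << 20
bound = roofline_bound(2 * n, 12 * n)
```

Plotting this bound against arithmetic intensity gives the characteristic "roofline" shape: a bandwidth-limited slope that flattens into the compute ceiling.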
