GitHub - osayamenja/Kleos: Complete GPU residency for ML.

Complete, efficient GPU residency for Distributed Machine Learning.

🌹Kleos: GPU-Resident Runtime for ML

Kleos is an ongoing research project exploring the design of GPU-resident infrastructure for distributed machine learning workloads. The goal is to eliminate CPU bottlenecks by fusing scheduling, communication, and compute directly on the GPU using lightweight, asynchronous primitives.

🎯 Vision and Methods

We aim to achieve complete, efficient GPU residency. We investigate optimizations that approach hardware peak performance for both distributed and single-GPU workloads, where bulk-synchronous, CPU-driven orchestration is a limiting factor. To realize this vision, we employ kernel fusion enabled by (1) algorithmic innovations with strong theoretical footing and (2) principled systems design and implementation.

This repository is a very early-stage release of the Kleos infrastructure.

🗞️ News

June 5, 2025 — ⚡️Introducing FlashDMoE, a fused GPU kernel for distributed MoE execution.
➤ See this README for details, benchmarks, and usage.

⚖️ License

This project is licensed under the BSD 3-Clause License. See LICENSE for full terms.

Repository contents (337 commits):
csrc
.gitignore
CITATION.cff
LICENSE
README.md
logow.png
