ints81

이준열 ints81

Hanyang University
222, Wangsimni-ro, Seongdong-gu, Seoul, Republic of Korea

Achievements

Stars

ashvardanian / less_slow.cpp

Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

C++ 1,785 67 Updated May 19, 2025

antgroup / glake

GLake: optimizing GPU memory management and IO transmission.

Python 466 41 Updated Mar 24, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,247 410 Updated Jun 11, 2025

iwanhae / kubegraph

Realtime Web Based Kubernetes Visualizer with WebAssembly and Controller Runtime

JavaScript 34 1 Updated May 1, 2023

lucidrains / titans-pytorch

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,372 125 Updated Jun 3, 2025

leimao / CUTLASS-Examples

CUTLASS and CuTe Examples

Cuda 53 9 Updated Jan 4, 2025

chiphuyen / aie-book

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

Jupyter Notebook 4,520 561 Updated Feb 12, 2025

siboehm / SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Cuda 738 115 Updated Dec 28, 2023

TylerYep / torchinfo

View model summaries in PyTorch!

Python 2,800 128 Updated Jun 9, 2025

flet-dev / flet

Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.

Python 13,426 535 Updated Jun 10, 2025

mehran-prs / snip

A simple and minimal command-line snippet manager

Go 76 1 Updated Feb 11, 2025

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 2,447 151 Updated Jun 7, 2025

eniac / paella

Paella: Low-latency Model Serving with Virtualized GPU Scheduling

C++ 59 6 Updated May 1, 2024

deepspeedai / DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,019 185 Updated Mar 26, 2025

jamesstringerparsec / Easy-GPU-PV

A Project dedicated to making GPU Partitioning on Windows easier!

PowerShell 4,890 489 Updated Jun 22, 2024

lesomnus / vfs

Virtual File System with `std::filesystem` API.

C++ 21 2 Updated Aug 12, 2023

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,319 7,901 Updated Jun 11, 2025

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 60,959 6,150 Updated Aug 24, 2024

kakaobrain / coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

Python 1,223 38 Updated Nov 30, 2022

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 29,320 6,014 Updated Jun 11, 2025

heyfey / vodascheduler

GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)

Go 34 3 Updated Nov 11, 2023

tobegit3hub / advisor

Open-source implementation of Google Vizier for hyper parameters tuning

Jupyter Notebook 1,556 257 Updated Nov 11, 2019

seungpyo / tensorflow

Forked from tensorflow/tensorflow

An Open Source Machine Learning Framework for Everyone

C++ 1 Updated Jun 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

이준열 ints81

Achievements

Achievements

Block or report ints81

Stars

ashvardanian / less_slow.cpp

antgroup / glake

ai-dynamo / dynamo

iwanhae / kubegraph

lucidrains / titans-pytorch

leimao / CUTLASS-Examples

chiphuyen / aie-book

siboehm / SGEMM_CUDA

TylerYep / torchinfo

flet-dev / flet

mehran-prs / snip

HazyResearch / ThunderKittens

eniac / paella

deepspeedai / DeepSpeed-MII

jamesstringerparsec / Easy-GPU-PV

lesomnus / vfs

vllm-project / vllm

labmlai / annotated_deep_learning_paper_implementations

kakaobrain / coyo-dataset

huggingface / diffusers

heyfey / vodascheduler

tobegit3hub / advisor

seungpyo / tensorflow

seungpyo / M3

albanie / convnet-burden

codertimo / BERT-pytorch

snuspl / nimble

dragen1860 / TensorFlow-2.x-Tutorials

prabhuomkar / pytorch-cpp

dbusbridge / tf-best-practices