8000 ints81 (이준열) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ints81's full-sized avatar
  • Hanyang University
  • 222, Wangsimni-ro, Seongdong-gu, Seoul, Republic of Korea

Block or report ints81

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

C++ 1,785 67 Updated May 19, 2025

GLake: optimizing GPU memory management and IO transmission.

Python 466 41 Updated Mar 24, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,247 410 Updated Jun 11, 2025

Realtime Web Based Kubernetes Visualizer with WebAssembly and Controller Runtime

JavaScript 34 1 Updated May 1, 2023

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,372 125 Updated Jun 3, 2025

CUTLASS and CuTe Examples

Cuda 53 9 Updated Jan 4, 2025

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

Jupyter Notebook 4,520 561 Updated Feb 12, 2025

Fast CUDA matrix multiplication from scratch

Cuda 738 115 Updated Dec 28, 2023

View model summaries in PyTorch!

Python 2,800 128 Updated Jun 9, 2025

Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.

Python 13,426 535 Updated Jun 10, 2025

A simple and minimal command-line snippet manager

Go 76 1 Updated Feb 11, 2025

Tile primitives for speedy kernels

Cuda 2,447 151 Updated Jun 7, 2025

Paella: Low-latency Model Serving with Virtualized GPU Scheduling

C++ 59 6 Updated May 1, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,019 185 Updated Mar 26, 2025

A Project dedicated to making GPU Partitioning on Windows easier!

PowerShell 4,890 489 Updated Jun 22, 2024

Virtual File System with `std::filesystem` API.

C++ 21 2 Updated Aug 12, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,319 7,901 Updated Jun 11, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 60,959 6,150 Updated Aug 24, 2024

COYO-700M: Large-scale Image-Text Pair Dataset

Python 1,223 38 Updated Nov 30, 2022

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 29,320 6,014 Updated Jun 11, 2025

GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)

Go 34 3 Updated Nov 11, 2023

Open-source implementation of Google Vizier for hyper parameters tuning

Jupyter Notebook 1,556 257 Updated Nov 11, 2019

An Open Source Machine Learning Framework for Everyone

C++ 1 Updated Jun 2, 2021

NVIDIA GPU Memory Map Manager

C 1 Updated Apr 13, 2021

Memory consumption and FLOP count estimates for convnets

MATLAB 924 114 Updated Jan 17, 2019

Google AI 2018 BERT pytorch implementation

Python 6,414 1,321 Updated Sep 15, 2023

Lightweight and Parallel Deep Learning Framework

C++ 263 32 Updated Nov 26, 2022

TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN, GAN, Auto-Encoders, FasterRCNN, GPT, BERT examples, etc. TF 2.0版入门实例代码,实战教程。

Jupyter Notebook 6,381 2,229 Updated Sep 23, 2020

C++ Implementation of PyTorch Tutorials for Everyone

C++ 2,048 270 Updated Mar 6, 2025

A starter kit for the TensorFlow Experiment and Estimator APIs

Python 9 7 Updated Nov 23, 2017
Next
0