xzngycw / Starred · GitHub

Material for gpu-mode lectures

Jupyter Notebook 4,597 460 Updated Feb 9, 2025

🎦 Video comparison player for Mac and Windows, built using Electron

Vue 204 12 Updated Mar 1, 2023

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 40,603 3,223 Updated Jun 12, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 51,193 7,457 Updated Jun 15, 2025

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Cuda 419 80 Updated Sep 8, 2024

ONNX-TensorRT: TensorRT backend for ONNX

C++ 3,092 544 Updated Jun 16, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,719 2,205 Updated May 21, 2025

Implementation of popular deep learning networks with TensorRT network definition API

C++ 7,392 1,821 Updated May 10, 2025

Repository for the book "Crafting Interpreters"

HTML 9,857 1,137 Updated Aug 7, 2024

MegCC is a deep learning model compiler with an ultra-lightweight runtime that is efficient and easy to port

C++ 485 58 Updated Oct 23, 2024

Various translations of OSTEP can be found here. Help the cause and contribute!

2,863 490 Updated Jan 20, 2025

One second to read GitHub code with VS Code.

TypeScript 23,096 893 Updated Jun 6, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,375 3,599 Updated Jun 16, 2025

Simplify your onnx model

C++ 4,097 399 Updated Sep 3, 2024

Low-precision matrix multiplication

C++ 1,805 457 Updated Jan 29, 2024
C++ 312 90 Updated Dec 20, 2024

Winograd minimal convolution algorithm generator for convolutional neural networks.

Python 618 146 Updated Oct 17, 2020