8000 yofufufufu (Weikai Tang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yofufufufu's full-sized avatar
  • Jilin University

Highlights

  • Pro

Block or report yofufufufu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Multi-GPU Platforms.

Cuda 40 4 Updated Mar 17, 2024

NVIDIA tools guide

Cuda 133 5 Updated Jan 7, 2025

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 443 40 Updated Apr 15, 2025

Inference code for Llama models

Python 58,306 9,778 Updated Jan 26, 2025

The official Meta Llama 3 GitHub site

Python 28,751 3,393 Updated Jan 26, 2025
Python 607 66 Updated Jun 4, 2024

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,326 3,591 Updated May 31, 2025

Simple samples for TensorRT programming

Python 1,606 348 Updated May 27, 2025

Fast and memory-efficient exact attention

Python 17,605 1,710 Updated May 22, 2025

Fast CUDA matrix multiplication from scratch

Cuda 729 112 Updated Dec 28, 2023

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 1,054 155 Updated Jul 29, 2023

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 4,552 479 Updated May 28, 2025

Repository for HPCGame 1st Problems.

Go 62 8 Updated Feb 6, 2024

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,922 1,949 Updated Apr 4, 2024

Grok open release

Python 50,296 8,352 Updated Aug 30, 2024

Wiki fo HPC

Python 112 10 Updated Jan 13, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,656 2,197 Updated May 21, 2025

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,414 639 Updated May 30, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,604 1,459 Updated May 31, 2025

The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++

CSS 43,741 5,489 Updated May 8, 2025

Multi-GPU dynamic scheduler using PGAS style cross-GPU communication

Cuda 28 3 Updated Jul 23, 2023

Enterprise graph machine learning framework for billion-scale graphs for ML scientists and data scientists.

Python 413 63 Updated May 30, 2025

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 12,693 409 Updated May 28, 2025
C++ 538 94 Updated May 29, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 13,782 1,976 Updated May 29, 2025

stdgpu: Efficient STL-like Data Structures on the GPU

C++ 1,220 91 Updated Apr 16, 2025

🎃 GPU load-balancing library for regular and irregular computations.

C++ 62 4 Updated Jun 14, 2024

cuGraph - RAPIDS Graph Analytics Library

Cuda 1,969 328 Updated May 30, 2025

Programmable CUDA/C++ GPU Graph Analytics

C++ 1,027 207 Updated Jul 30, 2024

🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

Rust 53,789 6,107 Updated Aug 29, 2024
Next
0