Stars
StudyingShao / cutlass
Forked from NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
A PyTorch Toolbox for Grouped GEMM in MoE Model Training
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
The Triton TensorRT-LLM Backend
fanshiqing / grouped_gemm
Forked from tgale96/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
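As background for the grouped GEMM entries above: a grouped GEMM performs several independent matrix multiplies, whose shapes may all differ, in a single call, which is the pattern MoE training needs when each expert receives a different number of tokens. A minimal pure-Python sketch of the semantics (illustrative only, not the API of the `grouped_gemm` bindings, which dispatch to fused CUTLASS kernels on GPU):

```python
def matmul(a, b):
    """Plain-Python matmul for 2-D lists: (m x k) @ (k x n) -> (m x n)."""
    k, n = len(b), len(b[0])
    return [[sum(row[i] * b[i][j] for i in range(k)) for j in range(n)]
            for row in a]

def grouped_gemm(lhs_groups, rhs_groups):
    """Multiply each (A_g, B_g) pair independently; shapes may differ per group."""
    return [matmul(a, b) for a, b in zip(lhs_groups, rhs_groups)]

# Two "experts": the first sees 2 tokens, the second sees 1 token.
a_groups = [[[1, 2], [3, 4]],          # 2 x 2 activations for expert 0
            [[5, 6]]]                  # 1 x 2 activations for expert 1
b_groups = [[[1, 0], [0, 1]],          # expert 0 weight (identity)
            [[2, 0], [0, 2]]]          # expert 1 weight (scale by 2)
out = grouped_gemm(a_groups, b_groups)
# out[0] == [[1, 2], [3, 4]]; out[1] == [[10, 12]]
```

The point of the real libraries is that this loop runs as one fused GPU kernel launch instead of one GEMM per expert, avoiding launch overhead and idle SMs on small per-expert batches.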
StudyingShao / TensorRT-LLM
Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.