gongshaotian (RAM) / Starred · GitHub

SGLang is a fast serving framework for large language models and vision language models.

Python 14,306 1,737 Updated May 14, 2025

A self-learning tutorial for CUDA High Performance Programming.

JavaScript 628 69 Updated Apr 12, 2025

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Python 12,580 3,026 Updated May 14, 2025

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 84,586 7,321 Updated May 12, 2025

Ongoing research training transformer models at scale

Python 12,340 2,759 Updated May 13, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,996 355 Updated May 14, 2025

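AIInfra (AI infrastructure) refers to the AI system stack, from low-level hardware such as chips up to the upper software stack, that supports training and inference of large AI models.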

Python 2,454 327 Updated May 13, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,241 7,394 Updated May 14, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,636 764 Updated May 12, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,761 276 Updated Apr 14, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,539 834 Updated Apr 29, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,323 587 Updated May 13, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 1 Updated Mar 2, 2025

All-in-One Development Tool based on PaddlePaddle

Python 5,449 1,019 Updated May 13, 2025

Automation feature development group based on PaddlePaddle

Python 10 45 Updated May 14, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,276 3,587 Updated May 13, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 89,901 24,144 Updated May 14, 2025

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 1 Updated Aug 28, 2024

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 864 164 Updated Dec 30, 2024

PaddlePaddle Developer Community

Jupyter Notebook 108 285 Updated May 12, 2025

This GitHub Action creates a GitHub contribution calendar on a 3D profile image.

TypeScript 1,260 219 Updated May 8, 2025

record daily learning

C 11 3 Updated Aug 13, 2019

nvidia-modelopt is a unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for do…

Python 913 68 Updated May 9, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 7,462 1,223 Updated May 13, 2025

Samples for CUDA Developers which demonstrate features in CUDA Toolkit

C 7,426 2,023 Updated May 5, 2025

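PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)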

C++ 1 Updated May 6, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,573 2,191 Updated May 9, 2025

pybind11 Chinese documentation (personal translation)

269 60 Updated Jul 24, 2023

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 16,574 1,973 Updated Jan 27, 2025

A PyTorch implementation of a method based on Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation, applied to human pose estimation tasks using stereo images.

Python 10 2 Updated Jan 25, 2024