IronSublimate (TheIronSubliamte) / Starred · GitHub
Starred repositories

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 4,600 485 Updated Jun 3, 2025

Awesome Mobile LLMs

197 12 Updated Jun 2, 2025

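[Domestic VPN ranking] Recommendations and reviews of the best VPNs for circumventing internet restrictions - VPN, circumvention tools, proxy providers, v2ray, trojan, shadowsock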

1 Updated Dec 15, 2023

Hand-written CUDA operators and an interview guide

Cuda 400 49 Updated Jan 15, 2025

how to optimize some algorithm in cuda.

Cuda 2,239 196 Updated May 25, 2025

Make RepVGG Greater Again: A Quantization-aware Approach

Python 23 2 Updated Mar 8, 2024

How to become a self-consistent programmer

Shell 2,844 129 Updated Apr 16, 2025

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,116 223 Updated Mar 4, 2025
Dockerfile 282 44 Updated May 23, 2025

Docker image for Remote Desktop server with audio support

Shell 291 140 Updated Apr 5, 2025

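A Chinese-language tutorial on C++ templates. Unlike the well-known book C++ Templates, this series teaches C++ templates as a Turing-complete language, aiming to help readers master meta-programming. (Work in progress)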

C++ 10,173 1,600 Updated Aug 20, 2024

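This project generates images from text. Based on the open-source model Stable Diffusion V1.5, it produces models that can run on a phone's CPU and NPU, together with the accompanying model runtime framework.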

C++ 196 25 Updated Mar 29, 2024

Tech Enthusiasts Weekly, published every Friday

66,835 3,421 Updated May 30, 2025

👩🏿‍💻👨🏾‍💻👩🏼‍💻👨🏽‍💻👩🏻‍💻A list of projects by independent developers in China -- sharing what everyone is working on

39,349 3,252 Updated Jun 3, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,591 2,576 Updated Apr 30, 2025

Universal cross-platform tokenizers binding to HF and sentencepiece

C++ 341 81 Updated May 31, 2025

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 472 58 Updated Sep 11, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 47,122 5,186 Updated Jun 3, 2025

An out-of-the-box footprint map that lights up the cities and provinces you have visited, with customizable markers. I'll try anything once.

HTML 13 Updated Mar 31, 2025

The stable diffusion webui training aid extension helps you quickly and visually train models such as Lora.

Python 420 51 Updated Mar 28, 2024

Make the WeChat web client usable / Allow the use of WeChat via webpage access

TypeScript 2,102 178 Updated Feb 16, 2025

我的电视 (My TV): live TV software, ready to use once installed

C 32,042 3,584 Updated Jun 20, 2024

LLM deployment project based on MNN. This project has been merged into MNN.

C++ 1,585 172 Updated Jan 20, 2025

Honkai: Star Rail script | Honkai: Star Rail auto bot (Simplified Chinese/Traditional Chinese/English/Español)

Python 3,763 203 Updated May 27, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,627 1,468 Updated Jun 3, 2025

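This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and putting large-model applications into production)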

HTML 18,251 2,145 Updated May 27, 2025

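fastllm is a high-performance large-model inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and mixed-mode inference for MOE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. Deploying the original full-precision DeepSeek model on a dual-socket 9004/9005 server with a single GPU gives 20 tps at single concurrency; the INT4-quantized model gives 30 tps at single concurrency and can reach 60+ with multiple concurrent requests.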

C++ 3,605 368 Updated May 30, 2025

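A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.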

20,237 1,950 Updated May 19, 2025