8000 ver217 (Hongxin Liu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ver217's full-sized avatar
💪
💪
  • Shanghai
  • 20:42 (UTC +08:00)

Organizations

@UniqueStudio @HUSTMeituanClub-Web @UNIQUE-AILAB

Block or report ver217

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,497 636 Updated Jul 2, 2025

Zero Bubble Pipeline Parallelism

Python 401 26 Updated May 7, 2025

Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*

Jupyter Notebook 125 12 Updated Oct 27, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 84,290 10,272 Updated Jun 26, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,923 1,546 Updated Jul 3, 2025

Stable Diffusion web UI

Python 154,109 28,626 Updated May 3, 2025

WebUI extension for ControlNet

Python 17,707 2,020 Updated Aug 12, 2024

Fast and memory-efficient exact attention

Python 18,160 1,778 Updated Jul 3, 2025

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,502 401 Updated Jul 16, 2023

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,575 98 Updated Feb 16, 2024

天涯 kkndme 神贴聊房价

19,065 3,849 Updated Aug 27, 2023

Accessible large language models via k-bit quantization for PyTorch.

Python 7,183 712 Updated Jul 2, 2025

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,135 176 Updated Mar 27, 2024

4 bits quantization of LLaMA using GPTQ

Python 3,053 459 Updated Jul 13, 2024

LLM UI with advanced features, easy setup, and multiple backend support.

Python 44,188 5,682 Updated Jul 1, 2025

A collection of libraries to optimise AI model performances

Python 8,372 635 Updated Jul 22, 2024

Code plagiarism detection tool

Python 300 40 Updated Apr 6, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 146,376 29,506 Updated Jul 3, 2025

A cloud-native Go microservices framework with cli tool for productivity.

Go 31,346 4,154 Updated Jul 2, 2025

A simple, efficient and powerful micro front-end framework. 一款简约、高效、功能强大的微前端框架

CSS 5,968 605 Updated Jun 11, 2025

Workrave is a program that assists in the recovery and prevention of Repetitive Strain Injury (RSI). The program frequently alerts you to take micro-pauses, rest breaks and restricts you to your da…

C++ 1,663 207 Updated Jun 29, 2025

Implementation examples of module federation , by the creators of module federation

JavaScript 5,919 1,822 Updated Mar 7, 2025

Create React App boilerplate with React 17, Webpack 5, Tailwind 2, Module Federation, and TypeScript.

JavaScript 81 12 Updated Sep 6, 2024

micro-app 案例

JavaScript 245 133 Updated Feb 5, 2025

A GUI client for Windows, Linux and macOS, support Xray and sing-box and others

C# 82,403 12,865 Updated Jul 2, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,706 3,531 Updated Jul 3, 2025

Profiling and inspecting memory in pytorch

Python 1,061 37 Updated Aug 6, 2024

Making large AI models cheaper, faster and more accessible

Python 41,006 4,521 Updated Jul 2, 2025

Scalable PaLM implementation of PyTorch

Python 190 27 Updated Dec 19, 2022

Optimizing AlphaFold Training and Inference on GPU Clusters

Python 604 90 Updated Jul 16, 2024
Next
0