8000 ver217 (Hongxin Liu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ver217's full-sized avatar
💪
💪
  • Shanghai
  • 19:07 (UTC +08:00)

Organizations

@UniqueStudio @HUSTMeituanClub-Web @UNIQUE-AILAB

Block or report ver217

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,315 586 Updated May 9, 2025

Zero Bubble Pipeline Parallelism

Python 388 26 Updated May 7, 2025

Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*

Jupyter Notebook 123 12 Updated Oct 27, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 81,522 9,795 Updated Jan 4, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,463 1,420 Updated May 12, 2025

Stable Diffusion web UI

Python 152,298 28,334 Updated May 3, 2025

WebUI extension for ControlNet

Python 17,604 2,017 Updated Aug 12, 2024

Fast and memory-efficient exact attention

Python 17,317 1,676 Updated May 8, 2025

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,486 399 Updated Jul 16, 2023

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,566 97 Updated Feb 16, 2024

天涯 kkndme 神贴聊房价

19,022 3,841 Updated Aug 27, 2023

Accessible large language models via k-bit quantization for PyTorch.

Python 7,000 693 Updated May 9, 2025

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,102 168 Updated Mar 27, 2024

4 bits quantization of LLaMA using GPTQ

Python 3,050 460 Updated Jul 13, 2024

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 43,552 5,612 Updated May 11, 2025

A collection of libraries to optimise AI model performances

Python 8,373 636 Updated Jul 22, 2024

Code plagiarism detection tool

Python 295 40 Updated Apr 6, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 144,200 28,910 Updated May 12, 2025

A cloud-native Go microservices framework with cli tool for productivity.

Go 30,942 4,103 Updated May 11, 2025

A simple, efficient and powerful micro front-end framework. 一款简约、高效、功能强大的微前端框架

CSS 5,918 594 Updated May 12, 2025

Workrave is a program that assists in the recovery and prevention of Repetitive Strain Injury (RSI). The program frequently alerts you to take micro-pauses, rest breaks and restricts you to your da…

C++ 1,651 205 Updated May 10, 2025

Implementation examples of module federation , by the creators of module federation

JavaScript 5,889 1,806 Updated Mar 7, 2025

Create React App boilerplate with React 17, Webpack 5, Tailwind 2, Module Federation, and TypeScript.

JavaScript 81 12 Updated Sep 6, 2024

micro-app 案例

JavaScript 241 128 Updated Feb 5, 2025

A GUI client for Windows, Linux and macOS, support Xray and sing-box and others

C# 79,559 12,614 Updated May 11, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,448 3,493 Updated May 12, 2025

Profiling and inspecting memory in pytorch

Python 1,057 37 Updated Aug 6, 2024

Making large AI models cheaper, faster and more accessible

Python 40,871 4,506 Updated May 8, 2025

Scalable PaLM implementation of PyTorch

Python 190 27 Updated Dec 19, 2022

Optimizing AlphaFold Training and Inference on GPU Clusters

Python 604 90 Updated Jul 16, 2024
Next
0