8000 maaoBit (maao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View maaoBit's full-sized avatar

Block or report maaoBit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 34,169 4,818 Updated May 16, 2025

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 8,929 371 Updated Apr 25, 2025

Supercharge Your LLM Application Evaluations 🚀

Python 9,220 917 Updated May 17, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,677 770 Updated May 19, 2025

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 11,612 1,027 Updated May 22, 2025

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.

Python 1,257 145 Updated May 18, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 144,653 29,042 Updated May 22, 2025

Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.

Go 989 272 Updated May 22, 2025

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python 19,156 1,950 Updated May 21, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 77,480 8,518 Updated May 22, 2025

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 447 77 Updated May 22, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,564 834 Updated Apr 29, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 3,017 310 Updated May 22, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,007 109 Updated May 22, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,558 1,818 Updated May 22, 2025

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 8,124 653 Updated May 22, 2025

Bring portraits to life!

Python 14,891 1,612 Updated Feb 28, 2025

Underlay and RDMA network solution of the Kubernetes, for bare metal, VM and any public cloud

Go 9976 580 83 Updated May 22, 2025

Fast container image distribution plugin with lazy pulling

Go 1,301 123 Updated May 20, 2025

Nydus - the Dragonfly image service, providing fast, secure and easy access to container images.

Rust 1,320 218 Updated Apr 28, 2025

Dragonfly is an open source P2P-based file distribution and image acceleration system. It is hosted by the Cloud Native Computing Foundation (CNCF) as an Incubating Level Project.

Go 2,556 317 Updated May 22, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 95,673 12,342 Updated May 22, 2025
Python 12 1 Updated Aug 1, 2023

Making large AI models cheaper, faster and more accessible

Python 40,894 4,510 Updated May 22, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 18,485 1,874 Updated May 21, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 16,223 1,609 Updated May 11, 2025

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,570 191 Updated Aug 12, 2020

Heterogeneous AI Computing Virtualization Middleware

Go 1,630 298 Updated May 22, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 17,768 2,091 Updated May 1, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,118 995 Updated May 21, 2025
Next
0