SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python 19,156 1,950 Updated May 21, 2025

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 77,480 8,518 Updated May 22, 2025

kubernetes-sigs / lws

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 447 77 Updated May 22, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

Cuda 11,564 834 Updated Apr 29, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 3,017 310 Updated May 22, 2025

modelscope / evalscope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,007 109 Updated May 22, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 14,558 1,818 Updated May 22, 2025

skypilot-org / skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 8,124 653 Updated May 22, 2025

KwaiVGI / LivePortrait

Bring portraits to life!

Python 14,891 1,612 Updated Feb 28, 2025

spidernet-io / spiderpool

Underlay and RDMA network solution of the Kubernetes, for bare metal, VM and any public cloud

Go 9976 580 83 Updated May 22, 2025

containerd / stargz-snapshotter

Fast container image distribution plugin with lazy pulling

Go 1,301 123 Updated May 20, 2025

dragonflyoss / nydus

Nydus - the Dragonfly image service, providing fast, secure and easy access to container images.

Rust 1,320 218 Updated Apr 28, 2025

dragonflyoss / dragonfly

Dragonfly is an open source P2P-based file distribution and image acceleration system. It is hosted by the Cloud Native Computing Foundation (CNCF) as an Incubating Level Project.

Go 2,556 317 Updated May 22, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 95,673 12,342 Updated May 22, 2025

run-ai / k8s-launcher

Python 12 1 Updated Aug 1, 2023

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 40,894 4,510 Updated May 22, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 18,485 1,874 Updated May 21, 2025

NirDiamant / RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 16,223 1,609 Updated May 11, 2025

openai / sparse_attention

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,570 191 Updated Aug 12, 2020

Project-HAMi / HAMi

Heterogeneous AI Computing Virtualization Middleware

Go 1,630 298 Updated May 22, 2025

liguodongiot / llm-action

本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）

HTML 17,768 2,091 Updated May 1, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,118 995 Updated May 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

maao maaoBit

Achievements

Achievements

Block or report maaoBit

Stars

harry0703 / MoneyPrinterTurbo

arogozhnikov / einops

explodinggradients / ragas

deepseek-ai / DeepEP

juicedata / juicefs

SafeAILab / EAGLE

huggingface / transformers

kubernetes-sigs / karpenter

searxng / searxng