- USTC
- Hefei, China
- https://guopeng-gpli.github.io/
Stars
Framework for AI on mobile devices and wearables, with a hardware-aware C/C++ backend and wrappers for Kotlin, Java, Swift, React, and Flutter.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
FlashMLA: Efficient MLA decoding kernels
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
A GPU/CUDA implementation of the Hungarian algorithm (a minimal CPU reference sketch follows this list).
Beginner-friendly serverless LLM deployment with Replicate & fly.io
Caribou is a framework for geo-distributed deployment of serverless workflows to reduce carbon emissions.
A LaTeX template for the thesis proposal (开题报告) at the University of Science and Technology of China (USTC).
Code for reproducing the results of the SOSP paper Bagpipe.
Efficient and easy multi-instance LLM serving
📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
PyTorch-based Chinese intent recognition and slot filling.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
BERT-based intent and slot detector for chatbots (see the second sketch after this list).
A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
A curated list of high-quality papers on resource-efficient LLMs 🌱
Serverless LLM Serving for Everyone.
Large Language Model (LLM) Systems Paper List
A curated list for Efficient Large Language Models
Semantic Kernel (SK) is a lightweight SDK enabling integration of AI Large Language Models (LLMs) with conventional programming languages.
🚀 Docker registry proxy: uses GitHub Actions to mirror images from docker.io, gcr.io, registry.k8s.io, k8s.gcr.io, quay.io, ghcr.io, and other overseas registries to China-accessible mirrors for faster downloads.
Secure Transformer Inference is a protocol for serving Transformer-based models securely.
Integrate cutting-edge LLM technology quickly and easily into your apps
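
A brief aside on the GPU/CUDA Hungarian-algorithm project starred above: the snippet below is a minimal CPU reference sketch of the assignment problem that repo accelerates. It uses SciPy's linear_sum_assignment purely for illustration (an assumption of this sketch; the starred repo implements the algorithm itself in CUDA).

```python
# Minimal CPU reference for the assignment problem that the GPU/CUDA
# Hungarian-algorithm repo above accelerates (sketch only; SciPy is an
# illustrative assumption, not part of that repo).
import numpy as np
from scipy.optimize import linear_sum_assignment

# cost[i][j] = cost of assigning worker i to task j
cost = np.array([
    [4, 1, 3],
    [2, 0, 5],
    [3, 2, 2],
])

rows, cols = linear_sum_assignment(cost)  # minimum-cost perfect matching
print(list(zip(rows, cols)))              # worker 0→task 1, 1→task 0, 2→task 2
print(cost[rows, cols].sum())             # total cost: 1 + 2 + 2 = 5
```

The Hungarian algorithm runs in O(n^3) time, which is why GPU implementations become attractive once cost matrices grow large.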
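Similarly, for the BERT-based intent and slot detection projects starred above, here is a hedged sketch of the two subtasks using Hugging Face transformers pipelines. The specific checkpoints (facebook/bart-large-mnli, dslim/bert-base-NER) and the zero-shot/NER framing are assumptions for illustration, not what those repos actually use.

```python
# Hedged sketch of intent detection + slot filling with Hugging Face
# transformers (illustrative checkpoints; the starred repos train their
# own BERT-based joint models instead).
from transformers import pipeline

utterance = "book a flight to Hefei tomorrow"

# Intent detection framed as zero-shot sequence classification.
intent = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
print(intent(utterance,
             candidate_labels=["book_flight", "play_music", "weather"]))

# Slot filling framed as token classification (NER as a stand-in for slots).
slots = pipeline("token-classification", model="dslim/bert-base-NER",
                 aggregation_strategy="simple")
print(slots(utterance))  # entity spans, e.g. a location span for "Hefei"
```

A production intent/slot detector typically shares one BERT encoder between both heads; the two separate pipelines here just make the subtasks easy to see.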