-
server Public
Forked from triton-inference-server/serverThe Triton Inference Server provides an optimized cloud and edge inferencing solution.
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 12, 2025 -
fil_backend Public
Forked from triton-inference-server/fil_backendFIL backend for the Triton Inference Server
Jupyter Notebook Apache License 2.0 UpdatedMay 12, 2025 -
onnxruntime_backend Public
Forked from triton-inference-server/onnxruntime_backendThe Triton backend for the ONNX Runtime.
C++ BSD 3-Clause "New" or "Revised" License UpdatedMay 12, 2025 -
vllm_backend Public
Forked from triton-inference-server/vllm_backendPython BSD 3-Clause "New" or "Revised" License UpdatedMay 9, 2025 -
dali_backend Public
Forked from triton-inference-server/dali_backendThe Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
C++ MIT License UpdatedMay 7, 2025 -
python_backend Public
Forked from triton-inference-server/python_backendTriton backend that enables pre-process, post-processing and other logic to be implemented in Python.
C++ BSD 3-Clause "New" or "Revised" License UpdatedMay 6, 2025 -
onnxruntime Public
Forked from microsoft/onnxruntimeONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++ MIT License UpdatedMay 6, 2025 -
LeetCUDA Public
Forked from xlite-dev/LeetCUDA📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥
Cuda GNU General Public License v3.0 UpdatedApr 28, 2025 -
aibrix Public
Forked from vllm-project/aibrixCost-efficient and pluggable Infrastructure components for GenAI inference
Jupyter Notebook Apache License 2.0 UpdatedFeb 22, 2025 -
Parallel-Computing-Cuda-C Public
Forked from CisMine/Parallel-Computing-Cuda-CCUDA Learning guide
Cuda UpdatedJun 20, 2024 -
client Public
Forked from triton-inference-server/clientTriton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
C++ BSD 3-Clause "New" or "Revised" License UpdatedMay 31, 2024 -
punica Public
Forked from punica-ai/punicaServing multiple LoRA finetuned LLM as one
Python Apache License 2.0 UpdatedMay 8, 2024 -
-
S-LoRA Public
Forked from S-LoRA/S-LoRAS-LoRA: Serving Thousands of Concurrent LoRA Adapters
Python Apache License 2.0 UpdatedJan 21, 2024 -
Android-Stable-diffusion-ONNX Public
Forked from ZTMIDGO/Android-Stable-diffusion-ONNX使用Android手机的CPU推理stable diffusion
Java UpdatedMay 22, 2023 -
TNN Public
Forked from Tencent/TNNTNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its…
C++ Other UpdatedApr 27, 2023 -
NewsManagement Public
公司门户网站后台新闻管理系统(SSM, Ueditor, Jquery, bootstrap)
-
k8s_awesome_document Public
Forked from 0voice/k8s_awesome_document【2021年新鲜出炉】K8s(Kubernetes)的工程师资料合辑,书籍推荐,面试题,精选文章,开源项目,PPT,视频,大厂资料
UpdatedNov 4, 2021 -
reading-source-code-of-nginx-1.19.10 Public
Forked from SmartKeyerror/reading-source-code-of-nginx-1.19.10nginx-1.19.10 源码阅读,分析关键组件与核心运转流程, 并使用图例进行描述
C UpdatedJun 25, 2021 -
-
-
CS-Book Public
Forked from iamshuaidi/CS-Book计算机类常用电子书整理,并且附带下载链接,包括Java,Python,Linux,Go,C,C++,数据结构与算法,人工智能,计算机基础,面试,设计模式,数据库,前端等书籍
UpdatedOct 13, 2020 -
-
-
advanced-java Public
Forked from doocs/advanced-java😮 互联网 Java 工程师进阶知识完全扫盲:涵盖高并发、分布式、高可用、微服务、海量数据处理等领域知识,后端同学必看,前端同学也可学习
Java Creative Commons Attribution Share Alike 4.0 International UpdatedApr 26, 2020