-
ScaleLLM Public
Forked from vectorch-ai/ScaleLLMA high-performance inference system for large language models, designed for production environments.
C++ Apache License 2.0 UpdatedApr 16, 2025 -
flux Public
Forked from bytedance/fluxA fast communication-overlapping library for tensor/expert parallelism on GPUs.
C++ Apache License 2.0 UpdatedMar 12, 2025 -
awesome-cuda-triton-hpc Public
Forked from coderonion/awesome-cuda-and-hpc🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR and High Performance Computing (HPC) projects.
UpdatedJan 29, 2025 -
pdfium-binaries Public
Forked from bblanchon/pdfium-binaries📰 Binary distribution of PDFium
Shell UpdatedJan 27, 2025 -
heaptrack Public
Forked from KDE/heaptrackA heap memory profiler for Linux
C++ UpdatedJan 26, 2025 -
awesome-gemm Public
Forked from yuninxia/awesome-gemm📚 A curated list of awesome matrix-matrix multiplication (A * B = C) frameworks, libraries and software
MIT License UpdatedDec 21, 2024 -
albumentations Public
Forked from albumentations-team/albumentationsFast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
-
Clipper2 Public
Forked from AngusJohnson/Clipper2Polygon Clipping and Offsetting - C++, C# and Delphi
C++ Boost Software License 1.0 UpdatedOct 14, 2024 -
-
backward-cpp Public
Forked from bombela/backward-cppA beautiful stack trace pretty printer for C++
C++ MIT License UpdatedJun 24, 2024 -
CRCpp Public
Forked from d-bahr/CRCppEasy to use and fast C++ CRC library.
C++ Other UpdatedApr 23, 2024 -
inferflow Public
Forked from inferflow/inferflowInferflow is an efficient and highly configurable inference engine for large language models (LLMs).
C++ MIT License UpdatedJan 16, 2024 -
uni-algo Public
Forked from uni-algo/uni-algoUnicode Algorithms Implementation for C/C++
C++ Other UpdatedJan 5, 2024 -
libassert Public
Forked from jeremy-rifkin/libassertThe most over-engineered and overpowered C++ assertion library.
C++ MIT License UpdatedDec 7, 2023 -
perf-book Public
Forked from dendibakh/perf-bookThe book "Performance Analysis and Tuning on Modern CPU"
TeX Creative Commons Zero v1.0 Universal UpdatedDec 4, 2023 -
spconv Public
Forked from traveller59/spconvSpatial Sparse Convolution Library
Python Apache License 2.0 UpdatedOct 7, 2023 -
MPMCQueue Public
Forked from rigtorp/MPMCQueueA bounded multi-producer multi-consumer concurrent queue written in C++11
C++ MIT License UpdatedSep 18, 2023 -
INT8-Flash-Attention-FMHA-Quantization Public
Forked from jundaf2/INT8-Flash-Attention-FMHA-QuantizationCuda UpdatedSep 15, 2023 -
flash_attention_inference Public
Forked from ShaYeBuHui01/flash_attention_inferencePerformance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
C++ MIT License UpdatedAug 31, 2023 -
onnxruntime Public
Forked from microsoft/onnxruntimeONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++ MIT License UpdatedAug 3, 2023 -
cpptrace Public
Forked from jeremy-rifkin/cpptraceLightweight, zero-configuration-required, and cross-platform stacktrace library for C++
C++ MIT License UpdatedJul 29, 2023 -
How_to_optimize_in_GPU Public
Forked from Liu-xiandong/How_to_optimize_in_GPUThis is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
Cuda Apache License 2.0 UpdatedJul 29, 2023 -
lmdeploy Public
Forked from InternLM/lmdeployLMDeploy is a toolkit for compressing, deploying, and serving LLM
C++ Apache License 2.0 UpdatedJul 28, 2023 -
arm-gcc-inline-assembler Public
Forked from chunhuajiang/arm-gcc-inline-assemblerARM GCC 内联汇编参考手册 - 中文版
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJun 20, 2023 -
tokenizers-cpp Public
Forked from mlc-ai/tokenizers-cppUniversal cross-platform tokenizers binding to HF and sentencepiece
C++ Apache License 2.0 UpdatedJun 3, 2023 -
cudabmk Public
Forked from spthm/cudabmkSource for Demystifying GPU Microarchitecture through Microbenchmarking
Cuda UpdatedMay 29, 2023 -
excalidraw Public
Forked from excalidraw/excalidrawVirtual whiteboard for sketching hand-drawn like diagrams
-
-
sentry-native Public
Forked from getsentry/sentry-nativeSentry SDK for C, C++ and native applications.
C MIT License UpdatedApr 21, 2023