-
CUDA-Learn-Notes Public
Forked from xlite-dev/LeetCUDA📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Cuda GNU General Public License v3.0 UpdatedMar 16, 2025 -
Awesome-LLM-Inference Public
Forked from xlite-dev/Awesome-LLM-Inference📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, MLA, Parallelism, Prefix-Cache, Chunked-Prefill, etc. 🎉🎉
GNU General Public License v3.0 UpdatedMar 4, 2025 -
Efficient-LLMs-Survey Public
Forked from AIoT-MLSys-Lab/Efficient-LLMs-Survey[TMLR 2024] Efficient Large Language Models: A Survey
UpdatedSep 28, 2024 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryUnify Efficient Fine-Tuning of 100+ LLMs
Python Apache License 2.0 UpdatedMay 27, 2024 -
kubernetes-handbook Public
Forked from rootsongjc/kubernetes-handbookKubernetes中文指南/云原生应用架构实战手册 - https://jimmysong.io/kubernetes-handbook
Shell Creative Commons Attribution 4.0 International UpdatedApr 13, 2024 -
H2O Public
Forked from FMInference/H2O[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Python UpdatedApr 4, 2024 -
FlexGen Public
Forked from FMInference/FlexLLMGenRunning large language models on a single GPU for throughput-oriented scenarios.
Python Apache License 2.0 UpdatedMar 29, 2024 -
Awesome-DL-Scheduling-Papers Public
Forked from S-Lab-System-Group/Awesome-DL-Scheduling-PapersUpdatedJan 22, 2024 -
Individual_Paper_Notes Public
Forked from DicardoX/Research-SpaceThis repository is designed to record personal notes for reading papers.
UpdatedJan 18, 2024 -
Fengshenbang-LM Public
Forked from IDEA-CCNL/Fengshenbang-LMFengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Python Apache License 2.0 UpdatedJun 21, 2023 -
DeepLearningSystem Public
Forked from Infrasys-AI/AISystemDeep Learning System core principles introduction.
Jupyter Notebook Apache License 2.0 UpdatedMar 16, 2023 -
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License UpdatedFeb 19, 2023 -
ai-edu Public
Forked from microsoft/ai-eduAI education materials for Chinese students, teachers and IT professionals.
HTML Other UpdatedJan 17, 2023 -
llvm Public
Forked from intel/llvmIntel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
Other UpdatedJan 1, 2023 -
pai Public
Forked from microsoft/paiResource scheduling and cluster management for AI
JavaScript MIT License UpdatedDec 5, 2022 -
Paddle-Lite Public
Forked from PaddlePaddle/Paddle-LitePaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)
C++ Apache License 2.0 UpdatedNov 17, 2022 -
-
PLCT-Open-Reports Public
Forked from plctlab/PLCT-Open-ReportsPLCT实验室的公开演讲,或者决定公开的组内报告
Creative Commons Attribution Share Alike 4.0 International UpdatedAug 17, 2022 -
CS-Notes Public
Forked from CyC2018/CS-Notes📚 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
UpdatedAug 11, 2022 -
LeetCode-Go Public
Forked from halfrost/LeetCode-Go✅ Solutions to LeetCode by Go, 100% test coverage, runtime beats 100% / LeetCode 题解
Go MIT License UpdatedJul 23, 2022 -
PINTO_model_zoo Public
Forked from PINTO0309/PINTO_model_zooA repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…
Python MIT License UpdatedJul 5, 2022 -
PaddleViT Public
Forked from BR-IDL/PaddleViT🤖 PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
Python Apache License 2.0 UpdatedJun 28, 2022 -
awesome-emdl Public
Forked from csarron/awesome-emdlEmbedded and mobile deep learning research resources
MIT License UpdatedMay 26, 2022 -
Paddle-Lite-Demo Public
Forked from PaddlePaddle/Paddle-Lite-Demolib, demo, model, data
C++ Apache License 2.0 UpdatedMay 11, 2022 -
Paddle Public
Forked from PaddlePaddle/PaddlePArallel Distributed Deep LEarning
C++ Apache License 2.0 UpdatedApr 6, 2022 -
-
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python MIT License UpdatedFeb 23, 2022 -
FleetX Public
Forked from PaddlePaddle/PaddleFleetXPaddle Distributed Training Examples. 飞桨分布式训练示例 Resnet Bert GPT MOE DataParallel ModelParallel PipelineParallel HybridParallel AutoParallel Zero Sharding Recompute GradientMerge Offload AMP DGC Loc…
Shell Apache License 2.0 UpdatedFeb 16, 2022 -
HugeCTR Public
Forked from NVIDIA-Merlin/HugeCTRHugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
C++ Apache License 2.0 UpdatedFeb 11, 2022 -
resume Public
Forked from billryan/resumeAn elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git
TeX MIT License UpdatedDec 15, 2021