- Jiangnan University, Wuxi, Jiangsu, CN
-
21:59
(UTC +08:00) - https://daydream0929.github.io/
Highlights
- Pro
Lists (28)
Sort Name ascending (A-Z)
Android
benchmark
blog
C
Cpp
CUDA
CUDA_GA
Database
DeepLearning
EC
GEMM
.gitignore
Inference
kernel
linux
linux/UNIX
llama
llm_inference
LLM inference frameworks.math
mlsys
Nvidia
Parallel
Python
risc-v
rust
vim
XUANTIE
yolov8
some yolov8 examples for human-pose-estimation on C920Starred repositories
手撸解释器教程《Crafting Interpreters》中文翻译
An open-source microcontroller system based on RISC-V
The minimal opencv for Android, iOS, ARM Linux, Windows, Linux, MacOS, HarmonyOS, WebAssembly, watchOS, tvOS, visionOS
Daydream0929 / cutlass
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
High-efficiency floating-point neural network inference operators for mobile, server, and Web
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
Large Language Model Text Generation Inference
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
A machine learning compiler for GPUs, CPUs, and ML accelerators
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Master programming by recreating your favorite technologies from scratch.
A matrix extension proposal for AI applications under RISC-V architecture
经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新
A modular graph-based Retrieval-Augmented Generation (RAG) system
Visualizer for neural network, deep learning and machine learning models