Lists (15)
Sort Name ascending (A-Z)
Stars
Curated list of some open-source codes for turbulent flow simulations, including turbulent multiphase, turbulent reacting flows, turbulent convection and turbulent atmospheric physics.
A massively parallel, high-level programming language
A game engine currently under development.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
We are committed to the open-sourcing quantitative knowledge, aiming to bridge the information gap between the domestic and international quantitative finance industries.我们致力于量化知识的开源与汉化,打破国内外量化金融行业…
Tile primitives for speedy kernels
Python Implementation of Reinforcement Learning: An Introduction
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas…
flash attention tutorial written in python, triton, cuda, cutlass
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
Development repository for the Triton language and compiler
Hackable and optimized Transformers building blocks, supporting a composable construction.
The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University.
Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.