- Shanghai
Stars
OpenPPL / CuAssembler
Forked from cloudcores/CuAssembler
An unofficial cuda assembler, for all generations of SASS, hopefully :)
graph based intermediate representation and backend for optimising compilers
Verilator open-source SystemVerilog simulator and lint system
a language for fast, portable data-parallel computation
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
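PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training, and cross-platform deployment, for deep learning and machine learning)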
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
Apache Spark - A unified analytics engine for large-scale data processing
Open deep learning compiler stack for cpu, gpu and specialized accelerators
An Open Source Machine Learning Framework for Everyone
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
CUDA integration for Python, plus shiny features
Keystone assembler framework: Core (Arm, Arm64, Hexagon, Mips, PowerPC, Sparc, SystemZ & X86) + bindings
Tensors and Dynamic neural networks in Python with strong GPU acceleration
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.