Stars
A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your ML and analytics workloads.
An extensible, state of the art columnar file format. Formerly at @spiraldb, now part of the Linux Foundation.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
A vendor neutral GPU multiplexing tool driven by VFIO & YAML.
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
Staging area for ongoing enhancements to Ray focused on improving integration with AWS and other Amazon technologies.
Modin: Scale your Pandas workflows by changing a single line of code
A toolkit to run Ray applications on Kubernetes
Convenient pyarrow operations following the Pandas API
Secure and fast microVMs for serverless computing.
a fast, scalable, multi-language and extensible build system
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.