Stars
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Streamlit — A faster way to build and share data apps.
A curated list of Rust code and resources.
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tuplex has similar Python APIs to Apache Spark or Dask, but rath…
Efficiently computes derivatives of NumPy code.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
faster-cpython / cpython
Forked from python/cpythonThe Python programming language
Cinder is Meta's internal performance-oriented production version of CPython.
Backport of multiprocessing.shared_memory in Python 3.8
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)
🔥 2D and 3D Face alignment library build using pytorch
The fundamental package for scientific computing with Python.
Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more
Machinery for building and testing Python Wheels for Linux, OSX and (less flexibly) Windows.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
ODPS Python SDK and data analysis framework
The Python micro framework for building web applications.