Stars
An easy-to-use Supabase connector for Streamlit that caches your API calls to make querying fast and cheap.
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC ac…
Example files used in the Unity Catalog RBAC blog
FastAPI and SQLAlchemy DDD (Domain Driven Development) Example
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
A highly efficient daemon for streaming data from Kafka into Delta Lake
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
A native Rust library for Delta Lake, with bindings into Python
data load tool (dlt) is an open source Python library that makes data loading easy
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Apache Superset is a Data Visualization and Data Exploration Platform
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Collaborative cheatsheets for console commands
A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
An orchestration platform for the development, production, and observation of data assets.
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
DuckDB is an analytical in-process SQL database management system
Simple Python package to preview and develop streamlit apps in jupyter notebooks
Multi-file boilerplate for Open API Specification
Template for a data contract used in a data mesh.