Mercari-helper tools
This is simple migration script, migrate pipenv to poetry
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
Converts a trace of Datadog to a sequence diagram of PlantUML/MermaidJS
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Behavioral "black-box" testing for recommender systems
An end-to-end implementation of intent prediction with Metaflow and other cool tools
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
Repository of sample applications for https://vespa.ai, the open big data serving engine
An open-source runtime for composable workflows. Great for AI agents and CI/CD.
a Docker + Kubernetes network trouble-shooting swiss-army container
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
A sidecar app which clones a git repo and keeps it in sync with the upstream.
dbt macros to stage external sources
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
An open source multi-tool for exploring and publishing data
Work with your web service, database, and streaming schemas in a single format.
A toolkit to run Ray applications on Kubernetes