- data&mie Oy
- Finland
- https://datamie.fi
- in/simo-tumelius-00a27a162
Stars
This is a repo with links to everything you'd ever want to learn about data engineering
Welcome to the "Fast track to Analytics Engineering with dbt" course!
The Metadata Platform for your Data and AI Stack
A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.
Leveraging standard Python tooling to unit test dbt models
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Blazing fast, instant realtime GraphQL APIs on all your data with fine-grained access control; also trigger webhooks on database events.
Compare tables within or across databases
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
An extremely fast Python linter and code formatter, written in Rust.
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
dstack is an open-source container orchestrator that simplifies workload orchestration and drives GPU utilization for ML teams. It works with any GPU cloud, on-prem cluster, or accelerated hardware.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Manage terragrunt versions - the tgswitch command line tool lets you switch between different versions of terragrunt
☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes
Access and analyze historical weather and climate data with Python.
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
sqlfmt formats your dbt SQL files so you don't have to
Reference framework for building data workflows provided by Google. Accelerates authentication, logging, scheduling, and deployment of solutions using GCP. To borrow a tagline.. "The framework for …
re_data - fix data issues before your users & CEO would discover them 😊
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Self-serve BI to 10x your data team ⚡️
A lightweight, object-oriented finite state machine implementation in Python with many extensions
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.