8000 stumelius (Simo Tumelius) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View stumelius's full-sized avatar

Block or report stumelius

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 34,576 6,632 Updated Jul 1, 2025

Welcome the "Fast track to Analytics Engineering with dbt" course!

7 3 Updated Nov 22, 2024

The Metadata Platform for your Data and AI Stack

Java 10,802 3,153 Updated Jul 2, 2025

A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.

Python 122 5 Updated Jan 21, 2025

Leveraging standard Python tooling to unit test dbt models

Python 6 Updated Jun 8, 2023

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 54,404 7,187 Updated May 14, 2025

Blazing fast, instant realtime GraphQL APIs on all your data with fine grained access control, also trigger webhooks on database events.

TypeScript 31,588 2,808 Updated Jul 2, 2025

Compare tables within or across databases

Python 2,973 289 Updated May 17, 2024

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Rust 10,321 236 Updated Jul 2, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 40,520 1,417 Updated Jul 2, 2025

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

Python 2,090 94 Updated Mar 29, 2025

dstack is an open-source container orchestrator that simplifies workload orchestration and drives GPU utilization for ML teams. It works with any GPU cloud, on-prem cluster, or accelerated hardware.

Python 1,821 183 Updated Jul 2, 2025

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 86,839 7,548 Updated Jul 2, 2025

Manage terragrunt versions - the tgswitch command line tool lets you switch between different versions of terragrunt

Go 154 35 Updated Apr 19, 2024

Terraform version manager

Shell 4,732 466 Updated Jul 2, 2025

☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes

Go 294 28 Updated Dec 11, 2024

Access and analyze historical weather and climate data with Python.

Python 504 65 Updated Jun 22, 2025

A curated list of awesome dbt resources

1,477 142 Updated Apr 22, 2025

Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.

Jupyter Notebook 31,637 6,719 Updated Jun 27, 2025

sqlfmt formats your dbt SQL files so you don't have to

Python 462 21 Updated May 1, 2025

Reference framework for building data workflows provided by Google. Accelerates authentication, logging, scheduling, and deployment of solutions using GCP. To borrow a tagline.. "The framework for …

Python 171 51 Updated Apr 25, 2024

re_data - fix data issues before your users & CEO would discover them 😊

Python 98 41 Updated May 6, 2024

re_data - fix data issues before your users & CEO would discover them 😊

HTML 1,563 122 Updated Apr 30, 2024

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 11,044 1,760 Updated Jul 3, 2025

Self-serve BI to 10x your data team ⚡️

TypeScript 4,853 580 Updated Jul 2, 2025
Python 23 Updated Jan 3, 2022

A lightweight, object-oriented finite state machine implementation in Python with many extensions

Python 6,110 547 Updated Jul 2, 2025

dbt Cloud command line interface (CLI)

Python 76 9 Updated Mar 8, 2024

do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.

Python 854 76 Updated Apr 5, 2024
Next
0