Stars
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
The Malloy Visual Studio Code extension facilitates building Malloy data models, querying and transforming data, and creating simple visualizations and dashboards
Malloy Composer is a simple application to build dashboards or run ad-hoc queries using an existing Malloy model
Python client for the Malloy Publisher API
In this we explore into a Question Answering task on structured relational data (Tables) and CSV data
LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answers questions about your knowledge base.
Polars least squares extension - enables fast linear model polar expressions
A React Application for interacting with Looker data through natural language.
Vizro is a low-code toolkit for building high-quality data visualization apps.
A dbt package with a POC implementation of an interface to query activity streams that adhere to the Activity Schema 2.0 spec.
Repository for the ActivitySchema spec and supporting materials
Modernisation platform environments • This repository is defined and managed in Terraform
Single-topic LDA (DMM) with unsupervised clustering
Scalable and efficient data transformation framework - backwards compatible with dbt.
Port(ish) of Great Expectations to dbt test macros
Mathematical & Statistical topics to perform statistical analysis and tests; Linear Regression, Probability Theory, Monte Carlo Simulation, Statistical Sampling, Bootstrapping, Dimensionality reduc…
Single source of truth is the accumulator of different useful/productive resources about different Computer Science/Software Engineering Topics.
📊 Cube’s universal semantic layer platform is the next evolution of OLAP technology for AI, BI, spreadsheets, and embedded analytics
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service …
Source code accompanying: BigQuery: The Definitive Guide by Lakshmanan & Tigani to be published by O'Reilly Media
🧙 Build, run, and manage data pipelines for integrating and transforming data.