Stars
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Building a modern alternative to Salesforce, powered by the community.
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
Build and share delightful machine learning apps, all in Python. Star to support our work!
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
Scalable and efficient data transformation framework - backwards compatible with dbt.
This repo hosts the course content of Customer Analytics, taught at Tilburg University by George Knox; last taught in Fall 2022.
A Comprehensive Database on the FIFA World Cup (Men's and Women's)
Curated list of resources about Apache Airflow
Apache Spark - A unified analytics engine for large-scale data processing
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
A curated list of engineering blogs
A curated list of data engineering tools for software developers
Our implementation of the LSTM version of Deep Knowledge Tracing (DKT)
Python implementation of Bayesian Knowledge Tracing and extensions
Data, Analytics & AI. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
Accumulated knowledge and experience in the field of Data Engineering
Fast and Lightweight Logs, Metrics and Traces processor for Linux, BSD, OSX and Windows
Kafka Streams Topology Sketch Diagram Visualization
Sync PostgreSQL to Elasticsearch via Debezium
Scripts and samples to support Confluent Demos, Talks, and Blogs. Not all of the examples in this repository are kept up to date. For automated tutorials and QA'd code, see https://github.com/confl…
Open-Source Web UI for Apache Kafka Management
Java Language Support for Visual Studio Code