Stars
Real-world Spark pipeline examples
A curated list of awesome Apache Spark packages and resources.
Spark Gotchas. A subjective compilation of Apache Spark tips and tricks
Generating fake data for the JVM (Java, Kotlin, Groovy) has never been easier!
Design patterns implemented in Scala.
A simple Spark-powered ETL framework that just works 🍺
Event-driven Automation Framework for Kubernetes
Declarative Continuous Deployment for Kubernetes
Dockerized Hadoop/Minio/Hive/Presto stack
Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and configuration of the leading open source big data components.
The repository for the free Scala at Light Speed mini-course
Mopidy Web client extension and hybrid app for mobile devices
An image (SD card) to turn the Raspberry Pi into an easy-to-use MusicBox with Spotify playback and AirTunes streaming
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
Simple, open source, lightweight and privacy-friendly web analytics alternative to Google Analytics.
Kafka GUI for Apache Kafka to manage topics, topic data, consumer groups, schema registry, Connect, and more
Free universal database tool and SQL client
Script to upgrade openmediavault from one major release to the next
The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platfor…
MinIO is a high-performance, S3-compatible object store, open sourced under the GNU AGPLv3 license.