Stars
Docker Image for Spark History Server
Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs
Operator for Apache Spark-on-Kubernetes for Stackable Data Platform
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
A better notebook for Scala (and more)
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
CMAK is a tool for managing Apache Kafka clusters
🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaSc…
A UI dashboard that allows CRUD operations on Zookeeper.
Apache Superset is a Data Visualization and Data Exploration Platform
A toolkit for querying and interacting with Big Data
Extensions, custom & experimental panels
JSON Logging Format Libraries for Python (and handlers for ElasticSearch and MongoDB)
Easy & Flexible Alerting With ElasticSearch