Stars
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
A CLI tool for automated installation and integration of Apache Ranger and AWS EMR with OpenLDAP and Windows AD. It supports both open-source Ranger and EMR-native Ranger, supports OpenLD…
Example code for running Spark and Hive jobs on EMR Serverless.
Best practices and recommendations for getting started with Amazon EMR on EKS.
An approach to GitOps for AWS backing resources, such as databases, using CodePipeline together with Kubernetes via Flux.
Performance optimization for Spark running on Kubernetes.