Stars
A REST API Structured Streaming DataSource
Code base for the Learning PySpark book (in preparation)
ajbd2106 / hive-benchmark
Forked from kcheeeung/hive-benchmark
Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP
Use the TPC-DS benchmark to test Spark SQL performance
Java Tutorial For Beginners with 500 Code Examples
Example blueprint application for processing high-speed trading data.
Simple Spark streaming example using a Java sample data generator
ajbd2106 / geoJSON_to_Cassandra_using_Spark
Forked from simonambridge/geoJSON_to_Cassandra_using_Spark
Saving geoJSON Data To Cassandra Using User-Defined Types, Spark Dataframes and Spark SQL
A description of the processes and techniques required to migrate a relational schema to a Cassandra database using Spark and SparkSQL
ajbd2106 / RTFAP2
Forked from simonambridge/RTFAP2
Real-Time Fraud Analysis and Prevention Using Kafka, Spark and Cassandra with a nodejs ReST Server
iandow / bb-mapr-demo
Forked from bigboards/bb-stack-training
A workshop focused on building a big data processing pipeline.
The purpose of this tiny project is to put together the know-how that I learned from the Big Data Expert course at formacionhadoop.com. The idea is to show how to play with apache spar…
A scalable, cloud-ready environment for Data Science using Docker