8000 GitHub - bythebay/pipeline: Complete Pipeline Training at Big Data Scala By the Bay
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

bythebay/pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pipeline

Join the chat at https://gitter.im/bythebay/pipeline Complete Pipeline Training at Big Data Scala By the Bay

Pipeline Description

Dating ratings data => Akka app => Kafka => Spark Streaming => Cassandra => Dashboard

In addition, Spark MLLib, DataFrames will be demonstrated using a combination of the Cassandra real time data plus static Parquet data, on a notebook interface.

Follow the Wiki to continue exploring -->

About

Complete Pipeline Training at Big Data Scala By the Bay

Resources

Stars

Watchers

Forks

Packages

No packages published

Contributors 7

0