GitHub - Amittank25/DataPipeline: This repository contains example implementations of the components used in a typical data pipeline, along with two implementations of a basic data pipeline that reads data from a source, streams it with Kafka, processes it with Spark, and saves the results to HDFS. It also includes examples of Spark SQL, showing different ways of using Spark SQL and Spark Streaming.



Amittank25/DataPipeline

2 Commits



