- Greater Seattle Area
Starred repositories
Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
Sample code for the Twitter API v2 endpoints
Spark: The Definitive Guide's Code Repository
Home of the 2020 SIS Analytics Challenge
Code for the Data Science From Scratch book
Official repo for the #tidytuesday project
A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing, and Data Lake development.
Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
Programmatically collect normalized news from (almost) any website.