8000 dhanuja-k / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View dhanuja-k's full-sized avatar

Block or report dhanuja-k

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.

TypeScript 161 16 Updated Nov 26, 2024

A collective list of free APIs

Python 340,182 35,897 Updated Oct 31, 2024

🐍 Quick reference guide to common patterns & functions in PySpark.

541 172 Updated Feb 21, 2023

PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster

Python 457 224 Updated Oct 15, 2024

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

Scala 522 137 Updated Mar 16, 2022

A set of exercises to prepare for Certified Kubernetes Application Developer exam by Cloud Native Computing Foundation

9,314 5,764 Updated Mar 21, 2025

A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups

30,650 1,871 Updated May 13, 2025

Inference code for Llama models

Python 58,244 9,771 Updated Jan 26, 2025

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 20,124 2,826 Updated May 12, 2025

In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our cloud provider.

29 13 Updated Dec 18, 2023

This repository contains the notebooks and presentations we use for our Databricks Tech Talks

HTML 714 437 Updated Jan 6, 2025

A topic-centric list of HQ open datasets.

63,130 10,115 Updated Nov 13, 2024

A list of publicly available datasets with real-time data maintained by the team at bytewax.io

783 39 Updated Feb 25, 2025

My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on ​lambda architecture​, that aggregates Twitter and US stock market data for user sentiment anal…

Scala 506 128 Updated Aug 24, 2022

Jupyter notebooks for the code samples of the book "Deep Learning with Python"

Jupyter Notebook 19,186 8,825 Updated May 1, 2025

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 9,153 1,014 Updated May 12, 2025

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Python 45,419 18,495 Updated May 16, 2025

Roadmap to becoming a data engineer in 2021

12,627 1,349 Updated Jan 25, 2022

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 43,226 3,998 Updated May 16, 2025

The Game Analytics Pipeline is a customer deployable reference architecture to help game developers ingest, store, and analyze telemetry data from games and services.

JavaScript 43 31 Updated Jan 9, 2024

ETL with Python - Taught at DWH course 2017 (TAU)

Jupyter Notebook 103 54 Updated Aug 29, 2017

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 20,144 4,779 Updated May 12, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 299,512 49,685 Updated Dec 2, 2024

A collection of learning resources for curious software engineers

Python 47,616 3,783 Updated May 12, 2025
0