Lists (1)
Sort Name ascending (A-Z)
Stars
OpenAI's Code Interpreter in your terminal, running locally
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transform…
Multi-stage, config driven, SQL based ETL framework using PySpark
This repo contains code examples of processing and analysing data with Apache Spark and Python
Implementing best practices for PySpark ETL jobs and applications.