-
Hinge Health
- Surat, Gujarat
-
fast-dedupe Public
Forked from ayushgupta4897/fast-dedupe
A minimalist but optimized Python package for deduplication tasks leveraging RapidFuzz internally, enabling super-fast approximate duplicate detection within a dataset with minimal config.
Python MIT License Updated Mar 25, 2025 -
-
geopolitics_gdelt_insights Public
Analysis of geopolitical themes using the open-source GDELT API, with a flavour of distributed systems and artificial intelligence.
-
de_03_data_skew_mitigation Public
Demonstrates and mitigates data skew issues in distributed systems using Apache Spark.
Updated Jun 22, 2024 -
PyScrub Public
Built on the powerful PySpark framework and packaged within a Docker environment, PyScrub offers a scalable, flexible, and efficient way to cleanse and profile large datasets. Whether you're dealin…
-
Databricks-Certified-Data-Engineer-Professional Public
Forked from derar-alhussein/Databricks-Certified-Data-Engineer-Professional
The resources of the preparation course for Databricks Data Engineer Professional certification exam
Python Updated Dec 9, 2023 -
de_02_data_profiling Public
Data Profiling for better Data Quality tracking, using Spark over Distributed Systems. Aviation data sourced from AirLabs API.
Python Updated Nov 2, 2023 -
de_01_data_profiling Public
Data Profiling for better Data Quality tracking, using Spark over Distributed Systems. Aviation data sourced from AirLabs API.
Updated Oct 22, 2023 -
de_01_data_dedup Public
Data deduplication and its need in data engineering.
-
chatgpt-wrapper Public
Forked from llm-workflow-engine/llm-workflow-engine
API for interacting with ChatGPT using Python and from Shell.
Python MIT License Updated Dec 14, 2022 -
nominatim-docker Public
Forked from mediagis/nominatim-docker
100% working container for Nominatim
Shell Creative Commons Zero v1.0 Universal Updated Aug 27, 2022 -
personal-training Public
Personal training. References: https://docs.microsoft.com/en-us/azure/cognitive-services/form-recognizer/quickstarts/python-receipts
Python Updated Mar 6, 2020 -
spotify-playlist-analysis Public
Analyzing my personal Spotify playlists: exploratory analysis and predictive analysis suggesting likeable/not-so-likeable songs
-
LyricsX Public
Forked from ddddxxx/LyricsX
🎶 Lyrics for iTunes, Spotify, Vox and Audirvana Plus.
Swift GNU General Public License v3.0 Updated Nov 18, 2019 -
-
-
YouTube-video-analytics Public
Analyzing KPIs like sentiment ratio, user engagement, and views per category using YouTube's official metadata and historical data
-
Building a movie recommender on TensorFlow using collaborative filtering techniques (matrix factorization)
Updated Nov 9, 2018 -
Twitter-ETL-using-Spark Public
Forked from IamDocxy/Twitter-ETL-using-Spark
Extracted and transformed unstructured Twitter data in JSON format into a structured tabular format using Spark DataFrames and stored the data in Parquet format in an AWS S3 bucket for analyzing with …
-
spotifyreact Public
Forked from anantpanthri/spotifyreact
A React.js-based desktop web application that uses the Spotify developer API for real-time albums
JavaScript Updated Mar 4, 2018 -
models Public
Forked from tensorflow/models
Models and examples built with TensorFlow
Python Apache License 2.0 Updated Nov 20, 2017 -
hospitalMedicareAnalysis Public
Analyzing hospital Medicare data across states in North America
Updated Oct 30, 2017 -
spark-stratifier Public
Forked from interviewstreet/spark-stratifier
Stratified Cross Validator for Spark
Python Updated Oct 24, 2017 -
-
MachineLearningWithTensorflow Public
Forked from sruti-jain/MachineLearningWithTensorflow
Jupyter Notebook Updated Sep 23, 2017 -
IPEDS_dataVisualization Public
IPEDS data visualization using ggplot2 (R Markdown file)
HTML Updated Sep 20, 2017 -
star-tech-web-dev Public
An events website built for a live client to attract local talent in Gandhinagar, India.
Updated Jul 29, 2017 -
bubble_chart_v4 Public
Forked from vlandham/bubble_chart_v4
d3v4 implementation of bubble charts.
JavaScript Other Updated Jul 25, 2017 -
-
TravelExpress Public
Forked from anantpanthri/TravelExpress
Travel Express is a travel-based Java web application that allows a registered user to book hotels and a convenient mode of transportation to their destination. A user can enter any USA county …
Java Updated May 17, 2017