Stars
Uplift modeling and causal inference with machine learning algorithms
A Python library that helps data scientists to infer causation rather than observing correlation.
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
Use fastai-v2 with HuggingFace's pretrained transformers
The fastai book, published as Jupyter Notebooks
Med-BERT, contextualized embedding model for structured EHR data
Fit interpretable models. Explain blackbox machine learning.
A high-level Python library for Quantum Natural Language Processing
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Azure Machine Learning Lab Notebooks
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
This is an extractive based text summarization.
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
A modern Python application packaging and distribution tool
Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.
LSTM and QRNN Language Model Toolkit for PyTorch
(Deprecated) Scikit-learn integration package for Apache Spark
TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
Demonstration code for MLeap, both Jupyter notebooks and projects
Models and examples built with TensorFlow
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
Follow the tutorial for this project over at the official Hortonworks Tutorial: https://hortonworks.com/tutorial/storm-in-trucking-iot-on-hdf/
Apache Superset is a Data Visualization and Data Exploration Platform