DS Tools
Missing data visualization module for Python.
Build and share data reports in 100% Python
A terminal spreadsheet multitool for discovering and arranging data
Modin: Scale your Pandas workflows by changing a single line of code
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Open source annotation tool for machine learning practitioners.
Transforms PDFs, Documents and Images into Enriched Structured Data
Discover, try, install and share Streamlit re-usable bits we call "extras"!
Represent, send, store and search multimodal data
Automatically visualize any dataset of any size with a single line of code. Created by Ram Seshadri. Collaborators welcome. Permission granted upon request.
Python package to Extract, Transform and Load Tables of Data
One line of code for data quality profiling and exploratory data analysis of Pandas and Spark DataFrames.
Lightning ⚡️ fast forecasting with statistical and econometric models.
Streamline scikit-learn model comparison.
Python package for AutoML on Tabular Data with Feature Engineering, Hyperparameter Tuning, Explanations and Automatic Documentation
The simplest way to serve AI/ML models in production
Build dashboards in Jupyter Notebook with numeric and chart boxes
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Data & AI Notebook templates catalog organized by tools, following the IMO (input, model, output) framework for easy usage and discovery.
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
🐦 Quickly annotate data from the comfort of your Jupyter notebook
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
All the Fake Data for All Your Real Needs 🙂
Algorithms for outlier, adversarial and drift detection
A Smart, Automatic, Fast and Lightweight Web Scraper for Python