- Tutte Institute for Mathematics and Computing
- Ottawa, ON, Canada
Stars
A fast multi-core implementation of HDBSCAN for low dimensional Euclidean spaces
Empirical study on high dimensional clustering
A visual labeling system implemented in Jupyter widgets.
Clustering sentence embeddings to extract message intent
Optimize clustering labels using Silhouette Score.
STUMPY is a powerful and scalable Python library for modern time series analysis
Vectorizers for a range of different data types
Apache Superset is a Data Visualization and Data Exploration Platform
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
The NumFOCUS DISCOVER Cookbook (Diverse & Inclusive Spaces and Conferences: Overall Vision and Essential Resources). A guide for organizing more diverse and inclusive events and conferences, produc…
Poisson Binomial Probability Distribution for Python
A Python nearest neighbor descent for approximate nearest neighbors
A game theoretic approach to explain the output of any machine learning model.
Fast, flexible and easy to use probabilistic modelling in Python.
Sequential model-based optimization with a `scipy.optimize` interface
Semi-Supervised t-SNE using a Bayesian prior based on partial labelling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
A high performance implementation of HDBSCAN clustering.