Starred repositories
A curated list of awesome works related to high dimensional structure/vector search & database
an algorithm to solve the similarity join problem
Language Models as Multi-Modal Query Planners
Seq2seq transformer for polynomial expansion in PyTorch.
📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python
🏆 The winner code for ACM SIGMOD 2023 Programming Contest, can build highly accurate KNN graphs efficiently
Focus on Database kernel Development, include Basic Skill Content.
Build a distributed SQL database from the ground up
Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.
A movie recommendation model based on the MovieLens dataset.
Query-Aware LSH for Approximate NNS (In-Memory Version of QALSH)
Query-Aware LSH for Approximate NNS (PVLDB 2015 and VLDBJ 2017)
TinyDB is a lightweight document oriented database optimized for your happiness :)
A curated list of awesome PostgreSQL software, libraries, tools and resources, inspired by awesome-mysql
A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing
Automated Query Expansion using High Dimensional Clustering
Code for converting a photo to sketch-style image
Ultra fast JSON decoder and encoder written in C with Python bindings
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training
Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)