Lists (5)
Sort Name ascending (A-Z)
Stars
Minimalistic 4D-parallelism distributed training framework for education purpose
An open source implementation of CLIP.
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
Synchronize your data across multiple clusters for lower latencies and higher availability
Code examples for my conference talk on implementing ddd with spring
Minimalistic large language model 3D-parallelism training
MINT-1T: A one trillion token multimodal interleaved dataset.
A simple bash script for switching between installed versions of CUDA.
Schedule-Free Optimization in PyTorch
Retrieval and Retrieval-augmented LLMs
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Write scalable load tests in plain Python ππ¨
The latest research progress of Contrastive Learning(CL), Data Augmentation(DA) and Self-Supervised Learning(SSL) in Recommender Systems
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
just a bunch of useful embeddings for scikit-learn pipelines
π Find the k-nearest neighbors (k-NN) for your vector data
Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]
Fast Open-Source Search & Clustering engine Γ for Vectors & π Strings Γ in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram π
A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
A fast implementation of Aho-Corasick in Rust.
Example π Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using π§ Amazon SageMaker.
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Design patterns implemented in Java
Practical concurrency guide in Go, communication by channels, patterns