Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
Production infrastructure for machine learning at scale