10000 Stars
Official inference framework for 1-bit LLMs
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Convert PDF to markdown + JSON quickly with high accuracy
Lab Materials for MIT 6.S191: Introduction to Deep Learning
100% FREE, Private (No Internet) DeepSeek’s Advanced RAG: Boost Your RAG Chatbot: Hybrid Retrieval (BM25 + FAISS) + Neural Reranking + HyDE 🚀
A high-throughput and memory-efficient inference and serving engine for LLMs
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Python Script to Download YouTube Shorts
smart-llm-loader is a lightweight yet powerful Python package that transforms any document into LLM-ready chunks. Spend less time on preprocessing headaches and more time building what matters. Fro…
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A library for efficient similarity search and clustering of dense vectors.
Janus-Series: Unified Multimodal Understanding and Generation Models
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Automate the process of making money online.
LaTeX one-page resume based on posquit0/Awesome-CV
Master programming by recreating your favorite technologies from scratch.
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
Collection of Summer 2025 tech internships!
A Collection of application ideas which can be used to improve your coding skills.
An all-purpose window upscaler for Windows 10/11.