Stars
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
🔎 Open source distributed and RESTful search engine.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Repository for the book "Crafting Interpreters"
Alpaca dataset from Stanford, cleaned and curated
A compact LLM pretrained in 9 days by using high quality data
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
An extremely fast Python type checker and language server, written in Rust.
An open-source C++ library developed and used at Facebook.
Tool for generating high quality Synthetic datasets
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
All Cursor AI's official download links for both the latest and older versions, making it easy for you to update, downgrade, and choose any version. 🚀
Learn Go with test-driven development
💯 Go Struct and Field validation, including Cross Field, Cross Struct, Map, Slice and Array diving
A WhatsApp client library for NodeJS that connects through the WhatsApp Web browser app
A curated list of awesome Recommender System (Books, Conferences, Researchers, Papers, Github Repositories, Useful Sites, Youtube Videos)
llama.cpp fork with additional SOTA quants and improved performance
davidbrowne17 / csm-streaming
Forked from SesameAILabs/csm
Realtime demo, Streaming and Finetuning code for CSM
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!
Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.
The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM with Scalable Long-Term Memory"
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
stackblitz-labs / bolt.diy
Forked from stackblitz/bolt.new
Prompt, run, edit, and deploy full-stack web applications using any LLM you want!