Starred repositories
KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models".
the AI-native open-source embedding database
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
AlayaLite – A Fast, Flexible Vector Database for Everyone.
Vector search engine inside Milvus, integrating FAISS, HNSW, DiskANN.
A library for efficient similarity search and clustering of dense vectors.
Elegant reading of real-time and hottest news
InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
High-performance, low-footprint SQL database written in C++. Process millions of rows per second from Kafka/Pulsar, Iceberg, or ClickHouse, and seamlessly write results back. Supports powerful feat…
Global-Scale Sustainable Blockchain Fabric
OpenResume is a powerful open-source resume builder and resume parser. https://open-resume.com/
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with…
OpenHuFu is an open-sourced data federation system to support collaborative queries over multi databases with security guarantee.
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
This repo offers a simple interface that helps you to read&summerize research papers in pdf format. You can ask some questions after reading. This interface is developed based on openai API and usi…
Making large AI models cheaper, faster and more accessible
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Scalable Python DS & ML, in an API compatible & lightning fast way.
libSQL is a fork of SQLite that is both Open Source, and Open Contributions.
An SQL backend for the mlinspect framework to transpile, execute and inspect machine learning pipelines in a database system.
Convert pandas DataFrame manipulations to sql query string
A Database System for Research and Fast Prototyping
ByConity is an open source cloud data warehouse
Concurrency primitives, safe memory reclamation mechanisms and non-blocking (including lock-free) data structures designed to aid in the research, design and implementation of high performance conc…