Starred repositories
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Python Library for Causal and Probabilistic Modeling using Bayesian Networks
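Describes the Reactor API using SQL. Data processing logic can be implemented in SQL, with support for real-time data processing, aggregation, grouping, custom functions, and more, making data processing simpler.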
Learning summary and examples about data systems.
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
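A collection of big data material, covering distributed storage engines, distributed compute engines, data warehouse construction, and more. Keywords: Hadoop, HBase, ES, Kudu, Hive, Presto, Spark, Flink, Kylin, ClickHouse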
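mall learning tutorial: a full walkthrough of the architecture, business logic, and key technical points. The mall project (60k+ stars) is an e-commerce system built with current mainstream technologies. It covers SpringBoot, MyBatis, Elasticsearch, RabbitMQ, Redis, MongoDB, MySQL, and other technologies, and is deployed with Docker containers.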
https://blog.csdn.net/QXC1281/article/details/89070285
The official home of the Presto distributed SQL query engine for big data
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
TFX is an end-to-end platform for deploying production ML pipelines
Hopsworks - Data-Intensive AI platform with a Feature Store
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Efficient Retrieval Augmentation and Generation Framework
Forward-Looking Active REtrieval-augmented generation (FLARE)
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
Supercharge Your LLM Application Evaluations 🚀
Generative Agents: Interactive Simulacra of Human Behavior
A next-generation crawling and spidering framework.
Automatically visualize your pandas dataframe via a single print! 📊 💡
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
A Gradio web UI for Large Language Models with support for multiple inference backends.
TinyDB is a lightweight document-oriented database optimized for your happiness :)