Stars
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
ClickHouse® is a real-time analytics database management system
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
V2rayU,基于v2ray核心的mac版客户端,用于科学上网,使用swift编写,支持trojan,vmess,shadowsocks,socks5等服务协议,支持订阅, 支持二维码,剪贴板导入,手动配置,二维码分享等
A high performance caching library for Java
Alluxio, data orchestration for analytics and machine learning in the cloud
Mirror of the Xapian repository. You're welcome to open pull requests on github (they'll just get merged indirectly).
A library that provides an embeddable, persistent key-value store for fast storage.
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
23 GoF Patterns: RAII-Centric C++ Implementation -> Explicit Ownership via unique_ptr/shared_ptr/weak_ptr
An extension for using Cursor in Visual Studio Code.
An industrial deep learning framework for high-dimension sparse data
一个搜索引擎迷你项目,涉及分词,建倒排索引,网页去重,计算相似度,文本聚类,多进程编程,网络编程,守护进程编写,makefile编写,工程组织等各方面内容