-
Kuaishou
- Shenzhen, CN
Starred repositories
Multi-platform high-performance compute language extension for Rust.
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
The native Rust implementation for Apache Hudi, with C++ & Python API bindings.
✨ Setup Apache Spark in GitHub Action workflows
OLAP Database Performance Tuning Guide
A fast, non-cryptographic, minimally DoS-resistant hashing algorithm for Rust.
A collection of RBIR projects and posts for anyone interested in joining this journey.
Frozen version of Rust standard library's hashbrown
Fast portable hash algorithm with highest bulk throughput and lowest small key latency (1-32 bytes) among portable hashes listed in SMHasher3, and made improvements for quality and usability.
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
A library that toggles Scala code at compile-time, like #if in C/C++
All the things about TPC-DS in Apache Spark
The fast Rust-based web bundler with webpack-compatible API 🦀️
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
Implementation of Apache ORC file format use Apache Arrow in-memory format
BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
`SlimmerBox<T>` is a packed alternative to `Box<T>` whose 'fat' pointer is 'slimmer'