lalaguozhe (yukang.chen) / Starred · GitHub
  • dianping.com
  • shanghai

Organizations

@dianping @dp-bigdata

FireFlyer Record file format, writer and reader for DL training samples.

Python 225 24 Updated Dec 1, 2022

Java bindings for https://github.com/facebookincubator/velox

Java 28 8 Updated Jun 28, 2025

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Go 2,952 1,412 Updated Jun 27, 2025

View parquet files online

Rust 161 8 Updated Jun 28, 2025

Fluss is a streaming storage built for real-time analytics.

Java 1,230 330 Updated Jun 29, 2025

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 34,230 2,284 Updated Jun 29, 2025

Python implementation of the Parquet columnar file format.

Python 837 183 Updated Mar 24, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,679 245 Updated Jun 28, 2025

Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.

Rust 1,490 158 Updated Jun 28, 2025

Principles of search engines

1,719 142 Updated Apr 19, 2024

Java 208 102 Updated Jun 28, 2025

A QoS-based scheduling system that brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.

Go 1,535 370 Updated Jun 26, 2025

OpenAI API client in Java

Java 4,790 1,220 Updated Jun 6, 2024

BibiGPT v1 · one-Click AI Summary for Audio/Video & Chat with Learning Content: Bilibili | YouTube | Tweet丨TikTok丨Dropbox丨Google Drive丨Local files | Websites丨Podcasts | Meetings | Lectures, etc. 音视…

TypeScript 5,645 748 Updated Feb 17, 2024

🔬 Online Heap Dump, GC Log, Thread Dump & JFR File Analyzer.

Java 618 105 Updated May 27, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,436 555 Updated Jun 24, 2025

Flowchart for debugging Spark applications

Shell 105 27 Updated Sep 25, 2024

A better notebook for Scala (and more)

Jupyter Notebook 4,571 397 Updated May 19, 2025

A query predictor pipeline and service to predict resource usages of Presto queries

Python 15 5 Updated May 2, 2023

Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.

Java 996 343 Updated Jun 26, 2025

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

Java 2,837 408 Updated Jun 25, 2025

Warp is a modern, Rust-based terminal with AI built in so you and your team can build great software, faster.

23,898 477 Updated Jun 25, 2025

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,378 552 Updated Jun 29, 2025

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 42,528 5,668 Updated Jun 29, 2025

🔥 An open-source BI tool that anyone can use, and a powerful tool for data visualization. An open-source BI tool alternative to Tableau.

Java 20,411 3,625 Updated Jun 27, 2025

The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them

Python 135 38 Updated Oct 25, 2023

Data Lineage Tracking And Visualization Solution

Scala 634 159 Updated Jun 29, 2025

Databricks Scala Coding Style Guide

2,768 588 Updated Apr 5, 2024

Readings in Databases

7,851 912 Updated Sep 9, 2024