- San Francisco Bay Area
- gravitino Public
Forked from apache/gravitino
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Java Apache License 2.0 Updated Jun 11, 2024
- gluten Public
Forked from apache/incubator-gluten
Gluten: Plugin to Double SparkSQL's Performance
Scala Apache License 2.0 Updated Mar 13, 2024
- iceberg Public
Forked from apache/iceberg
Apache Iceberg
Java Apache License 2.0 Updated Jul 13, 2023
- spark Public
Forked from apache/spark
Mirror of Apache Spark
- substrait Public
Forked from substrait-io/substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
HTML Apache License 2.0 Updated Apr 27, 2022
- OpenMetadata Public
Forked from open-metadata/OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Java Apache License 2.0 Updated Oct 2, 2021
- parquet-mr Public
Forked from apache/parquet-java
Mirror of Apache Parquet
Java Apache License 2.0 Updated Oct 8, 2020
- dotfiles Public
Forked from startup-class/dotfiles
Development configuration files for Linux.
Vim Script Updated Feb 20, 2020
- presto Public
Forked from prestodb/presto
Distributed SQL query engine for big data
Java Apache License 2.0 Updated Apr 21, 2019
- crypt Public
WIP: Notebook to manage digital assets across crypto exchanges
Jupyter Notebook Updated Jan 15, 2018
- hive Public
Forked from apache/hive
Mirror of Apache Hive
Java Apache License 2.0 Updated Jul 27, 2017
- pinball Public
Forked from pinterest/pinball
Pinball is a scalable workflow manager
JavaScript Apache License 2.0 Updated Jun 23, 2017
- kafka Public
Forked from apache/kafka
Mirror of Apache Kafka
Scala Apache License 2.0 Updated Jan 5, 2017
- hadoop Public
Forked from apache/hadoop
Mirror of Apache Hadoop
Java Apache License 2.0 Updated Aug 25, 2016
- sentry Public
Forked from apache/sentry
Mirror of Apache Sentry
Java Apache License 2.0 Updated Aug 24, 2016
- stream-reactor Public
Forked from lensesio/stream-reactor
Streaming reference architecture built around Kafka.
- calamus Public
Sentiment analysis on Tweets in Real-Time
- kafka-connect-twitter Public
Forked from rollulus/kafka-connect-twitter
Kafka Connect Source for Twitter
Scala Apache License 2.0 Updated Apr 24, 2016
- incubator-sentry Public
Forked from apache/incubator-sentry
Mirror of Apache Sentry
Java Apache License 2.0 Updated Apr 5, 2016
- pandas Public
Forked from pandas-dev/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Python Other Updated Mar 26, 2016
- Whatsapp Public
Forked from leohmoraes/Whatsapp
Some "useful" whatsapp codes
JavaScript Updated Mar 25, 2016
- ducktape Public
Forked from confluentinc/ducktape
System integration and performance tests
Python Updated Sep 21, 2015
- nifi Public
Forked from apache/nifi
Mirror of Apache NiFi
Java Apache License 2.0 Updated Aug 27, 2015
- parquet-format Public
Forked from apache/parquet-format
Mirror of Apache Parquet
Java Apache License 2.0 Updated May 19, 2015
- strapdown Public
Forked from arturadib/strapdown
Instant and elegant Markdown documents in the browser
JavaScript MIT License Updated Feb 15, 2015
- hive-testbench Public
Forked from brockn/hive-testbench
Testbench for experimenting with Apache Hive at any data scale.
Java Updated Dec 11, 2014
- setup Public
Forked from startup-class/setup
Sets up development environment on Linux machine.
- bash-boilerplate Public
Forked from oxyc/bash-boilerplate
A simple starting point for bash scripts
Shell MIT License Updated Dec 3, 2013