-
spark-fast-tests Public
Forked from mrpowers-io/spark-fast-testsApache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Scala MIT License UpdatedMay 20, 2025 -
elasticsearch-hadoop Public
Forked from elastic/elasticsearch-hadoop🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop
Java Apache License 2.0 UpdatedMay 10, 2025 -
deequ Public
Forked from awslabs/deequDeequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Scala Apache License 2.0 UpdatedApr 13, 2025 -
spark Public
Forked from apache/sparkApache Spark - A unified analytics engine for large-scale data processing
Scala Apache License 2.0 UpdatedApr 5, 2025 -
learning-notes Public
This is where I stash all the cool stuff I've read, heard, and learned from – books, articles, papers, and talks that I found interesting!
Scala UpdatedMar 23, 2025 -
unitycatalog Public
Forked from unitycatalog/unitycatalogOpen, Multi-modal Catalog for Data & AI
Python Apache License 2.0 UpdatedMar 19, 2025 -
h3-spark Public
Forked from nuzigor/h3-sparkBrings H3 - Hexagonal hierarchical geospatial indexing system support to Apache Spark SQL
Scala Apache License 2.0 UpdatedMar 16, 2025 -
airflow Public
Forked from apache/airflowApache Airflow - A platform to programmatically author, schedule, and monitor workflows
Python Apache License 2.0 UpdatedMar 11, 2025 -
-
spark-daria Public
Forked from mrpowers-io/spark-dariaEssential Spark extensions and helper methods ✨😲
Scala MIT License UpdatedJan 5, 2025 -
sql-spark-connector Public
Forked from microsoft/sql-spark-connectorApache Spark Connector for SQL Server and Azure SQL
Scala Apache License 2.0 UpdatedNov 14, 2024 -
polars Public
Forked from pola-rs/polarsDataframes powered by a multithreaded, vectorized query engine, written in Rust
Rust Other UpdatedNov 6, 2024 -
slatedb Public
Forked from slatedb/slatedbA cloud native embedded storage engine built on object storage.
Rust Apache License 2.0 UpdatedOct 25, 2024 -
awesome-spark Public
Forked from awesome-spark/awesome-sparkA curated list of awesome Apache Spark packages and resources.
Shell Creative Commons Zero v1.0 Universal UpdatedOct 25, 2024 -
tsumugi-spark Public
Forked from mrpowers-io/tsumugi-sparkSparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.
Python Apache License 2.0 UpdatedOct 20, 2024 -
datafusion-comet Public
Forked from apache/datafusion-cometApache DataFusion Comet Spark Accelerator
Rust Apache License 2.0 UpdatedOct 13, 2024 -
quinn Public
Forked from mrpowers-io/quinnpyspark methods to enhance developer productivity 📣 👯 🎉
Python Apache License 2.0 UpdatedOct 12, 2024 -
datafusion Public
Forked from apache/datafusionApache DataFusion SQL Query Engine
Rust Apache License 2.0 UpdatedSep 27, 2024 -
levi Public
Forked from mrpowers-io/leviDelta Lake helper methods. No Spark dependency.
Python MIT License UpdatedSep 9, 2024 -
jodie Public
Forked from mrpowers-io/jodieDelta lake and filesystem helper methods
Scala MIT License UpdatedSep 8, 2024 -
mack Public
Forked from MrPowers/mackDelta Lake helper methods in PySpark
Python MIT License UpdatedSep 5, 2024 -
zeppelin Public
Forked from apache/zeppelinWeb-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Java Apache License 2.0 UpdatedJun 3, 2024 -
python-deequ Public
Forked from awslabs/python-deequPython API for Deequ
Python Apache License 2.0 UpdatedApr 26, 2024 -
delta Public
Forked from delta-io/deltaAn open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Scala Apache License 2.0 UpdatedApr 22, 2024 -
chispa Public
Forked from MrPowers/chispaPySpark test helper methods with beautiful error messages
Python MIT License UpdatedApr 19, 2024 -
SSH.NET Public
Forked from sshnet/SSH.NETSSH.NET is a Secure Shell (SSH) library for .NET, optimized for parallelism.
C# MIT License UpdatedMar 24, 2024 -
mssql-jdbc Public
Forked from microsoft/mssql-jdbcThe Microsoft JDBC Driver for SQL Server is a Type 4 JDBC driver that provides database connectivity with SQL Server through the standard JDBC application program interfaces (APIs).
Java MIT License UpdatedOct 12, 2023 -
-
-