-
spark-clickhouse-plugin Public
Forked from The-Analytics-Gladiators/spark-clickhouse-plugin
The most intuitive Spark Plugin for interacting with Clickhouse
Scala MIT License Updated Jul 14, 2023 -
spark-partition-sizing Public
Forked from AbsaOSS/spark-partition-sizing
Sizing partitions in Spark
Scala Apache License 2.0 Updated Mar 21, 2023 -
spark-platform Public
Forked from joomcode/spark-platform
Basic Spark utilities
Scala MIT License Updated Feb 7, 2023 -
spark-docker Public
Forked from apache/spark-docker
Official Dockerfile for Apache Spark
Shell Apache License 2.0 Updated Nov 15, 2022 -
sparkMeasure Public
Forked from LucaCanali/sparkMeasure
This is the development repository for sparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task and stage metrics data.
Scala Apache License 2.0 Updated Aug 29, 2022 -
data-generator Public
Forked from bartosz25/data-generator
User web sessions data generator written in Python, for Kafka, Kinesis or local file system sinks
Python GNU General Public License v3.0 Updated Aug 21, 2022 -
ru-neophyte-guide-to-scala Public
Forked from anton-k/ru-neophyte-guide-to-scala
Russian translation of Daniel Westheide's article series "The Neophyte's Guide to Scala"
Updated Jan 26, 2022 -
delta Public
Forked from delta-io/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Scala Apache License 2.0 Updated Dec 2, 2021 -
scala-best-practices Public
Forked from alexandru/scala-best-practices
A collection of Scala best practices
Updated Jul 25, 2021 -
connectors Public
Forked from windpiger/connectors
Connectors for Delta Lake
Scala Apache License 2.0 Updated Jun 23, 2021 -
sope Public
Forked from mayur2810/sope
Apache Spark ETL Utilities
Scala Apache License 2.0 Updated Feb 15, 2021 -
spark-scala-examples Public
Forked from spark-examples/spark-scala-examples
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
Scala Updated Jan 24, 2021 -
spark-schema-registry Public
Forked from hortonworks-spark/spark-schema-registry
Schema Registry integration for Apache Spark
Scala Apache License 2.0 Updated Jan 20, 2021 -
spark-http-streaming Public template
Forked from cchandurkar/spark-http-streaming
Running Apache Spark Structured Streaming job on the local machine with an HTTP web server as a streaming source.
Scala MIT License Updated Nov 4, 2020 -
spark-sql-kafka-offset-committer Public
Forked from HeartSaVioR/spark-sql-kafka-offset-committer
Kafka offset committer for structured streaming query
Scala Apache License 2.0 Updated Jun 28, 2020 -
deordie-meetups Public
Forked from deordie/deordie-meetups
DE or DIE meetup made by data engineers for data engineers. Currently in Russian.
Creative Commons Attribution 4.0 International Updated May 13, 2020 -
waimak Public
Forked from CoxAutomotiveDataSolutions/waimak
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Scala Apache License 2.0 Updated Oct 23, 2019 -
deequ Public
Forked from awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Scala Apache License 2.0 Updated Sep 26, 2019 -
scalacaster Public
Forked from vkostyukov/scalacaster
Purely Functional Algorithms and Data Structures in Scala
Scala Updated Sep 2, 2019 -
spark-utils Public
Forked from tupol/spark-utils
Basic framework utilities to quickly start writing production ready Apache Spark applications
Scala MIT License Updated Aug 28, 2019 -
scala-exercises Public
Forked from scala-exercises/scala-exercises
The easy way to learn Scala.
Scala Other Updated Jun 21, 2019 -
spark-structured-streaming-jdbc-sink Public
Forked from mshtelma/spark-structured-streaming-jdbc-sink
Spark Structured Streaming JDBC Sink
Scala Apache License 2.0 Updated May 31, 2019 -
sbt-common-settings Public
Forked from kotobotov/sbt-common-settings
Collections of common plugins and settings for sbt
Scala Updated May 28, 2019 -
metorikku Public
Forked from YotpoLtd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Scala MIT License Updated Mar 9, 2019 -
spark-scala-playground Public
Forked from bartosz25/spark-scala-playground
Sample processing code using Spark 2.1+ and Scala
Scala Updated Feb 6, 2019 -
odsc-west-streaming-trends Public
Forked from newfront/odsc-west-streaming-trends
All Data, Relevant Information, Scripts, and Applications for the Open Data Science Conference (2018)
Scala GNU General Public License v3.0 Updated Nov 2, 2018 -
data-model-generator Public
Forked from piotr-kalanski/data-model-generator
Data model generator based on Scala case classes
Scala Apache License 2.0 Updated Jun 14, 2018