These serie 5BAC s of spark study notes will deep dive into Apache Spark source codes, which means there will be advanced topics and readers need to be alreadly familiar with Spark APIs.
- Execution Model and Job Scheduling
- Shuffle
- Spark SQL
- DataSet
- Catalyst Optimizer
- Spark Streaming
- and so on