# LearningSparkV2 **Repository Path**: mirrors_databricks/LearningSparkV2 ## Basic Information - **Project Name**: LearningSparkV2 - **Description**: This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition] - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 1 - **Created**: 2020-09-24 - **Last Updated**: 2025-10-19 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Learning Spark 2nd Edition Welcome to the GitHub repo for Learning Spark 2nd Edition. Chapters [2](chapter2/README.md), [3](chapter3/README.md), [6](chapter6/README.md), and [7](chapter7/README.md) contain stand-alone Spark applications. You can build all the JAR files for each chapter by running the Python script: `python build_jars.py`. Or you can cd to the chapter directory and build jars as specified in each README. Also, include `$SPARK_HOME/bin` in `$PATH` so that you don't have to prefix `SPARK_HOME/bin/spark-submit` for these standalone applications. For all the other chapters, we have provided notebooks in the [notebooks](notebooks) folder. We have also included notebook equivalents for a few of the stand-alone Spark applications in the aforementioned chapters. Have Fun, Cheers!