# simplesparkapp **Repository Path**: mirrors_sryza/simplesparkapp ## Basic Information - **Project Name**: simplesparkapp - **Description**: Simple Spark Application - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2020-08-18 - **Last Updated**: 2026-03-29 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README Simple Spark Application ============== A simple Spark application that counts the occurrence of each word in a corpus and then counts the occurrence of each character in the most popular words. Includes the same program implemented in Java and Scala. To make a jar: mvn package To run from a gateway node in a CDH5 cluster: spark-submit --class com.cloudera.sparkwordcount.SparkWordCount --master local \ target/sparkwordcount-0.0.1-SNAPSHOT.jar 2 This will run the application in a single local process. If the cluster is running a Spark standalone cluster manager, you can replace "--master local" with "--master spark://``:``". If the cluster is running YARN, you can replace "--master local" with "--master yarn".