# simplesparkapp
**Repository Path**: mirrors_sryza/simplesparkapp
## Basic Information
- **Project Name**: simplesparkapp
- **Description**: Simple Spark Application
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2020-08-18
- **Last Updated**: 2026-03-29
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
Simple Spark Application
==============
A simple Spark application that counts the occurrence of each word in a corpus and then counts the
occurrence of each character in the most popular words. Includes the same program implemented in
Java and Scala.
To make a jar:
mvn package
To run from a gateway node in a CDH5 cluster:
spark-submit --class com.cloudera.sparkwordcount.SparkWordCount --master local \
target/sparkwordcount-0.0.1-SNAPSHOT.jar 2
This will run the application in a single local process. If the cluster is running a Spark standalone
cluster manager, you can replace "--master local" with "--master spark://``:``".
If the cluster is running YARN, you can replace "--master local" with "--master yarn".