# ExcelRecordReaderMapReduce
**Repository Path**: kk25114/ExcelRecordReaderMapReduce
## Basic Information
- **Project Name**: ExcelRecordReaderMapReduce
- **Description**: MapReduce InputFormat that can read Excel files
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2021-04-23
- **Last Updated**: 2021-04-23
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
ExcelRecordReaderMapReduce
==========================
MapReduce Input format for hadoop mapreduce to read Microsoft Excel spreadsheet
License
Apache licensed.
Usage
1. Download and run ant.
2. Include ExcelRecordReaderMapReduce-0.0.1-SNAPSHOT.jar in your environment
3. Use ExcelInputFormat class as Mapper's input format.
Check src/test/resource/test.xls to see demofile.
The key returned is the file offset which starts with zero and value is the all columns value for single row.
Zip files are not supported
Execute the job as
> hadoop jar ExcelRecordReaderMapReduce-0.0.1-SNAPSHOT-jar-with-dependencies.jar in out
After the job has completed you can examine the contents of the output directory in HDFS.
>
hadoop fs -cat out/part*
>
0 Buffet Jimmy Somewhere on the Beach Key West FL
1 Bush George 1600 Pennsylvania Ave Washington DC
2 Cartman Eric 84 Bigboned Way South Park CO
3 Crockett Davey The Alamo San Antonio TX
4 Doe Jane 821 Zimbabwe Ave DC
5 Gates Bill 1 Microsoft Way Redmond WA