# LearnAnalytics-SparkML **Repository Path**: mirrors_Azure/LearnAnalytics-SparkML ## Basic Information - **Project Name**: LearnAnalytics-SparkML - **Description**: No description available - **Primary Language**: Unknown - **License**: MIT - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2020-08-08 - **Last Updated**: 2026-03-28 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Spark with HDInsight - Enterprise Ready Machine Learning and Interactive Data Analysis at Scale This repository contains the materials for the course entitled: _Spark with HDInsight - Enterprise Ready Machine Learning and Interactive Data Analysis at Scale_. # For Instructors: 1. The Instructor-Resources directory contains useful scripts for redelivering this course. ## For Students: 1. The rendered course materials are available in the Student-Resources folder. 2. [Class Playlist](https://open.spotify.com/user/pakmanaz/playlist/02R6d9fLRwxI06EHcm2Mcs) * As your instructor, I'll also be your workshop dj. Feel free to make requests. ## Schedule ### Day One + Spark Fundamentals + Running Spark Applications on HDInsight and YARN + Spark SQL and the DataFrames and Datasets API ### Day Two + SparkML Pipelines + Tokenization and Text Featurization + `mmlspark` ### Day Three 1. R Server on Spark 2. GraphFrames & Structured Streaming 3. Hackathon ## Contributing Contributions in any form are appreciated and encouraged! If you find errors, please submit an Issue or, for brownie/beer points, create a pull request with a fix! This project has adopted the [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/). For more information see the [Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/) or contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any additional questions or comments.