Demonstration of how Spark jobs are executed on Amazon Elastic MapReduce (EMR)
Amazon Elastic MapReduce (EMR) is used to process big data workloads on the AWS Cloud using open-source tools such as Apache Spark, Hive, Pig, and HBase.
Amazon EMR makes it easy to set up, operate, and scale your big data environments by automating time-consuming tasks like provisioning capacity and tuning clusters.
In this LIVE session, we will demonstrate, with a live demo, how Spark jobs are executed on an Amazon EMR cluster.
Topics covered in this session:
- Introduction to Amazon EMR
- Architecture of EMR
- Benefits of EMR
- Spark workloads on EMR - Live Demo
- Q&A
About the speaker(s)
Vijay Nathari
Vijay is a Certified AWS Solutions Architect – Professional with experience in building enterprise-scale applications both on-premises and in the cloud. He also has big data analytics experience and a good understanding of open-source tools like Spark, Hive, Pig, and Sqoop.
Additionally, Vijay is experienced in designing business reports using BI tools like Oracle BI Publisher and Quadient Inspire.