Get in Touch

Course Outline

Introduction

  • Apache Spark versus Hadoop MapReduce

Overview of Apache Spark Features and Architecture

Choosing a Programming Language

Setting up Apache Spark

Creating a Sample Application

Choosing the Data Set

Running Data Analysis on the Data

Processing Structured Data with Spark SQL

Processing Streaming Data with Spark Streaming

Integrating Apache Spark with Third-Party Machine Learning Tools

Using Apache Spark for Graph Processing

Optimising Apache Spark

Troubleshooting

Summary and Conclusion

Requirements

  • Experience with the Linux command line
  • A general understanding of data processing
  • Programming experience in Java, Scala, Python, or R

Audience

  • Developers
 21 Hours

Number of participants


Price per participant

Testimonials (3)

Provisional Upcoming Courses (Require 5+ participants)

Related Categories