Course Outline

Introduction to Programming with Big Data in R (pbdR)

  • Setting up your environment to use pbdR
  • Scope and tools available in pbdR
  • Packages commonly used with Big Data alongside pbdR

Message Passing Interface (MPI)

  • Using pbdMPI, the pbdR interface to MPI
  • Parallel processing
  • Point-to-point communication
  • Sending matrices
  • Summing matrices
  • Collective communication
  • Summing matrices with Reduce
  • Scatter / Gather
  • Other MPI communications

Distributed Matrices

  • Creating a distributed diagonal matrix
  • SVD of a distributed matrix
  • Building a distributed matrix in parallel

Statistics Applications

  • Monte Carlo integration
  • Reading datasets
  • Reading across all processes
  • Broadcasting from a single process
  • Reading partitioned data
  • Distributed regression
  • Distributed bootstrap

Duration: 21 hours
