Get in Touch

Course Outline

  1. Big Data Fundamentals
    • The role of Big Data in the corporate world
    • Phases of developing a Big Data strategy within a corporation
    • Understanding the rationale behind a holistic approach to Big Data
    • Essential components of a Big Data platform
    • Big Data storage solutions
    • Limits of traditional technologies
    • Overview of database types
    • The four dimensions of Big Data
  2. Impact of Big Data on Business
    • The business importance of Big Data
    • Challenges in extracting useful data
    • Integrating Big Data with traditional data
  3. Big Data Storage Technologies
    • Overview of Big Data technologies
      • Data storage models
      • Hadoop
      • Hive
      • Cassandra
      • MongoDB
    • Choosing the right Big Data technology
  4. Processing Big Data
    • Connecting to and extracting data from databases
    • Transforming and preparing data for processing
    • Using Hadoop MapReduce for processing distributed data
    • Monitoring and executing Hadoop MapReduce jobs
    • Building blocks of the Hadoop Distributed File System
    • MapReduce and YARN
    • Handling streaming data with Spark
  5. Big Data Analysis Tools and Technologies
    • Programming Hadoop with Pig Latin
    • Querying Big Data with Hive
    • Mining data with Mahout
    • Visualisation and reporting tools
  6. Big Data in Business
    • Managing and establishing Big Data requirements
    • The business importance of Big Data
    • Selecting the right Big Data tools for the problem at hand

Data Warehousing Concepts

  • What is a Data Warehouse?
  • Differences between OLTP and Data Warehousing
  • Data Acquisition
  • Data Extraction
  • Data Transformation
  • Data Loading
  • Data Marts
  • Dependent vs Independent Data Marts
  • Database Design

ETL Testing Concepts:

  • Introduction
  • Software Development Life Cycle
  • Testing Methodologies
  • ETL Testing Workflow Process
  • ETL Testing Responsibilities in the Data Stage

Big Data Fundamentals

  • The role of Big Data in the corporate world
  • Phases of developing a Big Data strategy within a corporation
  • Understanding the rationale behind a holistic approach to Big Data
  • Essential components of a Big Data platform
  • Big Data storage solutions
  • Limits of traditional technologies
  • Overview of database types

NoSQL Databases

Hadoop

MapReduce

Apache Spark

Requirements

Delegates should have a foundational awareness and some practical experience with storage tools, as well as an understanding of how to handle large data sets.

 14 Hours

Number of participants


Price per participant

Testimonials (1)

Provisional Upcoming Courses (Require 5+ participants)

Related Categories