Hadoop Administration on MapR Training Course

Audience:

This course is designed to demystify big data and Hadoop technology, demonstrating that it is accessible and straightforward to understand.

This course is available as onsite live training in New Zealand or online live training.

Course Outline

Big Data Overview:

What is Big Data?
Why Big Data is gaining popularity
Big Data case studies
Key characteristics of Big Data
Solutions for working with Big Data

Hadoop & Its Components:

What is Hadoop, and what are its core components?
Hadoop architecture and the types of data it can handle and process.
A brief history of Hadoop, including the companies that use it and the reasons behind their adoption.
Detailed explanation of the Hadoop framework and its components.
Understanding HDFS (Hadoop Distributed File System), including read and write operations.
Setting up a Hadoop cluster in various modes: standalone, pseudo-distributed, and multi-node clusters.

(This includes configuring a Hadoop cluster in VirtualBox, KVM, or VMware, managing critical network configurations, running Hadoop daemons, and testing the cluster.)

What is the MapReduce framework and how does it operate?
Executing MapReduce jobs on a Hadoop cluster.
Understanding replication, mirroring, and rack awareness in the context of Hadoop clusters.
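
As an illustration of the MapReduce model named in the outline above, the following is a minimal in-process sketch of a word count. It is not course material and does not use any Hadoop API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, and the shuffle step that a real cluster performs across nodes is simulated here with a dictionary.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group emitted values by key (done by the framework on a real cluster)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in-process over an iterable of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "Big cluster"]))
```

On a real cluster the mapper and reducer run as separate tasks on different nodes; the sketch only shows how the three phases fit together.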

Hadoop Cluster Planning:

How to plan your Hadoop cluster effectively.
Understanding the hardware and software requirements for cluster planning.
Analysing workloads and planning the cluster to prevent failures and ensure optimal performance.
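
The storage side of cluster planning can be sketched as simple arithmetic. The example below is a rough sizing aid, not a method taken from the course: the function names are hypothetical, the replication factor of 3 matches the HDFS default, and the 25% headroom for temporary data and growth is an assumed figure that should be replaced with one derived from the actual workload.

```python
import math


def raw_storage_tb(data_tb, replication=3, headroom=0.25):
    """Raw disk (TB) needed to hold data_tb of user data.

    replication: copies kept of each block (3 is the HDFS default).
    headroom: fraction of raw disk kept free (assumed value, adjust per workload).
    """
    if not 0 <= headroom < 1:
        raise ValueError("headroom must be in [0, 1)")
    return data_tb * replication / (1 - headroom)


def nodes_needed(data_tb, disk_per_node_tb, replication=3, headroom=0.25):
    """Smallest whole number of data nodes whose disks cover the raw requirement."""
    raw = raw_storage_tb(data_tb, replication, headroom)
    return math.ceil(raw / disk_per_node_tb)


if __name__ == "__main__":
    # 100 TB of user data on nodes with 48 TB of disk each (illustrative numbers).
    print(raw_storage_tb(100), nodes_needed(100, 48))
```

Storage is only one input to planning; CPU, memory, and network requirements depend on the workload analysis mentioned above and are not covered by this sketch.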

What is MapR and Why Choose MapR:

An overview of MapR and its architecture.
Understanding and working with the MapR Control System, MapR Volumes, snapshots, and mirrors.
Planning a cluster within the MapR context.
Comparing MapR with other distributions and Apache Hadoop.
Installing MapR and deploying a cluster.

Cluster Setup & Administration:

Managing services, nodes, snapshots, mirror volumes, and remote clusters.
Understanding and managing individual nodes.
Understanding Hadoop components and installing them alongside MapR services.
Accessing data on the cluster, including via NFS.
Managing data using volumes.
Handling users and groups.
Assigning roles to nodes.
Commissioning and decommissioning nodes.
Cluster administration and performance monitoring.
Configuring and analysing metrics for performance tracking.
Configuring and administering MapR security.
Understanding and working with M7, the native storage solution for MapR tables.
Configuring and tuning the cluster for optimum performance.

Cluster Upgrade and Integration with Other Setups:

Upgrading the MapR software version and understanding the different types of upgrades.
Configuring a MapR cluster to access an HDFS cluster.
Setting up a MapR cluster on Amazon Elastic MapReduce.

All the above topics include demonstrations and practical sessions to provide learners with hands-on experience of the technology.

Requirements

Basic knowledge of the Linux File System (FS)
Fundamental Java skills
Familiarity with Apache Hadoop (recommended)

28 Hours

Open Training Courses require 5+ participants.

Dates are subject to availability and take place between 09:30 and 16:30.

Testimonials (1)

practical things of doing, also theory was served good by Ajay

Dominik Mazur - Capgemini Polska Sp. z o.o.

Course - Hadoop Administration on MapR

26880 NZD (Classroom)

Related Courses

Administrator Training for Apache Hadoop

35 Hours

Audience:

This course is designed for IT professionals seeking a solution to store and process large datasets within a distributed system environment.

Goal:

To develop in-depth expertise in Hadoop cluster administration.

Big Data Analytics in Health

21 Hours

Big data analytics involves the process of examining large volumes of diverse datasets to uncover correlations, hidden patterns, and other valuable insights.

The health sector manages vast quantities of complex, heterogeneous medical and clinical data. Applying big data analytics to health data holds significant potential for generating insights that can enhance healthcare delivery. However, the sheer scale of these datasets presents considerable challenges for analysis and practical application in clinical settings.

In this instructor-led, live training (delivered remotely), participants will learn how to perform big data analytics in the health domain by working through a series of hands-on, live lab exercises.

By the end of this training, participants will be able to:

Install and configure big data analytics tools such as Hadoop MapReduce and Spark
Understand the characteristics of medical data
Apply big data techniques to manage medical data effectively
Examine big data systems and algorithms within the context of health applications

Audience

Developers
Data Scientists

Course Format

A blend of lecture, discussion, exercises, and extensive hands-on practice.

Note

To request a customised training programme for this course, please contact us to make arrangements.

Hadoop For Administrators

21 Hours

Apache Hadoop is the most popular framework for processing Big Data across clusters of servers. In this three-day course (with an optional fourth day), attendees will explore the business benefits and use cases for Hadoop and its ecosystem, learn how to plan cluster deployment and growth, and gain hands-on experience installing, maintaining, monitoring, troubleshooting and optimising Hadoop. Participants will also perform bulk data loads into clusters, become familiar with various Hadoop distributions, and practise installing and managing Hadoop ecosystem tools. The course concludes with a discussion on securing clusters using Kerberos.

“…The materials were very well prepared and covered thoroughly. The Lab was very helpful and well organised”
— Andrew Nguyen, Principal Integration DW Engineer, Microsoft Online Advertising

Audience

Hadoop administrators

Format

Lectures and hands-on labs, with an approximate balance of 60% lectures and 40% labs.

Hadoop for Developers (4 days)

28 Hours

Apache Hadoop is the most widely used framework for processing Big Data across clusters of servers. This course introduces developers to the various components of the Hadoop ecosystem, including HDFS, MapReduce, Pig, Hive, and HBase.

Advanced Hadoop for Developers

21 Hours

Apache Hadoop is one of the most popular frameworks for processing Big Data across clusters of servers. This course explores data management in HDFS, along with advanced techniques in Pig, Hive, and HBase. These advanced programming skills will be particularly valuable for experienced Hadoop developers.

Audience: developers

Duration: three days

Format: lectures (50%) and hands-on labs (50%).

Hadoop and Spark for Administrators

35 Hours

This instructor-led, live training in New Zealand (online or on-site) is tailored for system administrators who want to learn how to set up, deploy, and manage Hadoop clusters within their organisation.

By the end of this training, participants will be able to:

Install and configure Apache Hadoop.
Understand the four core components of the Hadoop ecosystem: HDFS, MapReduce, YARN, and Hadoop Common.
Leverage the Hadoop Distributed File System (HDFS) to scale a cluster to hundreds or even thousands of nodes.
Configure HDFS as the storage engine for on-premises Spark deployments.
Set up Spark to interface with alternative storage solutions such as Amazon S3 and NoSQL database systems including Redis, Elasticsearch, Couchbase, Aerospike, and others.
Perform key administrative tasks such as provisioning, managing, monitoring, and securing an Apache Hadoop cluster.

HBase for Developers

21 Hours

This course introduces HBase – a NoSQL store built on top of Hadoop. It is designed for developers who will use HBase to build applications, as well as administrators responsible for managing HBase clusters.

We will guide developers through HBase architecture, data modelling, and application development on HBase. The course also covers using MapReduce with HBase and explores key administrative topics, including performance optimisation. With a strong emphasis on practical learning, the course includes numerous hands-on lab exercises.

Duration: 3 days

Audience: Developers & Administrators

Apache NiFi for Administrators

21 Hours

Apache NiFi is an open-source, flow-based data integration and event-processing platform. It enables automated, real-time data routing, transformation, and system mediation between disparate systems, with a web-based UI and fine-grained control.

This instructor-led, live training (onsite or remote) is aimed at intermediate-level administrators and engineers who wish to deploy, manage, secure, and optimise NiFi dataflows in production environments.

By the end of this training, participants will be able to:

Install, configure, and maintain Apache NiFi clusters.
Design and manage dataflows from varied sources and sinks.
Implement flow automation, routing, and transformation logic.
Optimise performance, monitor operations, and troubleshoot issues.

Format of the Course

Interactive lecture with real-world architecture discussion.
Hands-on labs: building, deploying, and managing flows.
Scenario-based exercises in a live-lab environment.

Course Customisation Options

To request a customised training for this course, please contact us to arrange.

Apache NiFi for Developers

7 Hours

In this instructor-led, live training in New Zealand, participants will learn the fundamentals of flow-based programming as they develop a range of demo extensions, components, and processors using Apache NiFi.

By the end of this training, participants will be able to:

Understand NiFi's architecture and dataflow concepts.
Develop extensions using NiFi and third-party APIs.
Develop their own custom Apache NiFi processor.
Ingest and process real-time data from diverse and uncommon file formats and data sources.

Python, Spark, and Hadoop for Big Data

21 Hours

This instructor-led, live training in New Zealand (available online or on-site) is tailored for developers seeking to use and integrate Spark, Hadoop, and Python to process, analyse, and transform large and complex data sets.

By the end of this training, participants will be able to:

Set up the necessary environment to begin processing big data using Spark, Hadoop, and Python.
Understand the key features, core components, and architecture of both Spark and Hadoop.
Learn how to integrate Spark, Hadoop, and Python for efficient big data processing.
Explore tools within the Spark ecosystem, including Spark MLlib, Spark Streaming, Kafka, Sqoop, and Flume.
Build collaborative filtering recommendation systems similar to those used by Netflix, YouTube, Amazon, Spotify, and Google.
Use Apache Mahout to scale machine learning algorithms.
