MarkLogic Data Hub Training Course
MarkLogic Data Hub is an open-source, consolidated data repository that provides a comprehensive suite of tools and libraries to accelerate enterprise data integration and delivery.
This instructor-led, live training (available online or on-site) is designed for system administrators, database administrators, data architects, and developers who wish to install, configure, and manage MarkLogic Data Hub to consolidate and govern data from disparate silos.
By the end of this training, participants will be able to customise, secure, track, and manage enterprise data effectively using the capabilities and tools of MarkLogic Data Hub.
Course Format
- Interactive lectures and discussions.
- Abundant exercises and hands-on practice.
- Real-world implementation within a live-lab environment.
Course Customisation Options
- To request a customised version of this training, please contact us to arrange.
Course Outline
Introduction
Overview of MarkLogic Data Hub Features and Architecture
Getting Started with MarkLogic Data Hub
Importing, Migrating, and Converting Existing Artifacts
Exploring MarkLogic Data Hub Concepts
Setting up Users, Roles, and Privileges
Deploying Security Configuration Using QuickStart and ml-gradle
Working with Data Ingestion and Flow Pipelines
Working with Steps, Mapping, and Modules
Configuring Project Steps and Flows
Understanding Key Semantic Data Modelling Concepts
Accessing Data Using JavaScript APIs and SPARQL
Managing Data on DHS Using Hub Central
Managing On-Premises Data, Projects, Flows, and Steps
Serving Data Out of MarkLogic Using REST and ODBC
Tracking Data History and Data Lineage Origin
Replicating Existing Data Flow with a New Data Source
Using Smart Mastering with MarkLogic Data Hub
Troubleshooting
Summary and Conclusion
Requirements
- Experience with database management systems
- Familiarity with JavaScript, C, C++, or any other programming language
Audience
- System administrators
- Database administrators
- Data architects
- Developers
Open Training Courses require 5+ participants.
Testimonials (2)
The variety of the information shared and the clarity to explain terms in plain English.
Arisbe Mendoza - Fairtrade International
Course - GDPR Workshop
It's a hands-on session.
Vorraluck Sarechuer - Total Access Communication Public Company Limited (dtac)
Course - Talend Open Studio for ESB
Related Courses
Data Ethics
14 Hours
Data Ethics is the discipline focused on the responsible collection, use, and decision-making around data in ways that uphold human rights, privacy, transparency, and fairness.
This instructor-led, live training (delivered online or on-site) is designed for public sector professionals with little to no prior training in data ethics who manage or govern data and wish to understand ethical risks, evaluate real-world dilemmas, and apply principles of responsible data use that align with institutional values and public trust.
By the end of this training, participants will be able to:
- Define key concepts and frameworks in data ethics.
- Identify ethical risks and trade-offs in data collection, analysis, and deployment.
- Apply principles of transparency, consent, and fairness to real-world scenarios.
- Integrate ethical review into governance or operational workflows.
Course Format
- Interactive lecture and discussion.
- Hands-on analysis of real-world data ethics cases.
- Guided exercises focused on ethical evaluation and policy alignment.
Course Customisation Options
- To request a customised training session for this course based on your department's workflows or internal tools, please contact us to arrange.
Data Integrity and Availability
14 Hours
Data Integrity and Availability is the discipline of ensuring that data remains accurate, complete, consistent, and accessible when needed, especially in high-trust public sector environments.
This instructor-led, live training (online or on-site) is designed for public sector professionals responsible for managing or safeguarding data—regardless of their technical background—who wish to ensure the reliability, consistency, and availability of critical datasets and systems under their control.
By the end of this training, participants will be able to:
- Define and differentiate the principles of integrity and availability within the data lifecycle.
- Detect and prevent data corruption, inconsistency, or unauthorised alterations.
- Design data environments that ensure high availability and business continuity.
- Implement policies and controls that promote long-term data reliability.
Course Format
- Interactive lecture and discussion.
- Hands-on evaluation of data risks and failure points.
- Guided exercises focused on policy development and incident prevention.
Course Customisation Options
- To request a customised training session for this course based on your department's workflows or internal tools, please contact us to arrange.
Data Policies and Standards
14 Hours
Data Policies and Standards represents a structured approach to ensuring that government data is created, maintained, accessed, and used in ways that are consistent, secure, and aligned with legal and ethical guidelines.
This instructor-led, live training, available online or onsite, is designed for public sector professionals responsible for establishing or applying data policies—irrespective of their technical background—who aim to standardise, document, and enforce data practices across departments or systems.
By the conclusion of this training, participants will be able to:
- Define and distinguish between data policies, standards, and procedures.
- Draft and evaluate data governance policies that align with national and international frameworks.
- Promote consistent, high-quality data practices across teams and departments.
- Establish a foundation for compliance, audit readiness, and trustworthy data systems.
Course Format
- Interactive lectures and discussions.
- Hands-on drafting of sample policies and standards.
- Guided evaluation of existing data workflows and controls.
Course Customisation Options
- To request a customised training session for this course tailored to your department's workflows or internal tools, please contact us to arrange.
Data Strategy
14 Hours
A Data Strategy represents the long-term plan for how an organisation will manage, utilise, and invest in data to advance its mission, enhance public services, and ensure accountability.
This instructor-led, live training (available online or on-site) is designed for public sector professionals with limited or emerging experience in data strategy who shape or influence strategic decisions and wish to develop sustainable, mission-aligned data strategies across their organisation or department.
By the end of this training, participants will be able to:
- Define the key elements of a comprehensive data strategy.
- Align data initiatives with organisational objectives and public value.
- Develop roadmaps for data governance, infrastructure, skills, and innovation.
- Evaluate maturity and progress toward becoming a data-driven organisation.
Course Format
- Interactive lectures and discussions.
- Hands-on development of strategy components and roadmaps.
- Guided analysis of public sector case studies and strategic frameworks.
Course Customisation Options
- To request a customised training session for this course tailored to your department's workflows or internal tools, please contact us to arrange.
EBX5 for Developers
21 Hours
This instructor-led, live training in New Zealand (available online or on-site) is intended for developers who wish to leverage EBX5 (TIBCO EBX) to implement a Master Data Management solution within their organisation.
By the end of this training, participants will be able to:
- Interpret requirements and architect an MDM solution.
- Enable the management and integration of master data.
- Integrate and transfer data across multiple systems.
- Import data into EBX5 using match and merge logic.
- Design, create and document a data model that addresses their organisation's business requirements.
- Integrate EBX5 with third-party services.
GDPR Workshop
7 Hours
Gain mastery over the core principles of the General Data Protection Regulation in a comprehensive one-day workshop tailored for managers, department heads, and compliance professionals. The programme covers GDPR fundamentals, data subject rights, data protection principles, consent requirements, breach notification duties, and privacy by design. It offers practical frameworks to embed GDPR compliance strategies throughout your organisation, ensuring lawful data processing and fostering a culture of accountability in data protection.
How to Audit GDPR Compliance
14 Hours
This course is primarily designed for auditors and other administrative professionals responsible for ensuring that their control systems and IT environments comply with current laws and regulations. It begins by establishing a clear understanding of key GDPR concepts and how they impact the work of auditors. Participants will also examine data subjects' rights, the obligations of data controllers and processors, and the enforcement and compliance mechanisms within the framework of the Regulation. The training further covers the audit programme provided by ISACA, equipping auditors to review GDPR governance and response mechanisms, as well as supporting processes that help manage risks associated with non-compliance.
Oracle GoldenGate
14 Hours
This instructor-led, live training in New Zealand (online or on-site) is tailored for system administrators and developers who wish to set up, deploy, and manage Oracle GoldenGate for data transformation purposes.
By the end of this training, participants will be able to:
- Install and configure Oracle GoldenGate.
- Gain an understanding of Oracle database replication using the Oracle GoldenGate tool.
- Comprehend the Oracle GoldenGate architecture.
- Configure and execute database replication and migration tasks.
- Optimise Oracle GoldenGate performance and troubleshoot any issues that arise.
Personal Data Protection Officer - Basic Level
21 Hours
Purpose of the Training
- Introducing participants to the structured and comprehensive aspects of personal data protection under Polish and European law
- Delivering practical knowledge on the updated rules for processing personal data
- Highlighting key legal risk areas arising from the implementation of the GDPR
- Providing hands-on preparation to independently carry out the role of a Personal Data Protection Officer
Personal Data Protection Officer - Advanced Level
14 Hours
Purpose of the Training
- Gaining practical knowledge on how to perform the tasks of the Data Protection Officer
- Gaining practical knowledge of how to audit and how to assess risk
- Providing practical knowledge about the new rules for the processing of personal data
Privacy in Federal Institutions (Requirements under the Privacy Act)
7 Hours
Privacy in Federal Institutions is a foundational course focused on the Privacy Act and its requirements for protecting personal information in government operations.
This instructor-led, live training (delivered online or onsite) is designed for public sector professionals with limited or emerging experience in privacy legislation, who manage or process citizen data and seek to ensure compliance with the Privacy Act and related federal standards.
By the end of this training, participants will be able to:
- Understand the key provisions and principles of the Privacy Act.
- Identify personal information and handle it in line with legal obligations.
- Develop and implement privacy-compliant practices in day-to-day operations.
- Respond effectively to access to information and correction requests.
Course Format
- Interactive lecture and discussion.
- Hands-on application of policy scenarios in public sector contexts.
- Guided exercises focusing on compliance, documentation, and reporting.
Course Customisation Options
- To request a customised training session for this course tailored to your department's workflows or internal tools, please contact us to arrange.
Talend Administration Center (TAC)
14 Hours
This instructor-led, live training in New Zealand (online or on-site) is designed for system administrators, data scientists, and business analysts seeking to establish and manage Talend Administration Center for deploying and overseeing organisational roles and tasks.
By the conclusion of this training, participants will be able to:
- Install and configure Talend Administration Center.
- Understand and implement the fundamentals of Talend management.
- Build, deploy, and execute business projects or tasks within Talend.
- Monitor dataset security and develop business routines leveraging the TAC framework.
- Gain a broader understanding of big data applications.
Talend Big Data Integration
28 Hours
This instructor-led, live training in New Zealand (online or on-site) is intended for technical professionals who wish to deploy Talend Open Studio for Big Data to streamline the process of reading and analysing big data.
By the end of this training, participants will be able to:
- Install and configure Talend Open Studio for Big Data.
- Connect to big data systems such as Cloudera, Hortonworks, MapR, Amazon EMR and Apache.
- Understand and configure Open Studio's big data components and connectors.
- Set parameters to automatically generate MapReduce code.
- Use Open Studio's drag-and-drop interface to run Hadoop jobs.
- Prototype big data pipelines.
- Automate big data integration projects.
Talend Data Stewardship
14 Hours
This instructor-led, live training in New Zealand (delivered either online or on-site) is designed for beginner to intermediate-level data analysts who wish to deepen their understanding and enhance their skills in managing and improving data quality using Talend Data Stewardship.
By the end of this training, participants will be able to:
- Gain a comprehensive understanding of the role of data stewardship in maintaining high-quality data.
- Use Talend Data Stewardship to effectively manage data quality tasks.
- Create, assign, and manage tasks within Talend Data Stewardship, including customising workflows.
- Leverage the tool's reporting and monitoring capabilities to track data quality and stewardship initiatives.
Talend Open Studio for ESB
21 Hours
In this instructor-led, live training in New Zealand, participants will learn how to use Talend Open Studio for ESB to create, connect, mediate, and manage services along with their interactions.
By the end of this training, participants will be able to:
- Integrate, enhance, and deliver ESB technologies as unified packages across various deployment environments.
- Understand and effectively utilise Talend Open Studio's most commonly used components.
- Integrate any application, database, API, or web service.
- Seamlessly connect heterogeneous systems and applications.
- Embed existing Java code libraries to extend project capabilities.
- Leverage community-driven components and code to expand project functionality.
- Rapidly integrate systems, applications, and data sources within a drag-and-drop Eclipse environment.
- Reduce development time and maintenance costs by generating optimised, reusable code.