Date: 21 September 2021 – 23 September 2021
Duration: 3 Days
Locations: Online


Take your knowledge to the next level with Cloudera’s NiFi Training

Students who successfully complete this course will be able to:

• Describe Cloudera Flow Management in the context of the Cloudera DataFlow Data-in-Motion Platform

• Explain how NiFi and MiNiFi fit into the Cloudera Edge to AI paradigm

• Describe NiFi Architecture, including standalone and clustered configurations

• Describe the key features, concepts, and benefits of NiFi

• Explain how FlowFiles, processors, process groups, controllers, and connections work together to define a NiFi dataflow

• Navigate the NiFi User Interface, configure dataflows, and use dataflow information

• Trace the life of data (its origin, transformation, and destination) using data provenance

• Organize and simplify dataflows

• Manage dataflow versions using the NiFi Registry

• Use the NiFi Expression Language to control dataflows

• Describe dataflow optimization methods and available monitoring and reporting features

• Connect dataflows with other systems, such as Kafka, HDFS, and HBase

• Describe aspects of NiFi security

Prerequisites

This course is designed for Developers, Data Engineers, Data Scientists, and Data Stewards. It provides a no-code, graphical approach to configuring real-time data streaming, ingestion, and management solutions for a variety of use cases.

Though programming experience is not required, basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful.

Outline

Introduction to Apache NiFi

  • Overview of Cloudera Flow Management and NiFi
  • The NiFi User Interface

Processors

  • Overview of Processors
  • Processor Surface Panel
  • Processor Configuration

Connections

  • Overview of Connections
  • Connection Configuration
  • Connector Context Menu

Dataflows

  • Command and Control of a Dataflow
  • Processor Relationships
  • Back Pressure
  • Prioritizers
  • Labels

Process Groups

  • Anatomy of a Process Group
  • Input and Output Ports

FlowFile Provenance

  • Data Provenance Events
  • FlowFile Lineage
  • Replaying a FlowFile

Dataflow Templates

  • Templates Overview
  • Managing Templates

Apache NiFi Registry

  • Apache NiFi Registry Overview
  • Using the Registry

FlowFile Attributes

  • FlowFile Attribute Overview
  • Routing on Attributes

NiFi Expression Language

  • NiFi Expression Language Overview
  • Syntax
  • Expression Language Editor
  • Setting Conditional Values

Dataflow Optimization

  • Dataflow Optimization
  • Control Rate
  • Managing Compute

NiFi Architecture

  • NiFi Architecture Overview
  • Cluster Architecture
  • Heartbeats
  • Managing Clusters

Site-to-Site Dataflows

  • Site-to-Site Theory
  • Site-to-Site Architecture
  • Anatomy of a Remote Process Group
  • Adding and Configuring Remote Process Groups

Cloudera Edge Management and MiNiFi

  • Overview of MiNiFi
  • Example Walk-through

Monitoring and Reporting

  • Monitoring from NiFi
  • Overview of Reporting
  • Examples of Common Reporting Tasks

Controller Services

  • Controller Services Overview
  • Common Controller Services

Integrating NiFi with the Cloudera Ecosystem

  • NiFi Integration Architecture
  • NiFi Ecosystem Processors
  • A Closer Look at NiFi and Apache Hive
  • A Closer Look at NiFi and Apache Kafka

NiFi Security

  • NiFi Security Overview
  • Securing Access to the NiFi UI
  • Authentication
  • The Importance of Kerberos
  • NiFi Registry Security
  • NiFi Security Summary
