Take your knowledge to the next level with Cloudera’s NiFi Training
Students who successfully complete this course will be able to:
• Describe Cloudera Flow Management in the context of the Cloudera DataFlow Data-in-Motion Platform
• Explain how NiFi and MiNiFi fit into the Cloudera Edge to AI paradigm
• Describe NiFi Architecture, including standalone and clustered configurations
• Describe the key features, concepts, and benefits of NiFi
• Explain how FlowFiles, processors, process groups, controllers, and connections work together to define a NiFi dataflow
• Navigate the NiFi User Interface, configure dataflows, and use dataflow information
• Trace the life of data (its origin, transformation, and destination) using data provenance
• Organize and simplify dataflows
• Manage dataflow versions using the NiFi Registry
• Use the NiFi Expression Language to control dataflows
• Describe dataflow optimization methods and available monitoring and reporting features
• Connect dataflows with other systems, such as Kafka, HDFS, and HBase
• Describe aspects of NiFi security
Audience and Prerequisites
This course is designed for Developers, Data Engineers, Data Scientists, and Data Stewards. It provides a no-code, graphical approach to configuring real-time data streaming, ingestion, and management solutions for a variety of use cases.
Though programming experience is not required, basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful.
Outline
Introduction to Apache NiFi
- Overview of Cloudera Flow Management and NiFi
- The NiFi User Interface
Processors
- Overview of Processors
- Processor Surface Panel
- Processor Configuration
Connections
- Overview of Connections
- Connection Configuration
- Connector Context Menu
Dataflows
- Command and Control of a Dataflow
- Processor Relationships
- Back Pressure
- Prioritizers
- Labels
Process Groups
- Anatomy of a Process Group
- Input and Output Ports
FlowFile Provenance
- Data Provenance Events
- FlowFile Lineage
- Replaying a FlowFile
Dataflow Templates
- Templates Overview
- Managing Templates
Apache NiFi Registry
- Apache NiFi Registry Overview
- Using the Registry
FlowFile Attributes
- FlowFile Attribute Overview
- Routing on Attributes
NiFi Expression Language
- NiFi Expression Language Overview
- Syntax
- Expression Language Editor
- Setting Conditional Values
Dataflow Optimization
- Dataflow Optimization
- Control Rate
- Managing Compute
NiFi Architecture
- NiFi Architecture Overview
- Cluster Architecture
- Heartbeats
- Managing Clusters
Site-to-Site Dataflows
- Site-to-Site Theory
- Site-to-Site Architecture
- Anatomy of a Remote Process Group
- Adding and Configuring Remote Process Groups
Cloudera Edge Management and MiNiFi
- Overview of MiNiFi
- Example Walk-through
Monitoring and Reporting
- Monitoring from NiFi
- Overview of Reporting
- Examples of Common Reporting Tasks
Controller Services
- Controller Services Overview
- Common Controller Services
Integrating NiFi with the Cloudera Ecosystem
- NiFi Integration Architecture
- NiFi Ecosystem Processors
- A Closer Look at NiFi and Apache Hive
- A Closer Look at NiFi and Apache Kafka
NiFi Security
- NiFi Security Overview
- Securing Access to the NiFi UI
- Authentication
- The Importance of Kerberos
- NiFi Registry Security
- NiFi Security Summary