Working with Apache Kafka (AA-TTDSKFKA2)


Course Description

Apache Kafka is a real-time data pipeline processor. It's scalable, fault tolerance, has high execution speeds with fluid integrations are some of the key hallmarks that make Kafka an integral part of many Enterprise Data architectures. In this lab intensive two day course, you will learn how to use Kafka to build streaming solutions.

Course Outline

Introduction to Streaming Systems

  • Fast data
  • Streaming architecture
  • Lambda architecture
  • Message queues
  • Streaming processors

Introduction to Kafka

  • Architecture
  • Comparing Kafka with other queue systems (JMS / MQ)
  • Kaka concepts : Messages, Topics, Partitions, Brokers, Producers, and commit logs
  • Kafka and Zookeeper
  • Producing messages
  • Consuming messages (Consumers and Consumer Groups)
  • Message retention
  • Scaling Kafka

Programming With Kafka

  • Configuration parameters
  • Producer API (Sending messages to Kafka)
  • Consumer API (consuming messages from Kafka)
  • Commits, Offsets, Seeking
  • Schema with Avro

Kafka Streams

  • Streams overview and architecture
  • Streams use cases and comparison with other platforms
  • Learning Kafka Streaming concepts (KStream, KTable, and KStore)
  • KStreaming operations (transformations, filters, joins, and aggregations)

Administering Kafka

  • Hardware/Software requirements
  • Deploying Kafka
  • Configuration of brokers/topics/partitions/producers/consumers
  • Security: How secure Kafka cluster, and secure client communications (SASL and Kerberos)
  • Monitoring: monitoring tools
  • Capacity Planning: estimating usage and demand
  • Trouble shooting: failure scenarios and recovery

Monitoring and Instrumenting Kafka

  • Monitoring Kafka
  • Instrumenting with Metrics library
  • Instrument Kafka applications and monitor their performance

Course Objectives

Join an engaging hands-on learning environment, where you'll explore:

  • Overview of Streaming technologies
  • Kafka concepts and architecture
  • Programming using Kafka API
  • Kafka Streams
  • Monitoring Kafka
  • Tuning/Troubleshooting Kafka

This course has a 50% hands-on labs to 50% lecture ratio with engaging instruction, demos, group discussions, labs, and project work.

Course Prerequisites

Before attending this course, you should:

  • Be comfortable with Java
  • Have experience working with databases
  • Able to navigate the Linux command line
  • Have basic knowledge of Linux editors (such as VI/nano) for editing code

 

Course Information

Length: 2 day

Format: Lecture and Lab

Delivery Method: n/a

Max. Capacity: 16



Schedule

Contact Us

UPCOMING COURSES
Date
Geography & Location
Days
Cost
CLC
GTR
Dec 05, 2024 - 2 day(s)
Dec 05, 2024
AMER
Remote-EST
AMER, Remote-EST
2
$1895 USD
$1895 USD

Do you have more questions? We're delighted to assist you!

1-877-797-2799
info@firefly.cloud

Who Should Attend

Experienced Java Developers with database experience.