Apache Kafka

Kafka Exercises

|Name|Topic|Objective & Instructions|Solution|Comments|
|----|-----|-------------------------|--------|--------|

Kafka Self Assessment

<a name="questions-kafka-101"></a>

Kafka 101

<details> <summary>What is Kafka?</summary> <b>

kafka.apache.org: "Apache Kafka is an open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications."

In other words, Kafka is a distributed log: applications append events to it, and other services read and process those events at high scale and in real time. </b></details>
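The "distributed log" idea in the answer above can be pictured with a small in-memory model. This is only a sketch for intuition, not Kafka's API: the `ToyLog` class, its methods, and the sample order events are all hypothetical, and the model ignores distribution, persistence, and replication. It shows the two properties the answer mentions: events are appended in order, and several services can read the same events independently because reading does not remove them.

```python
from dataclasses import dataclass, field


@dataclass
class ToyLog:
    """Append-only event log (hypothetical toy model, not a Kafka client)."""
    events: list = field(default_factory=list)

    def append(self, event):
        """Store an event at the end of the log and return its offset."""
        self.events.append(event)
        return len(self.events) - 1

    def read(self, offset, max_events=10):
        """Return up to max_events events starting at offset, without removing them."""
        return self.events[offset:offset + max_events]


log = ToyLog()
for order_event in ("order-1 created", "order-1 paid", "order-2 created"):
    log.append(order_event)

# Two independent services read the same log, each from its own position.
billing_batch = log.read(0)
shipping_batch = log.read(2)
```

Each reader keeps its own offset, so a slow service does not hold back a fast one, and a new service can start from offset 0 and replay everything stored so far.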

<details> <summary>What is Kafka used for?</summary> <b>
  • Real-time e-commerce
  • Banking
  • Health Care
  • Automotive (traffic alerts, hazard alerts, ...)
  • Real-time Fraud Detection </b></details>
<details> <summary>What is a "Producer" in Kafka?</summary> <b>

An application that publishes data to the Kafka cluster. </b></details>

<a name="questions-kafka-architecture"></a>

Kafka Architecture

<details> <summary>What's in a Kafka cluster?</summary> <b>
  • Broker: a server running the Kafka process, with its own local storage. A single Kafka cluster usually contains multiple brokers. </b></details>
<details> <summary>What is the role of ZooKeeper in Kafka?</summary> <b> In ZooKeeper-based Kafka deployments, ZooKeeper is a centralized coordination service that manages cluster metadata. ZooKeeper also: <ul> <li>Tracks which brokers are part of the Kafka cluster</li> <li>Keeps track of which broker is the leader of a given topic partition</li> <li>Takes part in leader elections (for example, electing the controller broker)</li> <li>Manages cluster membership of brokers</li> </ul> </b> </details>
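The coordination duties listed above (membership tracking, leader bookkeeping, re-election on failure) can be illustrated with a toy model. This is a sketch under loud assumptions: `ToyCoordinator` and all of its methods are hypothetical and are not ZooKeeper's or Kafka's API, and the "first live replica wins" rule is a simplification (real Kafka prefers in-sync replicas).

```python
class ToyCoordinator:
    """Hypothetical stand-in for the coordination role; not a real ZooKeeper client."""

    def __init__(self):
        self.live_brokers = set()   # cluster membership
        self.replicas = {}          # (topic, partition) -> list of broker ids
        self.leaders = {}           # (topic, partition) -> leader broker id or None

    def register_broker(self, broker_id):
        """Record that a broker has joined the cluster."""
        self.live_brokers.add(broker_id)

    def assign_partition(self, topic, partition, broker_ids):
        """Record which brokers hold a partition and choose its first leader."""
        self.replicas[(topic, partition)] = list(broker_ids)
        self._elect(topic, partition)

    def broker_failed(self, broker_id):
        """Drop a broker from membership and re-elect leaders it was holding."""
        self.live_brokers.discard(broker_id)
        for (topic, partition), leader in list(self.leaders.items()):
            if leader == broker_id:
                self._elect(topic, partition)

    def _elect(self, topic, partition):
        """Assumed rule: the first live replica becomes leader; None if none is alive."""
        candidates = [b for b in self.replicas[(topic, partition)]
                      if b in self.live_brokers]
        self.leaders[(topic, partition)] = candidates[0] if candidates else None


coordinator = ToyCoordinator()
for broker_id in (1, 2, 3):
    coordinator.register_broker(broker_id)

coordinator.assign_partition("orders", 0, [1, 2, 3])
leader_before = coordinator.leaders[("orders", 0)]

coordinator.broker_failed(1)
leader_after = coordinator.leaders[("orders", 0)]
```

In this run, broker 1 leads the partition until it fails, after which broker 2 takes over. The point of the sketch is only that membership and leadership live in one agreed-upon place, so every broker and client sees the same answer.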