This document introduces Apache Kafka, a fast, scalable, and fault-tolerant publish-subscribe messaging system used for building data pipelines. It gives a high-level overview of Kafka and covers its use cases, partition distribution, the replication protocol, and basic operational commands. It also points to resources for further reading on Kafka's features.