Apache Kafka logo

Apache Kafka

Open-source distributed event streaming platform for high-performance data pipelines and streaming analytics

Open Source Alternative to
Repository activity
  • Stars32.8k
  • Forks15.3k
  • Open Issues420
apache-kafka health score - Linux Foundation Insights
License

Apache-2.0

Languages
  • Java
  • Scala
  • Python
Apache Kafka screenshot

About Apache Kafka

Apache Kafka is an open source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. Producers write streams of events to topics and consumers read them, in real time or replayed later.

Events are stored durably as an ordered, append-only log, partitioned and replicated across a cluster of brokers for scale and fault tolerance. On top of this, Kafka Streams offers a library for stateful stream processing, and Kafka Connect provides ready-made connectors to move data in and out of external systems.

Written in Java and Scala, Kafka runs as a self-hosted cluster and is governed by the Apache Software Foundation under the Apache License 2.0.

Key features

  • Publish and subscribe to streams of events via topics
  • Durable, replicated, partitioned event log storage
  • Stateful stream processing with Kafka Streams
  • Data integration through Kafka Connect connectors
  • Horizontal scaling and fault tolerance across brokers

Details

First released
2011
Written in
Java · Scala
Deployment
self-hostable · docker
Stream processing
Kafka Streams library
Integration
Kafka Connect connectors
Governance
Apache Software Foundation