Open-source distributed event streaming platform for high-performance data pipelines and streaming analytics
Apache-2.0
- Java
- Scala
- Python

About Apache Kafka
Apache Kafka is an open source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. Producers write streams of events to topics and consumers read them, in real time or replayed later.
Events are stored durably as an ordered, append-only log, partitioned and replicated across a cluster of brokers for scale and fault tolerance. On top of this, Kafka Streams offers a library for stateful stream processing, and Kafka Connect provides ready-made connectors to move data in and out of external systems.
Written in Java and Scala, Kafka runs as a self-hosted cluster and is governed by the Apache Software Foundation under the Apache License 2.0.
Key features
- Publish and subscribe to streams of events via topics
- Durable, replicated, partitioned event log storage
- Stateful stream processing with Kafka Streams
- Data integration through Kafka Connect connectors
- Horizontal scaling and fault tolerance across brokers
Details
- First released
- 2011
- Written in
- Java · Scala
- Deployment
- self-hostable · docker
- Stream processing
- Kafka Streams library
- Integration
- Kafka Connect connectors
- Governance
- Apache Software Foundation
