Availability: Open Source
Written in : Java, Scala
It is an open source near real time, an asynchronous computational framework for stream processing
Kafka, YARN, Samza API fits together to form Samza
It is made up of three layers:
1. A streaming layer.
2. An execution layer.
3. A processing layer.
It supports all three layers in terms of the out of the box support
1. Streaming: Kafka
2. Execution: YARN
3. Processing: Samza API
Simple API
Managed State
Fault tolerance
Durability, Scalability
Pluggable
Process isolation