Spark Streaming is a framework for scalable, high-throughput, fault-tolerant stream processing of live data streams. It provides simple APIs that allow complex algorithms to be implemented. Spark Streaming integrates with Spark's batch and interactive processing, can process streams from sources like Kafka at low (second-scale) latencies, and scales to process large volumes of data across many nodes in a fault-tolerant manner. It addresses requirements for frameworks processing live streaming data at large scales with low latency.