This blog describes the integration between Kafka and Spark. Apache Kafka is a pub-sub solution; where producer publishes data to a topic and a consumer subscribes to that topic to receive the data.  It is used for building real-time data pipelines and streaming apps. It is horizontally scalable, fault-tolerant,...