In this post, we will be discussing Spark RDD and will be performing some basic operations on RDD. Let’s start our discussion with MapReduce and the challenges associated with it which led to the innovation of Spark. MapReduce greatly simplified “Big Data” analysis on large clusters, but its inefficiency to...