The Spark Python API (PySpark) exposes the Spark programming model to Python. Apache® Spark™ is an open-source framework and one of the most popular Big Data frameworks for scaling up tasks across a cluster. It was developed to use distributed, in-memory data structures to improve data processing speeds.
The goal of this lesson is to teach novice programmers to write Python code using the map-reduce programming model.
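To give a first feel for the map-reduce idea, here is a minimal sketch in plain Python, without Spark. The word list is made up for illustration. The map step transforms each element on its own, and the reduce step combines the results into a single value.

```python
from functools import reduce

# Hypothetical input data, chosen only for illustration.
words = ["spark", "python", "map", "reduce"]

# Map step: transform each element independently (word -> its length).
lengths = list(map(len, words))

# Reduce step: combine the mapped values pairwise into one result.
total = reduce(lambda a, b: a + b, lengths)

print(lengths)
print(total)
```

Because each element is mapped independently, the map step can be split across many machines, which is the property Spark relies on when it distributes work over a cluster.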