Spark Tutorials

Apache Spark is an open-source distributed computing framework that processes data in memory, spilling to disk when needed, and supports both batch and near-real-time analytics. Its core abstraction is the Resilient Distributed Dataset (RDD). Spark includes a streaming library and a rich set of programming interfaces that make data processing and transformation easier.

This page guides you through the topics you need to learn Spark and its related technologies.

Basic Concepts

Advanced Topics