Spark SQL: Relational Data Processing in Spark

Spark SQL is successor of Shark, SQL engine built on top of Spark. Spark SQL makes two contributions: (1) DataFrame (2) Catalyst.

DataFrame

Catalyst

Discussion

References