High Performance Spark: Best practices for scaling and optimizing Apache Spark

If you’ve successfully used Apache Spark to solve medium sized-problems, but still struggle to realize the "Spark promise" of unparalleled performance on big data, this book is for you. High Performance Spark shows you how take advantage of Spark at scale, so you can grow beyond the novice-level. It’s ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications.Learn how to make Spark jobs run fasterProductionize exploratory data science with SparkHandle even larger data sets with SparkReduce pipeline running times for faster insights

Author: Holden Karau

Learn more

Deals