By adding real-time capabilities to Hadoop, Apache Spark is opening the world of big data to possibilities previously unheard of. Spark and Hadoop will empower companies of all sizes across all industries to convert streaming big data and sensor information into immediately actionable insights, enabling use cases such as personalized recommendations, predictive pricing, proactive patient care, and more.
In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark.
Download our complimentary book excerpt to read about:
- An introduction to Apache Spark on Hadoop
- An introduction to Data Analysis with Scala and Spark
- The Spark Programming Model
- The steps for getting started
- And a lot more!