Data Analytics with Spark Using Python (Addison-Wesley)
Thursday, 02 August 2018

This book combines a language-agnostic introduction to foundational Spark concepts with extensive programming examples using the PySpark development environment. Author Jeffrey Aven covers all aspects of Spark development, from basic programming to Spark SQL, SparkR, Spark Streaming, messaging, NoSQL, and Hadoop integration. He also covers managing all forms of data with Spark: streaming, structured, semi-structured, and unstructured. Concise topic overviews and extensive hands-on exercises prepare you to solve real problems.

Author: Jeffrey Aven
Publisher: Addison-Wesley
Date: June 2018
Pages: 320
ISBN: 978-0134846019
Print: 013484601X
Kindle: B07D3BP8C8
Audience: Python developers wanting to learn Spark
Level: Intermediate
Category: Python and Data Science

  •  Understand Spark’s evolving role in the Big Data and Hadoop ecosystems
  •  Create Spark clusters using various deployment modes
  •  Control and optimize the operation of Spark clusters and applications
  •  Master Spark Core RDD API programming techniques
  •  Extend, accelerate, and optimize Spark routines with advanced API platform constructs, including shared variables, RDD storage, and partitioning
  •  Efficiently integrate Spark with both SQL and non-relational data stores
  •  Perform stream processing and messaging with Spark Streaming and Apache Kafka
  •  Implement predictive modeling with SparkR and Spark MLlib

For recommendations of Python books, see Books for Pythonistas and Python Books For Beginners in our Programmer's Bookshelf section.

For recommendations of Big Data books, see Reading Your Way Into Big Data in our Programmer's Bookshelf section.

 

Book Watch is I Programmer's listing of new books and is compiled using publishers' publicity material. It is not to be read as a review where we provide an independent assessment. Some, but by no means all, of the books in Book Watch are eventually reviewed.

To have new titles included in Book Watch, contact BookWatch@i-programmer.info.

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

 

 
