Mastering Large Datasets with Python (Manning) |
Friday, 31 January 2020 | |||
In this book, subtitled "Parallelize and Distribute Your Python Code", author J.T. Wolohan shows how to take a small project and scale it up using a functionally influenced approach to Python coding. The book explores methods and built-in Python tools that lend themselves to clarity and scalability, like the high-performing parallelism method, as well as distributed technologies that allow for high data throughput. The book explores tools like Hadoop and PySpark to efficiently process massive distributed datasets, speeding up decision-making with machine learning, and simplifying data storage with AWS S3 <ASIN:1617296236> . Author: J.T. Wolohan
Topics include:
For recommendations of Python books see Books for Pythonistas and Python Books For Beginners in our Programmer's Bookshelf section. For more Book Watch just click. Book Watch is I Programmer's listing of new books and is compiled using publishers' publicity material. It is not to be read as a review where we provide an independent assessment. Some, but by no means all, of the books in Book Watch are eventually reviewed. To have new titles included in Book Watch contact BookWatch@i-programmer.info Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.
|