This book shows how to solve real data science problems with Hadoop and Spark. Authors Ofer Mendelevitch, Casey Stella, and Douglas Eadline draw on their experience with Hadoop and big data to bring together everything you need: high-level concepts, deep-dive techniques, real-world use cases, practical applications, and hands-on tutorials.
<ASIN:0134024141>
The essentials of data science and the modern Hadoop ecosystem are introduced, along with guidance on data ingestion, data cleansing, and visualization.
The book then moves on to focus on specific applications, including machine learning, predictive modeling for sentiment analysis, clustering for document analysis, anomaly detection, and natural language processing (NLP).
Author: Ofer Mendelevitch, Casey Stella, and Douglas Eadline Publisher: Addison Wesley Date: December 2016 Pages: 256 ISBN: 978-0134024141 Print: 0134024141 Kindle: B01N7G1M8J Audience: Data mining developers Level: Intermediate Category: Data Science
Covers:
- What data science is, how it has evolved, and how to plan a data science career
- How data volume, variety, and velocity shape data science use cases
- Hadoop and its ecosystem, including HDFS, MapReduce, YARN, and Spark
- Data importation with Hive and Spark
- Data quality, preprocessing, preparation, and modeling
- Visualization: surfacing insights from huge data sets
- Machine learning: classification, regression, clustering, and anomaly detection
- Algorithms and Hadoop tools for predictive modeling
- Cluster analysis and similarity functions
- Large-scale anomaly detection
- NLP: applying data science to human language
Related Articles
Reading Your Way Into Big Data
What is a Data Scientist and How Do I Become One?
Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.
To have new titles included in Book Watch contact BookWatch@i-programmer.info
Grokking Machine Learning
Author: Luis G. Serrano Publisher: Manning Date: December 2021 Pages: 512 ISBN: 978-1617295911 Print: 1617295914 Kindle: B09LK7KBSL Audience: Python developers interested in machine learning Rating: 5 Reviewer: Mike James Another book on machine learning - surely we have enough by now?
|
Algorithms: Absolute Beginner's Guide
Author: Kirupa Chinnathambi Publisher: Addison-Wesley Date: November 2023 Pages: 416 ISBN: 978-0138222291 Print: 0138222290 Kindle: B0CCTZ37DQ Audience: General Rating: 4.5 Reviewer: Kay Ewbank
Subtitled 'a practical introduction to data structures and algorithms in JavaScript', this book is split into tw [ ... ]
| More Reviews |
|