This book shows how to use the open source Pandas library with Python to automate and perform data analysis tasks. Author Daniel Chen introduces key concepts through simple but practical examples, incrementally building on them to solve more difficult, real-world problems. Using a realistic dataset, the book covers combining datasets, handling missing data, and structuring datasets for easier analysis and visualization.
<ASIN:0134546938>
Data cleaning techniques, from basic string manipulation to applying functions simultaneously across dataframes, are also covered. The book also shows how to fit models for prediction, clustering, inference, and exploration.
Author: Daniel Chen Publisher: Addison-Wesley Date: Jan 2018 Pages: 416 ISBN: 978-0134546933 Print: 0134546938 Kindle: B0789WKTKJ Audience: Python developers Level: intermediate Category: Data Science
- Work with DataFrames and Series, and import or export data
- Create plots with matplotlib, seaborn, and pandas
- Combine datasets and handle missing data
- Reshape, tidy, and clean datasets so they’re easier to work with
- Convert data types and manipulate text strings
- Apply functions to scale data manipulations
- Aggregate, transform, and filter large datasets with groupby
- Use Pandas’ advanced date and time capabilities
- Fit linear models using statsmodels and scikit-learn libraries
- Use generalized linear modeling to fit models with different response variables
- Compare multiple models to select the “best”
- Regularize to overcome overfitting and improve performance
- Use clustering in unsupervised machine learning
For recommendations of Python books see Books for Pythonistas and Python Books For Beginners in our Programmer's Bookshelf section.
For more Book Watch just click.
Book Watch is I Programmer's listing of new books and is compiled using publishers' publicity material. It is not to be read as a review where we provide an independent assessment. Some, but by no means all, of the books in Book Watch are eventually reviewed.
To have new titles included in Book Watch contact BookWatch@i-programmer.info
Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.
SQL Server 2022 Query Performance Tuning (Apress)
Author: Grant Fritchey Publisher: Apress Pages: 745 ISBN:978-1484288900 Print:1484288904 Kindle:B0BLYD98SQ Audience: DBAs & SQL Devs Rating: 4.7 Reviewer: Ian Stirk
A popular performance tuning book gets updated for SQL Server 2022, how does it fare?
|
PostgresSQL 14 Administration Cookbook
Author: Simon Riggs and Gianno Ciolli Publisher: Packt Publishing Pages: 608 ISBN: 978-1803248974 Print:1803248971 Kindle: B09R4VBHX3 Audience: PostgresSQL developers and administrators Rating: 4.5 Reviewer: Kay Ewbank
While this book describes itself as a cookbook, the recipes in it work through the nec [ ... ]
| More Reviews |
|