Principles of Data Wrangling (O'Reilly)
Monday, 14 August 2017

This practical guide covers data wrangling, the process of converting raw data into something truly useful. Authors Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel, and Connor Carreras give business analysts an overview of data wrangling techniques and tools, and put the practice into context by asking, "What are you trying to do and why?"

Wrangling data consumes roughly 50-80% of an analyst's time before any kind of analysis is possible. Written by executives at Trifacta, which has a platform for exploring and preparing data for analysis, the book explores several factors: time, granularity, scope, and structure.

Authors: Tye Rattenbury, Joe Hellerstein, Jeffrey Heer, Sean Kandel, and Connor Carreras
Publisher: O'Reilly
Date: July 2017
Pages: 94
ISBN: 978-1491938928
Print: 1491938927
Kindle: B073HMH8XG
Audience: Data managers
Level: Introductory
Category: Data Science

The book aims to help readers:

  • Understand what kind of data is available
  • Choose which data to use and at what level of detail
  • Meaningfully combine multiple sources of data
  • Decide how to distill the results to a size and shape that can drive downstream analysis

For recommendations of Big Data books see Reading Your Way Into Big Data in our Programmer's Bookshelf section.

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

To have new titles included in Book Watch, contact BookWatch@i-programmer.info

Modern Fortran

Author: Milan Curcic
Publisher: Manning
Date: November 2020
Pages: 416
ISBN: 978-1617295287
Print: 1617295280
Audience: Fortran programmers
Rating: 5
Reviewer: Mike James
Not your parents' Fortran?



The Async-First Playbook

Author: Sumeet Gayathri Moghe
Publisher: Addison-Wesley
Pages: 368
ISBN: 978-0138187538
Print: 0138187533
Kindle: B0CCTZHB9N
Audience: Agile developers
Rating: 4
Reviewer: Kay Ewbank

The driver behind this book was the pandemic and the need to find ways to make remote working effective for teams. So do [ ... ]

