Data Engineering with Python (Packt)
Monday, 09 November 2020

This book looks at how to tackle challenges commonly faced in different aspects of data engineering. Paul Crickard starts with an introduction to the basics of data engineering, along with the technologies and frameworks required to build data pipelines to work with large datasets. He then looks at how to transform and clean data and perform analytics to get the most out of data.The book also covers how to work with big data of varying complexity and production databases, and build data pipelines.

<ASIN:183921418X>

 

Author: Paul Crickard
Publisher: Packt
Date: October 2020
Pages: 356
ISBN: 978-1839214189
Print: 183921418X
Kindle: B08DSLVFNR
Audience: Python developers
Level: Intermediate
Category: Python

dataeng

 

 

  • Become well-versed in data architectures, data preparation, and data optimization skills with the help of practical examples
  • Design data models and learn how to extract, transform, and load (ETL) data using Python
  • Schedule, automate, and monitor complex data pipelines in production
  • Understand how data engineering supports data science workflows
  • Discover how to extract data from files and databases and then clean, transform, and enrich it
  • Configure processors for handling different file formats as well as both relational and NoSQL databases
  • Find out how to implement a data pipeline and dashboard to visualize results
  • Use staging and validation to check data before landing in the warehouse
  • Build real-time pipelines with staging areas that perform validation and handle failures
  • Get to grips with deploying pipelines in the production environment

For recommendations of Python books see Books for Pythonistas and Python Books For Beginners in our Programmer's Bookshelf section.

For more Book Watch just click.

Book Watch is I Programmer's listing of new books and is compiled using publishers' publicity material. It is not to be read as a review where we provide an independent assessment. Some, but by no means all, of the books in Book Watch are eventually reviewed.

To have new titles included in Book Watch contact  BookWatch@i-programmer.info

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

 

 

Banner
 


High Performance SQL Server, 2nd Ed (Apress)

Author: Benjamin Nevarez
Publisher: Apress
Pages: 420
ISBN: 978-1484264904
Print: 1484264908
Kindle: B08TQR3NMF
Audience: SQL Server DBAs and Devs
Rating: 4.5
Reviewer: Ian Stirk

This book aims to improve the performance of your SQL Server, how does it fare?



Pro SQL Server 2019 Administration

Author: Peter Carter
Publisher: Apress
Pages: 940
ISBN: 978-1484250884
Print: 1484250885
Kindle: B07ZC1XC9Z
Audience: SQL Server DBAs
Rating: 5
Reviewer: Kay Ewbank

Administering SQL Server can seem like a dark art; this book aims to make it more transparent.


More Reviews