This book looks at how to tackle challenges commonly faced in different aspects of data engineering. Paul Crickard starts with an introduction to the basics of data engineering, along with the technologies and frameworks required to build data pipelines to work with large datasets. He then looks at how to transform and clean data and perform analytics to get the most out of data. The book also covers how to work with big data of varying complexity and production databases, and how to build data pipelines.
Author: Paul Crickard Publisher: Packt Date: October 2020 Pages: 356 ISBN: 978-1839214189 Print: 183921418X Kindle: B08DSLVFNR Audience: Python developers Level: Intermediate Category: Python
- Become well-versed in data architectures, data preparation, and data optimization skills with the help of practical examples
- Design data models and learn how to extract, transform, and load (ETL) data using Python
- Schedule, automate, and monitor complex data pipelines in production
- Understand how data engineering supports data science workflows
- Discover how to extract data from files and databases and then clean, transform, and enrich it
- Configure processors for handling different file formats as well as both relational and NoSQL databases
- Find out how to implement a data pipeline and dashboard to visualize results
- Use staging and validation to check data before landing in the warehouse
- Build real-time pipelines with staging areas that perform validation and handle failures
- Get to grips with deploying pipelines in the production environment
For recommendations of Python books see Books for Pythonistas and Python Books For Beginners in our Programmer's Bookshelf section.
Book Watch is I Programmer's listing of new books and is compiled using publishers' publicity material. It is not to be read as a review where we provide an independent assessment. Some, but by no means all, of the books in Book Watch are eventually reviewed.
To have new titles included in Book Watch contact BookWatch@i-programmer.info
Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.