This book looks at how to tackle challenges commonly faced in different aspects of data engineering. Paul Crickard starts with an introduction to the basics of data engineering, along with the technologies and frameworks required to build data pipelines to work with large datasets. He then looks at how to transform and clean data and perform analytics to get the most out of data.The book also covers how to work with big data of varying complexity and production databases, and build data pipelines.
<ASIN:183921418X>
Author: Paul Crickard Publisher: Packt Date: October 2020 Pages: 356 ISBN: 978-1839214189 Print: 183921418X Kindle: B08DSLVFNR Audience: Python developers Level: Intermediate Category: Python
- Become well-versed in data architectures, data preparation, and data optimization skills with the help of practical examples
- Design data models and learn how to extract, transform, and load (ETL) data using Python
- Schedule, automate, and monitor complex data pipelines in production
- Understand how data engineering supports data science workflows
- Discover how to extract data from files and databases and then clean, transform, and enrich it
- Configure processors for handling different file formats as well as both relational and NoSQL databases
- Find out how to implement a data pipeline and dashboard to visualize results
- Use staging and validation to check data before landing in the warehouse
- Build real-time pipelines with staging areas that perform validation and handle failures
- Get to grips with deploying pipelines in the production environment
For recommendations of Python books see Books for Pythonistas and Python Books For Beginners in our Programmer's Bookshelf section.
For more Book Watch just click.
Book Watch is I Programmer's listing of new books and is compiled using publishers' publicity material. It is not to be read as a review where we provide an independent assessment. Some, but by no means all, of the books in Book Watch are eventually reviewed.
To have new titles included in Book Watch contact BookWatch@i-programmer.info
Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.
Code: The Hidden Language of Computer Hardware and Software 2nd Ed
Top Book 2023 Author: Charles Petzold Publisher: Microsoft Press Date: August 2022 Pages: 480 ISBN: 978-0137909100 Print: 0137909101 Kindle: B0B123P5GV Audience: General Rating: 5 Reviewer: Mike James Code! We all need to know about it.
|
Object-Oriented Python
Author: Irv Kalb Publisher: No Starch Press Date: January 2022 Pages: 416 ISBN: 978-1718502062 Print: 1718502060 Kindle: B0957SHYQL Audience: Python developers Rating: 3 Reviewer: Mike James Python, Object-Oriented? Not a lot of programmers know that!
| More Reviews |
|