Crickard, Paul

Data Engineering with Python [electronic resource] : Work with Massive Datasets to Design Data Models and Automate Data Pipelines Using Python. - Birmingham : Packt Publishing, Limited, ©2020. - xii, 357 pages : illustrations ; 23 cm

Description based upon print version of record

What is data engineering? -- Building our data engineering infrastructure -- Reading and writing files -- Working with databases -- Cleaning, transforming, and enriching data -- Building a 311 data pipeline -- Features of a production pipeline -- Version control with the NiFi registry -- Monitoring data pipelines -- Deploying data pipelines -- Building a production data pipeline -- Building a Kafka cluster -- Streaming data with Apache Kafka -- Data processing with Apache Spark -- Real-time edge data with MiNiFi, Kafka, and Spark

Available to OhioLINK libraries

This book is a comprehensive introduction to building data pipelines that will have you moving and transforming data in no time. You'll learn how to build data pipelines, transform and clean data, and deliver it to provide value to users. You will learn to deploy production data pipelines that include logging, monitoring, and version control.

1839212306

9781839212307 Packt Publishing 8ED877EE-D46C-4D62-A51A-55817E09DE5A OverDrive, Inc. http://www.overdrive.com

GBC0C6754 bnb


Database management
Python (Computer program language)


Electronic books

QA76.9.D3

005.7565