Data Engineering with Python [electronic resource] : Work with Massive Datasets to Design Data Models and Automate Data Pipelines Using Python

By:

Crickard, Paul

Contributor(s):

Ohio Library and Information Network

Material type: Text

text

Media type:

computer

Carrier type:

online resource

ISBN:

1839212306

Subject(s):

Genre/Form:

DDC classification:

005.7565 23

LOC classification:

QA76.9.D3

Online resources:

O'Reilly Connect to resource

Contents:

What is data engineering? -- Building our data engineering infrastructure --Reading and writing files -- Working with databases - Cleaning, transforming, and enriching data --Building a 311 data pipeline -- Features of a production pipeline -- Version control with the NiFi registry --Monitoring data pipelines -- Deploying data pipelines -- Building a production data pipeline -- Buildings Kafka cluster -- Streaming data with Apache Kafka -- Data processing with Apache Spark -- Real-time edge data with MiNiFi, Kafka, and Spark

Summary: This book is a comprehensive introduction to building data pipelines, that will have you moving and transforming data in no time. You'll learn how to build data pipelines, transform and clean data, and deliver it to provide value to users. You will learn to deploy production data pipelines that include logging, monitoring, and version control

Tags from this library: No tags from this library for this title. Log in to add tags.

Average rating: 0.0 (0 votes)

Holdings
Item type	Current library	Collection	Call number	Status	Notes	Date due	Barcode	Item holds
Books	Junaid Zaidi Library, COMSATS University Islamabad Ground Floor	Books	006.312 CRI-D 62551 (Browse shelf(Opens below))	Available	Paperback.		10001000062551

Total holds: 0

Description based upon print version of record

Available to OhioLINK libraries

This book is a comprehensive introduction to building data pipelines, that will have you moving and transforming data in no time. You'll learn how to build data pipelines, transform and clean data, and deliver it to provide value to users. You will learn to deploy production data pipelines that include logging, monitoring, and version control

There are no comments on this title.

to post a comment.