Delta Lake: Up and Running

Author : Bennie Haelen
Publisher : "O'Reilly Media, Inc."
Total Pages : 267
Release : 2023-10-16
ISBN-10 : 1098139690
ISBN-13 : 9781098139698
Rating : 4/5 (690 Downloads)

Book Synopsis: Delta Lake: Up and Running by Bennie Haelen

Delta Lake: Up and Running was written by Bennie Haelen and published by "O'Reilly Media, Inc.". This book was released on 2023-10-16 and has 267 pages. Available in PDF, EPUB and Kindle.

Book excerpt: With the surge in big data and AI, organizations can rapidly create data products. However, the effectiveness of their analytics and machine learning models depends on the data's quality. Delta Lake's open source format offers a robust lakehouse framework over platforms like Amazon S3, ADLS, and GCS. This practical book shows data engineers, data scientists, and data analysts how to get Delta Lake and its features up and running.

The ultimate goal of building data pipelines and applications is to gain insights from data. You'll understand how your storage solution choice determines the robustness and performance of the data pipeline, from raw data to insights.

You'll learn how to:
- Use modern data management and data engineering techniques
- Understand how ACID transactions bring reliability to data lakes at scale
- Run streaming and batch jobs against your data lake concurrently
- Execute update, delete, and merge commands against your data lake
- Use time travel to roll back and examine previous data versions
- Build a streaming data quality pipeline following the medallion architecture


Delta Lake: Up and Running Related Books

Delta Lake: Up and Running
Language: en
Pages: 267
Authors: Bennie Haelen
Categories: Computers
Type: BOOK - Published: 2023-10-16 - Publisher: "O'Reilly Media, Inc."

With the surge in big data and AI, organizations can rapidly create data products. However, the effectiveness of their analytics and machine learning models dep
Data Engineering with Apache Spark, Delta Lake, and Lakehouse
Language: en
Pages: 480
Authors: Manoj Kukreja
Categories: Computers
Type: BOOK - Published: 2021-10-22 - Publisher: Packt Publishing Ltd

Understand the complexities of modern-day data engineering platforms and explore strategies to deal with them with the help of use case scenarios led by an indu
Data Lakehouse in Action
Language: en
Pages: 206
Authors: Pradeep Menon
Categories: Computers
Type: BOOK - Published: 2022-03-17 - Publisher: Packt Publishing Ltd

Propose a new scalable data architecture paradigm, Data Lakehouse, that addresses the limitations of current data architecture patterns. Key Features: Understand h
Trino: The Definitive Guide
Language: en
Pages: 310
Authors: Matt Fuller
Categories: Computers
Type: BOOK - Published: 2021-04-14 - Publisher: "O'Reilly Media, Inc."

Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. With this practical guide, you'
Learning Spark
Language: en
Pages: 400
Authors: Jules S. Damji
Categories: Computers
Type: BOOK - Published: 2020-07-16 - Publisher: O'Reilly Media

Data is bigger, arrives faster, and comes in a variety of formats—and it all needs to be processed at scale for analytics or machine learning. But how can you