Data Profiling

Data Profiling
Author :
Publisher : Springer Nature
Total Pages : 136
Release :
ISBN-10 : 9783031018657
ISBN-13 : 3031018656
Rating : 4/5 (656 Downloads)

Book Synopsis Data Profiling by : Ziawasch Abedjan

Download or read book Data Profiling written by Ziawasch Abedjan and published by Springer Nature. This book was released on 2022-06-01 with total page 136 pages. Available in PDF, EPUB and Kindle. Book excerpt: Data profiling refers to the activity of collecting data about data, {i.e.}, metadata. Most IT professionals and researchers who work with data have engaged in data profiling, at least informally, to understand and explore an unfamiliar dataset or to determine whether a new dataset is appropriate for a particular task at hand. Data profiling results are also important in a variety of other situations, including query optimization, data integration, and data cleaning. Simple metadata are statistics, such as the number of rows and columns, schema and datatype information, the number of distinct values, statistical value distributions, and the number of null or empty values in each column. More complex types of metadata are statements about multiple columns and their correlation, such as candidate keys, functional dependencies, and other types of dependencies. This book provides a classification of the various types of profilable metadata, discusses popular data profiling tasks, and surveys state-of-the-art profiling algorithms. While most of the book focuses on tasks and algorithms for relational data profiling, we also briefly discuss systems and techniques for profiling non-relational data such as graphs and text. We conclude with a discussion of data profiling challenges and directions for future work in this area.


Data Profiling Related Books

Data Profiling
Language: en
Pages: 136
Authors: Ziawasch Abedjan
Categories: Computers
Type: BOOK - Published: 2022-06-01 - Publisher: Springer Nature

DOWNLOAD EBOOK

Data profiling refers to the activity of collecting data about data, {i.e.}, metadata. Most IT professionals and researchers who work with data have engaged in
Principles of Data Wrangling
Language: en
Pages: 117
Authors: Tye Rattenbury
Categories: Computers
Type: BOOK - Published: 2017-06-29 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

A key task that any aspiring data-driven organization needs to learn is data wrangling, the process of converting raw data into something truly useful. This pra
Data Profiling and Insurance Law
Language: en
Pages: 312
Authors: Brendan McGurk
Categories: Law
Type: BOOK - Published: 2019-03-21 - Publisher: Bloomsbury Publishing

DOWNLOAD EBOOK

The winner of the 2020 British Insurance Law Association Book Prize, this timely, expertly written book looks at the legal impact that the use of 'Big Data' wil
Child Data Citizen
Language: en
Pages: 233
Authors: Veronica Barassi
Categories: Computers
Type: BOOK - Published: 2020-12-22 - Publisher: MIT Press

DOWNLOAD EBOOK

An examination of the datafication of family life--in particular, the construction of our children into data subjects. Our families are being turned into data,
Database Archiving
Language: en
Pages: 310
Authors: Jack E. Olson
Categories: Computers
Type: BOOK - Published: 2010-07-28 - Publisher: Morgan Kaufmann

DOWNLOAD EBOOK

With the amount of data a business accumulates now doubling every 12 to 18 months, IT professionals need to know how to develop a system for archiving important