Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data - Softcover

Emam, Khaled El ; Mosquera, Lucy ; Hoptroff, Richard

4.10 out of 5 stars

10 ratings by Goodreads

9781492072744: Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data

Softcover

ISBN 10: 1492072745 ISBN 13: 9781492072744

Publisher: O'Reilly Media, 2020

View all copies of this ISBN edition

5 Used

From US$ 36.07

21 New

From US$ 45.82

Building and testing machine learning models requires access to large and diverse data. But where can you find usable datasets without running into privacy issues? This practical book introduces techniques for generating synthetic data—fake data generated from real data—so you can perform secondary analysis to do research, understand customer behaviors, develop new products, or generate new revenue.

Data scientists will learn how synthetic data generation provides a way to make such data broadly available for secondary purposes while addressing many privacy concerns. Analysts will learn the principles and steps for generating synthetic data from real datasets. And business leaders will see how synthetic data can help accelerate time to a product or solution.

This book describes:

Steps for generating synthetic data using multivariate normal distributions
Methods for distribution fitting covering different goodness-of-fit metrics
How to replicate the simple structure of original data
An approach for modeling data structure to consider complex relationships
Multiple approaches and metrics you can use to assess data utility
How analysis performed on real data can be replicated with synthetic data
Privacy implications of synthetic data and methods to assess identity disclosure

"synopsis" may belong to another edition of this title.

About the Authors

Dr. Khaled El Emam is a senior scientist at the Children’s Hospital of Eastern Ontario (CHEO) Research Institute and Director of the multi-disciplinary Electronic Health Information Laboratory, conducting academic research on synthetic data generation methods, and re- identification risk measurement, and he is also a Professor in the Faculty of Medicine (Pediatrics) at the University of Ottawa.

He is the founder, CEO, and President of Privacy Analytics. Khaled has been performing data analysis since the early 90s, building statistical and machine learning models for prediction and evaluation. Since 2004 he has been developing technologies to facilitate the sharing of data for secondary analysis, from basic research on algorithms to applied solutions development that have been deployed globally. These technologies addressed problems in anonymization & pseudonymization, synthetic data, secure computation, and data watermarking. He has (co- )written multiple books on various privacy and software engineering topics. In 2003 and 2004, he was ranked as the top systems and software engineering scholar worldwide by the Journal of Systems and Software based on his research on measurement and quality evaluation and improvement. Previously, Khaled was a Senior Research Officer at the National Research Council of Canada. He also served as the head of the Quantitative Methods Group at the Fraunhofer Institute in Kaiserslautern, Germany. He held the Canada Research Chair in Electronic Health Information at the University of Ottawa from 2005 to 2015, and has a PhD from the Department of Electrical and Electronics Engineering, King’s College, at the University of London, England.

Lucy Mosquera has a bachelor's degree in Biology and Mathematics from Queen's University and is a current graduate student in the department of statistics at the University of British Columbia. During her time at Queen's, Lucy provided data management support on a dozen clinical trials and observational studies run through Kingston General Hospital's Clinical Evaluation Research Unit. Lucy has also worked on clinical trial data sharing methods based on homomorphic encryption and secret sharing protocols. At Replica Analytics, Lucy is responsible for developing statistical and machine learning models for data generation, and integrating subject area expertise in clinical trial data into synthetic data generation methods, as well as the statistical assessments of our synthetic data generation.

Dr. Richard Hoptroff is a long term technology inventor, investor and entrepreneur. Awarded a PhD in Physics by King’s College London for his work in optical computing and artificial intelligence, in 1992, together with Ravensbeck, he founded Right Information Systems, a neural network forecasting software company which was in 1997 sold to Cognos Inc (part of IBM). He then worked as a postdoc at the Research Laboratory for Archaeology and the History of Art at Oxford University and in 2001, created Flexipanel Ltd, a company supplying Bluetooth modules to the electronics industry.

In 2010, he founded the Hoptroff London, with the aim to develop smart, hyper-accurate watch movements and create a new watch brand. In 2013 he established a new commercial category when he brought to market the first commercial atomic timepiece and atomic wristwatch.

Hoptroff has now leveraged his expertise in timing technology and software to develop a hyper- accurate synchronised timestamping solution for the financial services sector, based on a unique combination of grandmaster atomic clock engineering and proprietary software.

"About this title" may belong to another edition of this title.

Publisher

O'Reilly Media

Publication date

2020

Language

English

ISBN 10

1492072745

ISBN 13

9781492072744

Binding

Paperback

Edition number

Number of pages

166

Rating

4.10 out of 5 stars

10 ratings by Goodreads

Buy Used

Condition: Very Good

Item in very good condition! Textbooks...

View this item

US$ 36.07

Free Shipping
Ships within U.S.A.

Add to basket

Buy New

View this item

US$ 45.82

US$ 2.64 shipping
Ships within U.S.A.

Add to basket

Free 30-day returns

Search results for Practical Synthetic Data Generation: Balancing Privacy...

Stock Image

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data

Emam, Khaled El

Published by O'Reilly Media, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

Used Softcover

Seller: World of Books (was SecondSale), Montgomery, IL, U.S.A.

Seller rating 5 out of 5 stars

Condition: Very Good. Item in very good condition! Textbooks may not include supplemental items i.e. CDs, access codes etc. Seller Inventory # 00105706329

Contact seller

Buy Used

US$ 36.07

Free Shipping
Ships within U.S.A.

Quantity: 1 available

Add to basket

Seller Image

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data

Emam, Khaled El; Mosquera, Lucy; Hoptroff, Richard

Published by O'Reilly Media, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

Used Softcover

Seller: Bay State Book Company, North Smithfield, RI, U.S.A.

Seller rating 5 out of 5 stars

Condition: very_good. Seller Inventory # BSM.134TC

Contact seller

Buy Used

US$ 36.08

Free Shipping
Ships within U.S.A.

Quantity: 1 available

Add to basket

Seller Image

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data

Emam, Khaled El; Mosquera, Lucy; Hoptroff, Richard

Published by O'Reilly Media, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

Used Softcover

Seller: Goodbooks Company, Springdale, AR, U.S.A.

Seller rating 5 out of 5 stars

Condition: good. Book has corner edge dings and or scratches and signs of light wear. Seller Inventory # GBV.1492072745.G

Contact seller

Buy Used

US$ 32.89

US$ 4.99 shipping
Ships within U.S.A.

Quantity: 1 available

Add to basket

Seller Image

Practical Synthetic Data Generation : Balancing Privacy and the Broad Availability of Data

Emam, Khaled El; Mosquera, Lucy; Hoptroff, Richard

Published by O'Reilly Media, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

New Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars

Condition: New. Seller Inventory # 39708295-n

Contact seller

Buy New

US$ 45.82

US$ 2.64 shipping
Ships within U.S.A.

Quantity: 2 available

Add to basket

Stock Image

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data

O'Reilly Media

Published by O'Reilly Media, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

New Softcover

Seller: Lakeside Books, Benton Harbor, MI, U.S.A.

Seller rating 5 out of 5 stars

Condition: New. Brand New! Not Overstocks or Low Quality Book Club Editions! Direct From the Publisher! We're not a giant, faceless warehouse organization! We're a small town bookstore that loves books and loves it's customers! Buy from Lakeside Books! Seller Inventory # OTF-S-9781492072744

Contact seller

Buy New

US$ 44.48

US$ 3.99 shipping
Ships within U.S.A.

Quantity: Over 20 available

Add to basket

Stock Image

Practical Synthetic Data Generation

Khaled El Emam

Published by O'Reilly Media, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

New PAP

Seller: PBShop.store US, Wood Dale, IL, U.S.A.

Seller rating 5 out of 5 stars

PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller Inventory # WO-9781492072744

Contact seller

Buy New

US$ 48.85

Free Shipping
Ships within U.S.A.

Quantity: 2 available

Add to basket

Seller Image

Practical Synthetic Data Generation

Lucy Mosquera, Khaled El Emam, Richard Hoptroff

Published by O'Reilly Media, US, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

New Paperback

Seller: Rarewaves USA, OSWEGO, IL, U.S.A.

Seller rating 5 out of 5 stars

Paperback. Condition: New. Building and testing machine learning models requires access to large and diverse data. But where can you find usable datasets without running into privacy issues? This practical book introduces techniques for generating synthetic data-fake data generated from real data-so you can perform secondary analysis to do research, understand customer behaviors, develop new products, or generate new revenueData scientists will learn how synthetic data generation provides a way to make such data broadly available for secondary purposes while addressing many privacy concerns. Analysts will learn the principles and steps for generating synthetic data from real datasets. And business leaders will see how synthetic data can help accelerate time to a product or solution. This book describes:Steps for generating synthetic data using multivariate normal distributionsMethods for distribution fitting covering different goodness-of-fit metrics How to replicate the simple structure of original data An approach for modeling data structure to consider complex relationshipsMultiple approaches and metrics you can use to assess data utilityHow analysis performed on real data can be replicated with synthetic dataPrivacy implications of synthetic data and methods to assess identity disclosure. Seller Inventory # LU-9781492072744

Contact seller

Buy New

US$ 48.92

Free Shipping
Ships within U.S.A.

Quantity: Over 20 available

Add to basket

Seller Image

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data (Paperback or Softback)

Emam, Khaled El

Published by O'Reilly Media 6/9/2020, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

New Paperback or Softback

Seller: BargainBookStores, Grand Rapids, MI, U.S.A.

Seller rating 5 out of 5 stars

Paperback or Softback. Condition: New. Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data. Book. Seller Inventory # BBS-9781492072744

Contact seller

Buy New

US$ 49.90

Free Shipping
Ships within U.S.A.

Quantity: 5 available

Add to basket

Seller Image

Practical Synthetic Data Generation : Balancing Privacy and the Broad Availability of Data

Emam, Khaled El; Mosquera, Lucy; Hoptroff, Richard

Published by O'Reilly Media, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

Used Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars

Condition: As New. Unread book in perfect condition. Seller Inventory # 39708295

Contact seller

Buy Used

US$ 50.05

US$ 2.64 shipping
Ships within U.S.A.

Quantity: 2 available

Add to basket

Stock Image

Practical Synthetic Data Generation

Khaled El Emam

Published by O'Reilly Media, 2020

ISBN 10: 1492072745 ISBN 13: 9781492072744

New PAP

Seller: PBShop.store UK, Fairford, GLOS, United Kingdom

Seller rating 5 out of 5 stars

PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller Inventory # WO-9781492072744

Contact seller

Buy New

US$ 50.68

US$ 5.60 shipping
Ships from United Kingdom to U.S.A.

Quantity: 2 available

Add to basket

There are 16 more copies of this book

View all search results for this book

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data - Softcover

Emam, Khaled El ; Mosquera, Lucy ; Hoptroff, Richard

About the Authors

Other Popular Editions of the Same Title

Featured Edition

Search results for Practical Synthetic Data Generation: Balancing Privacy...

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data

Buy Used

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data

Buy Used

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data

Buy Used

Practical Synthetic Data Generation : Balancing Privacy and the Broad Availability of Data

Buy New

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data

Buy New

Practical Synthetic Data Generation

Buy New

Practical Synthetic Data Generation

Buy New

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data (Paperback or Softback)

Buy New

Practical Synthetic Data Generation : Balancing Privacy and the Broad Availability of Data

Buy Used

Practical Synthetic Data Generation

Buy New

There are 16 more copies of this book

Practical Synthetic Data Generation: Balancing Privacy and the Broad Availability of Data - Softcover

Synopsis

About the Authors

Other Popular Editions of the Same Title

Featured Edition

Search results for Practical Synthetic Data Generation: Balancing Privacy...

Buy Used

Buy Used

Buy Used

Buy New

Buy New

Buy New

Buy New

Buy New

Buy Used

Buy New

There are 16 more copies of this book