The world's businesses ingest a combined 2.5 quintillion bytes of data every day. But how much of this vast amount of data--used to build products, power AI systems, and drive business decisions--is poor quality or just plain bad? This practical book shows you how to ensure that the data your organization relies on contains only high-quality records.
Most data engineers, data analysts, and data scientists genuinely care about data quality, but they often don't have the time, resources, or understanding to create a data quality monitoring solution that succeeds at scale. In this book, Jeremy Stanley and Paige Schwartz from Anomalo explain how you can use automated data quality monitoring to cover all your tables efficiently, proactively alert on every category of issue, and resolve problems immediately.
This book will help you:
"synopsis" may belong to another edition of this title.
Jeremy Stanley is co-founder and CTO at Anomalo. Prior to Anomalo, Jeremy was the VP of Data Science at Instacart, where he led machine learning and drove multiple initiatives to improve the company's profitability. Previously, he led data science and engineering at other hyper-growth companies like Sailthru. He's applied machine learning and AI technologies to everything from insurance and accounting to ad-tech and last-mile delivery logistics. He's also a recognized thought leader in the data science community with hugely popular blog posts like Deep Learning with Emojis (not Math). Jeremy holds a BS in Mathematics from Wichita State University and an MBA from Columbia University.
Paige Schwartz is a professional technical writer at Anomalo who has written for clients such as Airbnb, Grammarly, and OpenAI. She specializes in communicating complex software engineering topics to a general audience and has spent her career working with machine learning and data systems, including 5 years as a product manager on Google Search. She holds a joint BA in Computer Science and English from UC Berkeley.
"About this title" may belong to another edition of this title.
US$ 3.99 shipping within U.S.A.
Destination, rates & speedsSeller: Books From California, Simi Valley, CA, U.S.A.
paperback. Condition: Good. manufacturing defect minor wear and creasing. Seller Inventory # mon0003610779
Quantity: 1 available
Seller: SecondSale, Montgomery, IL, U.S.A.
Condition: Very Good. Item in very good condition! Textbooks may not include supplemental items i.e. CDs, access codes etc. Seller Inventory # 00088986706
Quantity: 1 available
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: New. Seller Inventory # 46041629-n
Quantity: 1 available
Seller: BargainBookStores, Grand Rapids, MI, U.S.A.
Paperback or Softback. Condition: New. Automating Data Quality Monitoring: Scaling Beyond Rules with Machine Learning 0.79. Book. Seller Inventory # BBS-9781098145934
Quantity: 5 available
Seller: Lakeside Books, Benton Harbor, MI, U.S.A.
Condition: New. Brand New! Not Overstocks or Low Quality Book Club Editions! Direct From the Publisher! We're not a giant, faceless warehouse organization! We're a small town bookstore that loves books and loves it's customers! Buy from Lakeside Books! Seller Inventory # OTF-S-9781098145934
Quantity: Over 20 available
Seller: PBShop.store US, Wood Dale, IL, U.S.A.
PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller Inventory # WO-9781098145934
Quantity: 3 available
Seller: California Books, Miami, FL, U.S.A.
Condition: New. Seller Inventory # I-9781098145934
Quantity: Over 20 available
Seller: PBShop.store UK, Fairford, GLOS, United Kingdom
PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller Inventory # WO-9781098145934
Quantity: 3 available
Seller: GreatBookPrices, Columbia, MD, U.S.A.
Condition: As New. Unread book in perfect condition. Seller Inventory # 46041629
Quantity: 1 available
Seller: Grand Eagle Retail, Fairfield, OH, U.S.A.
Paperback. Condition: new. Paperback. The world's businesses ingest a combined 2.5 quintillion bytes of data every day. But how much of this vast amount of data--used to build products, power AI systems, and drive business decisions--is poor quality or just plain bad? This practical book shows you how to ensure that the data your organization relies on contains only high-quality records.Most data engineers, data analysts, and data scientists genuinely care about data quality, but they often don't have the time, resources, or understanding to create a data quality monitoring solution that succeeds at scale. In this book, Jeremy Stanley and Paige Schwartz from Anomalo explain how you can use automated data quality monitoring to cover all your tables efficiently, proactively alert on every category of issue, and resolve problems immediately.This book will help you:Learn why data quality is a business imperativeUnderstand and assess unsupervised learning models for detecting data issuesImplement notifications that reduce alert fatigue and let you triage and resolve issues quicklyIntegrate automated data quality monitoring with data catalogs, orchestration layers, and BI and ML systemsUnderstand the limits of automated data quality monitoring and how to overcome themLearn how to deploy and manage your monitoring solution at scaleMaintain automated data quality monitoring for the long term In this book, Jeremy Stanley and Paige Schwartz from Anomalo explain how you can use automated data quality monitoring to cover all your tables efficiently, proactively alert on every category of issue, and resolve problems immediately. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Seller Inventory # 9781098145934
Quantity: 1 available