Advanced Analytics with Spark: Patterns for Learning from Data at Scale

4 avg rating
( 96 ratings by Goodreads )
 
9781491972953: Advanced Analytics with Spark: Patterns for Learning from Data at Scale
View all copies of this ISBN edition:
 
 

In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this edition acts as an introduction to these techniques and other best practices in Spark programming.

You’ll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques—including classification, clustering, collaborative filtering, and anomaly detection—to fields such as genomics, security, and finance.

If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you’ll find the book’s patterns useful for working on your own data applications.

With this book, you will:

  • Familiarize yourself with the Spark programming model
  • Become comfortable within the Spark ecosystem
  • Learn general approaches in data science
  • Examine complete implementations that analyze large public data sets
  • Discover which machine learning tools make sense for particular problems
  • Acquire code that can be adapted to many uses

"synopsis" may belong to another edition of this title.

About the Author:

Sandy Ryza develops algorithms for public transit at Remix. Prior, he was a senior data scientist at Cloudera and Clover Health. He is an Apache Spark committer, Apache Hadoop PMC member, and founder of the Time Series for Spark project. He holds the Brown University computer science department's 2012 Twining award for "Most Chill".

Uri Laserson is an Assistant Professor of Genetics at the Icahn School of Medicine at Mount Sinai, where he develops scalable technology for genomics and immunology using the Hadoop ecosystem.

Sean Owen is Director of Data Science at Cloudera. He is an ApacheSpark committer and PMC member, and was an Apache Mahout committer.

Josh Wills is the Head of Data Engineering at Slack, the founder of the Apache Crunch project, and wrote a tweet about data scientists once.

"About this title" may belong to another edition of this title.

Other Popular Editions of the Same Title

9789352135714: Advanced Analytics with Spark: Patterns for Learning from Data at Scale

Featured Edition

ISBN 10:  9352135717 ISBN 13:  9789352135714
Softcover

Top Search Results from the AbeBooks Marketplace

International Edition
International Edition

1.

Ryza, Sandy; Laserson, Uri; Owen, Sean; Wills, Josh
Published by O'Reilly Media
ISBN 10: 1491972955 ISBN 13: 9781491972953
New PAPERBACK Quantity Available: 8
International Edition
Seller:
Ben's Book Shop
(Wilmington, DE, U.S.A.)
Rating
[?]

Book Description O'Reilly Media. PAPERBACK. Condition: New. 1491972955 Paperback. Book Condition: New. This is an International Edition. Brand new. Seller Inventory # INDMKT-9789352135714

More information about this seller | Contact this seller

Buy New
US$ 17.10
Convert currency

Add to Basket

Shipping: FREE
Within U.S.A.
Destination, rates & speeds
International Edition
International Edition

2.

Sandy Ryza
ISBN 10: 1491972955 ISBN 13: 9781491972953
New Softcover Quantity Available: 2
International Edition
Seller:
Rating
[?]

Book Description Softcover. Condition: Brand New. .. International Edition, ISBN and Cover image may differ but contents similar to U.S. Edition, Printed in Black & White or color. Territorial restrictions may be printed on the book. GET IT FAST within 3-5 business days by DHL/FedEx/Aramex and tracking number will be uploaded into your order page within 24-48 hours. Kindly provide day time phone number in order to ensure smooth delivery. No shipping to PO BOX, APO, FPO addresses. 100% Customer satisfaction guaranteed!. . Seller Inventory # STB108028

More information about this seller | Contact this seller

Buy New
US$ 22.82
Convert currency

Add to Basket

Shipping: FREE
From India to U.S.A.
Destination, rates & speeds
International Edition
International Edition

3.

Sandy Ryza andUri Laserson
ISBN 10: 1491972955 ISBN 13: 9781491972953
New Softcover Quantity Available: 4
International Edition
Seller:
Rating
[?]

Book Description Softcover. Condition: Brand New. .. International Edition, ISBN and Cover image may differ but contents similar to U.S. Edition, Printed in Black & White or color. Territorial restrictions may be printed on the book. GET IT FAST within 3-5 business days by DHL/FedEx/Aramex and tracking number will be uploaded into your order page within 24-48 hours. Kindly provide day time phone number in order to ensure smooth delivery. No shipping to PO BOX, APO, FPO addresses. 100% Customer satisfaction guaranteed!. . Seller Inventory # STB108165

More information about this seller | Contact this seller

Buy New
US$ 25.76
Convert currency

Add to Basket

Shipping: FREE
From India to U.S.A.
Destination, rates & speeds

4.

Ryza, Sandy; Laserson, Uri; Owen, Sean; Wills, Josh
Published by O'Reilly Media
ISBN 10: 1491972955 ISBN 13: 9781491972953
New PAPERBACK Quantity Available: > 20
Seller:
Mediaoutlet12345
(Springfield, VA, U.S.A.)
Rating
[?]

Book Description O'Reilly Media. PAPERBACK. Condition: New. 1491972955 *BRAND NEW* Ships Same Day or Next!. Seller Inventory # SWATI2132896577

More information about this seller | Contact this seller

Buy New
US$ 27.78
Convert currency

Add to Basket

Shipping: US$ 3.99
Within U.S.A.
Destination, rates & speeds

5.

Ryza, Sandy
Published by O'Reilly Media 7/6/2017 (2017)
ISBN 10: 1491972955 ISBN 13: 9781491972953
New Paperback or Softback Quantity Available: 5
Seller:
BargainBookStores
(Grand Rapids, MI, U.S.A.)
Rating
[?]

Book Description O'Reilly Media 7/6/2017, 2017. Paperback or Softback. Condition: New. Advanced Analytics with Spark: Patterns for Learning from Data at Scale. Book. Seller Inventory # BBS-9781491972953

More information about this seller | Contact this seller

Buy New
US$ 32.07
Convert currency

Add to Basket

Shipping: FREE
Within U.S.A.
Destination, rates & speeds

6.

Ryza, Sandy
ISBN 10: 1491972955 ISBN 13: 9781491972953
New Quantity Available: 2
Seller:
Paperbackshop-US
(Wood Dale, IL, U.S.A.)
Rating
[?]

Book Description 2017. PAP. Condition: New. New Book. Shipped from US within 10 to 14 business days. Established seller since 2000. Seller Inventory # KB-9781491972953

More information about this seller | Contact this seller

Buy New
US$ 31.49
Convert currency

Add to Basket

Shipping: US$ 3.99
Within U.S.A.
Destination, rates & speeds

7.

Ryza, Sandy/ Laserson, Uri/ Owen, Sean/ Wills, Josh
ISBN 10: 1491972955 ISBN 13: 9781491972953
New Softcover Quantity Available: 3
Seller:
VNHM SHOP
(Pompano Beach, FL, U.S.A.)
Rating
[?]

Book Description Softcover. Condition: New. Seller Inventory # 5659563

More information about this seller | Contact this seller

Buy New
US$ 35.73
Convert currency

Add to Basket

Shipping: FREE
Within U.S.A.
Destination, rates & speeds

8.

Uri Laserson, Sean Owens, Sandy Ryza,
Published by O'Reilly Media, Inc, USA, United States (2017)
ISBN 10: 1491972955 ISBN 13: 9781491972953
New Paperback Quantity Available: 10
Seller:
Book Depository International
(London, United Kingdom)
Rating
[?]

Book Description O'Reilly Media, Inc, USA, United States, 2017. Paperback. Condition: New. 2nd ed.. Language: English. Brand new Book. In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this edition acts as an introduction to these techniques and other best practices in Spark programming. You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques-including classification, clustering, collaborative filtering, and anomaly detection-to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find the book's patterns useful for working on your own data applications. With this book, you will: Familiarize yourself with the Spark programming model Become comfortable within the Spark ecosystem Learn general approaches in data science Examine complete implementations that analyze large public data sets Discover which machine learning tools make sense for particular problems Acquire code that can be adapted to many uses. Seller Inventory # AAH9781491972953

More information about this seller | Contact this seller

Buy New
US$ 36.14
Convert currency

Add to Basket

Shipping: FREE
From United Kingdom to U.S.A.
Destination, rates & speeds

9.

Uri Laserson, Sean Owens, Sandy Ryza,
Published by O'Reilly Media, Inc, USA, United States (2017)
ISBN 10: 1491972955 ISBN 13: 9781491972953
New Paperback Quantity Available: 10
Seller:
The Book Depository
(London, United Kingdom)
Rating
[?]

Book Description O'Reilly Media, Inc, USA, United States, 2017. Paperback. Condition: New. 2nd ed.. Language: English. Brand new Book. In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this edition acts as an introduction to these techniques and other best practices in Spark programming. You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques-including classification, clustering, collaborative filtering, and anomaly detection-to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find the book's patterns useful for working on your own data applications. With this book, you will: Familiarize yourself with the Spark programming model Become comfortable within the Spark ecosystem Learn general approaches in data science Examine complete implementations that analyze large public data sets Discover which machine learning tools make sense for particular problems Acquire code that can be adapted to many uses. Seller Inventory # AAH9781491972953

More information about this seller | Contact this seller

Buy New
US$ 37.09
Convert currency

Add to Basket

Shipping: FREE
From United Kingdom to U.S.A.
Destination, rates & speeds

10.

Ryza, Sandy
Published by O'Reilly Media (2018)
ISBN 10: 1491972955 ISBN 13: 9781491972953
New Paperback Quantity Available: > 20
Print on Demand
Seller:
Murray Media
(NORTH MIAMI BEACH, FL, U.S.A.)
Rating
[?]

Book Description O'Reilly Media, 2018. Paperback. Condition: New. Never used! This item is printed on demand. Seller Inventory # 1491972955

More information about this seller | Contact this seller

Buy New
US$ 37.41
Convert currency

Add to Basket

Shipping: FREE
Within U.S.A.
Destination, rates & speeds

There are more copies of this book

View all search results for this book