High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark - Softcover

  • 3.98 out of 5 stars
    128 ratings by Goodreads
 
9781491943205: High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Synopsis

Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster and handle larger data sizes, while using fewer resources.

Ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications, this book describes techniques that can reduce data infrastructure costs and developer hours. Not only will you gain a more comprehensive understanding of Spark, you’ll also learn how to make it sing.

With this book, you’ll explore:

  • How Spark SQL’s new interfaces improve performance over Spark’s RDD data structure
  • The choice between data joins in Core Spark and Spark SQL
  • Techniques for getting the most out of standard RDD transformations
  • How to work around performance issues in Spark’s key/value pair paradigm
  • Writing high-performance Spark code without Scala or the JVM
  • How to test for functionality and performance when applying suggested improvements
  • Using Spark MLlib and Spark ML machine learning libraries
  • Spark’s Streaming components and external community packages

"synopsis" may belong to another edition of this title.

Book Description

Best practices for scaling and optimizing Apache Spark

About the Author

Holden Karau is a transgender Canadian and an active open source contributor. When not in San Francisco working as a software development engineer at IBM's Spark Technology Center, Holden talks internationally on Apache Spark and holds office hours at coffee shops at home and abroad. She is a Spark committer with frequent contributions, specializing in PySpark and Machine Learning. Prior to IBM she worked on a variety of distributed, search, and classification problems at Alpine, Databricks, Google, Foursquare, and Amazon. She graduated from the University of Waterloo with a Bachelor of Mathematics in Computer Science. Outside of software she enjoys playing with fire, welding, scooters, poutine, and dancing.

Rachel Warren is a data scientist and software engineer at Alpine Data Labs, where she uses Spark to address real world data processing challenges. She has experience working as an analyst both in industry and academia. She graduated with a degree in Computer Science from Wesleyan University in Connecticut.

"About this title" may belong to another edition of this title.

  • Publisher: O'Reilly Media
  • Publication date: 2017
  • ISBN 10: 1491943203
  • ISBN 13: 9781491943205
  • Binding: Paperback
  • Language: English
  • Edition number: 1
  • Number of pages: 358

Other Popular Editions of the Same Title

9789352135615: High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Featured Edition

ISBN 10: 935213561X ISBN 13: 9789352135615
Publisher: Shroff/O'Reilly; First edition, 2017
Softcover

Search results for High Performance Spark: Best Practices for Scaling...

Karau, Holden; Warren, Rachel
Published by O'Reilly Media (edition 1), 2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
Used Paperback

Seller: BooksRun, Philadelphia, PA, U.S.A.

Seller rating: 5 out of 5 stars

Paperback. Condition: Very Good. 1. Ship within 24hrs. Satisfaction 100% guaranteed. APO/FPO addresses supported. Seller Inventory # 1491943203-8-1

Buy Used: US$ 15.85
Shipping: FREE within U.S.A.
Quantity: 1 available

Warren, Rachel; Karau, Holden
Published by O'Reilly Media, 2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
Used Paperback

Seller: HPB-Red, Dallas, TX, U.S.A.

Seller rating: 5 out of 5 stars

Paperback. Condition: Good. Connecting readers with great books since 1972! Used textbooks may not include companion materials such as access codes, etc. May have some wear or writing/highlighting. We ship orders daily and Customer Service is our top priority! Seller Inventory # S_424749562

Buy Used: US$ 13.69
Shipping: US$ 3.75 within U.S.A.
Quantity: 1 available

Karau, Holden; Warren, Rachel
Published by O'Reilly Media, 2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
Used Paperback

Seller: ThriftBooks-Atlanta, AUSTELL, GA, U.S.A.

Seller rating: 5 out of 5 stars

Paperback. Condition: Good. No Jacket. Pages can have notes/highlighting. Spine may show signs of wear. ~ ThriftBooks: Read More, Spend Less 1.33. Seller Inventory # G1491943203I3N00

Buy Used: US$ 17.67
Shipping: FREE within U.S.A.
Quantity: 1 available

Warren, Rachel; Karau, Holden
Published by O'Reilly Media, 2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
Used Soft cover

Seller: Virginia Martin, aka bookwitch, Concord, CA, U.S.A.

Seller rating: 5 out of 5 stars

Soft cover. Condition: As New. Small quarto, softcover, 12 n32 in white and red wraps. 342 pp. including index. Apache Spark is amazing when everything clicks. But if you haven't seen the performance improvements you expected, or still don't feel confident enough to use Spark in production, this practical book is for you. The authors demonstrate performance optimizations to help your Spark queries run faster and handle larger data sizes, while using fewer resources. Ideal for software engineers, data engineers, developers, and system administrators. Seller Inventory # 88322

Buy Used: US$ 15.00
Shipping: US$ 5.00 within U.S.A.
Quantity: 1 available

Karau, Holden
Published by O'Reilly Media, 2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
Used paperback

Seller: Seattle Goodwill, Seattle, WA, U.S.A.

Seller rating: 5 out of 5 stars

paperback. Condition: Good. May have some shelf-wear due to normal use. Your purchase funds free job training and education in the greater Seattle area. Thank you for supporting Goodwill's nonprofit mission! Seller Inventory # 0KVOFY002XHP

Buy Used: US$ 18.00
Shipping: US$ 3.99 within U.S.A.
Quantity: 2 available

Karau, Holden; Warren, Rachel
Published by O'Reilly Media, 2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
Used Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating: 5 out of 5 stars

Condition: Good. May show signs of wear, highlighting, writing, and previous use. This item may be a former library book with typical markings. No guarantee on products that contain supplements. Your satisfaction is 100% guaranteed. Twenty-five year bookseller with shipments to over fifty million happy customers. Seller Inventory # 25233654-5

Buy Used: US$ 24.21
Shipping: US$ 2.64 within U.S.A.
Quantity: 4 available

Karau, Holden; Warren, Rachel
Published by O'Reilly Media, 2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
New Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating: 5 out of 5 stars

Condition: New. Seller Inventory # 25233654-n

Buy New: US$ 34.82
Shipping: US$ 2.64 within U.S.A.
Quantity: 6 available

Karau, Holden
Published by O'Reilly Media, 6/16/2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
New Paperback or Softback

Seller: BargainBookStores, Grand Rapids, MI, U.S.A.

Seller rating: 5 out of 5 stars

Paperback or Softback. Condition: New. High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark 1.2. Book. Seller Inventory # BBS-9781491943205

Buy New: US$ 37.47
Shipping: FREE within U.S.A.
Quantity: 5 available

Karau, Holden; Warren, Rachel
Published by O'Reilly Media, 2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
New Softcover

Seller: Lakeside Books, Benton Harbor, MI, U.S.A.

Seller rating: 4 out of 5 stars

Condition: New. Brand New! Not Overstocks or Low Quality Book Club Editions! Direct From the Publisher! We're not a giant, faceless warehouse organization! We're a small town bookstore that loves books and loves its customers! Buy from Lakeside Books! Seller Inventory # OTF-S-9781491943205

Buy New: US$ 34.18
Shipping: US$ 3.99 within U.S.A.
Quantity: Over 20 available

Karau, Holden; Warren, Rachel
Published by O'Reilly Media, 2017
ISBN 10: 1491943203 ISBN 13: 9781491943205
New Softcover

Seller: Lucky's Textbooks, Dallas, TX, U.S.A.

Seller rating: 5 out of 5 stars

Condition: New. Seller Inventory # ABLIING23Mar2716030177469

Buy New: US$ 37.13
Shipping: US$ 3.99 within U.S.A.
Quantity: Over 20 available

There are 15 more copies of this book