Num Pages: 260 pages. BIC Classification: UK. Category: (P) Professional & Vocational. Weight in Grams: 666. . 2016. 1st Edition. Paperback. . . . . Books ship from the US and Ireland.

Seller Inventory # V9781119254010

Contact seller

Report this item

Bibliographic Details

Title: Spark: Big Data Cluster Computing in Production
Publisher: John Wiley & Sons Inc
Publication Date: 2016
Language: English
ISBN 10: 1119254019
ISBN 13: 9781119254010
Binding: Soft cover
Condition: New

About this title

Synopsis

Production-targeted Spark guidance with real-world use cases

Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance toward using lightning-fast big-data clustering in production. Written by an expert team well-known in the big data community, this book walks you through the challenges in moving from proof-of-concept or demo Spark applications to live Spark in production. Real use cases provide deep insight into common problems, limitations, challenges, and opportunities, while expert tips and tricks help you get the most out of Spark performance. Coverage includes Spark SQL, Tachyon, Kerberos, ML Lib, YARN, and Mesos, with clear, actionable guidance on resource scheduling, db connectors, streaming, security, and much more.

Spark has become the tool of choice for many Big Data problems, with more active contributors than any other Apache Software project. General introductory books abound, but this book is the first to provide deep insight and real-world advice on using Spark in production. Specific guidance, expert tips, and invaluable foresight make this guide an incredibly useful resource for real production settings.

Review Spark hardware requirements and estimate cluster size
Gain insight from real-world production use cases
Tighten security, schedule resources, and fine-tune performance
Overcome common problems encountered using Spark in production

Spark works with other big data tools including MapReduce and Hadoop, and uses languages you already know like Java, Scala, Python, and R. Lightning speed makes Spark too good to pass up, but understanding limitations and challenges in advance goes a long way toward easing actual production implementation. Spark: Big Data Cluster Computing in Production tells you everything you need to know, with real-world production insight and expert guidance, tips, and tricks.

About the Author

Ilya Ganelin is a data engineer working at Capital One Data Innovation Lab. Ilya is an active contributor to the core components of Apache Spark and a committer to Apache Apex.

Ema Orhian is a Big Data Engineer interested in scaling algorithms. She is the main committer on jaws-spark-sql-rest, a data warehouse explorer on top of Spark SQL.

Kai Sasaki is a software engineer working in distributed computing and machine learning. He is a Spark contributor who develops mainly MLlib, ML libraries.

Brennon York has been a core contributor to Apache Spark since 2014 including development on GraphX and the core build environment.

"About this title" may belong to another edition of this title.

Store Description

We carry a comprehensive range of out of print and rare books.

Visit Seller's Storefront

Seller's business information

Kennys Bookshops and Art Galleries Limited
Liosban Industrial Estate, Tuam Road, Galway, GY, Ireland

Sale & Shipping Terms

Terms of Sale

We guarantee the condition of every book as it's described on the Abebooks websites.

If you're dissatisfied with your purchase (Incorrect Book/Not as Described/Damaged) or if the order hasn't arrived, you're eligible for a refund within 30 days of the estimated delivery date.

For any queries please use the contact seller link or send an email to books@kennys.ie

Conor Kenny

Shipping Terms

All books securely packaged. Some books ship from Ireland.

Shipping rates within U.S.A.

Shipping rates within U.S.A.
Order quantity	14 to 20 business days	13 to 14 business days
First item	US$ 10.50	US$ 21.00

Delivery times are set by sellers and vary by carrier and location. Orders passing through Customs may face delays and buyers are responsible for any associated duties or fees. Sellers may contact you regarding additional charges to cover any increased costs to ship your items.