Apache Flume: Distributed Log Collection for Hadoop - Second Edition

1 avg rating
( 1 ratings by Goodreads )
 
9781784392178: Apache Flume: Distributed Log Collection for Hadoop - Second Edition
View all copies of this ISBN edition:
 
 

Design and implement a series of Flume agents to send streamed data into Hadoop

About This Book

  • Construct a series of Flume agents using the Apache Flume service to efficiently collect, aggregate, and move large amounts of event data
  • Configure failover paths and load balancing to remove single points of failure
  • Use this step-by-step guide to stream logs from application servers to Hadoop's HDFS

Who This Book Is For

If you are a Hadoop programmer who wants to learn about Flume to be able to move datasets into Hadoop in a timely and replicable manner, then this book is ideal for you. No prior knowledge about Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop File System (HDFS) is assumed.

What You Will Learn

  • Understand the Flume architecture, and also how to download and install open source Flume from Apache
  • Follow along a detailed example of transporting weblogs in Near Real Time (NRT) to Kibana/Elasticsearch and archival in HDFS
  • Learn tips and tricks for transporting logs and data in your production environment
  • Understand and configure the Hadoop File System (HDFS) Sink
  • Use a morphline-backed Sink to feed data into Solr
  • Create redundant data flows using sink groups
  • Configure and use various sources to ingest data
  • Inspect data records and move them between multiple destinations based on payload content
  • Transform data en-route to Hadoop and monitor your data flows

In Detail

Apache Flume is a distributed, reliable, and available service used to efficiently collect, aggregate, and move large amounts of log data. It is used to stream logs from application servers to HDFS for ad hoc analysis.

This book starts with an architectural overview of Flume and its logical components. It explores channels, sinks, and sink processors, followed by sources and channels. By the end of this book, you will be fully equipped to construct a series of Flume agents to dynamically transport your stream data and logs from your systems into Hadoop.

A step-by-step book that guides you through the architecture and components of Flume covering different approaches, which are then pulled together as a real-world, end-to-end use case, gradually going from the simplest to the most advanced features.

"synopsis" may belong to another edition of this title.

About the Author:

Steve Hoffman

Steve Hoffman has 32 years of experience in software development, ranging from embedded software development to the design and implementation of large-scale, service-oriented, object-oriented systems. For the last 5 years, he has focused on infrastructure as code, including automated Hadoop and HBase implementations and data ingestion using Apache Flume. Steve holds a BS in computer engineering from the University of Illinois at Urbana-Champaign and an MS in computer science from DePaul University. He is currently a senior principal engineer at Orbitz Worldwide (http://orbitz.com/). More information on Steve can be found at http://bit.ly/bacoboy and on Twitter at @bacoboy. This is the first update to Steve's first book, Apache Flume: Distributed Log Collection for Hadoop, Packt Publishing.

"About this title" may belong to another edition of this title.

Buy New View Book
List Price: US$ 36.99
US$ 35.77

Convert currency

Shipping: FREE
From United Kingdom to U.S.A.

Destination, rates & speeds

Add to Basket

Top Search Results from the AbeBooks Marketplace

1.

Steve Hoffman
Published by Packt Publishing Limited, United Kingdom (2015)
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Paperback Quantity Available: 10
Print on Demand
Seller:
The Book Depository
(London, United Kingdom)
Rating
[?]

Book Description Packt Publishing Limited, United Kingdom, 2015. Paperback. Condition: New. 2nd Revised edition. Language: English . Brand New Book ***** Print on Demand *****.If you are a Hadoop programmer who wants to learn about Flume to be able to move datasets into Hadoop in a timely and replicable manner, then this book is ideal for you. No prior knowledge about Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop File System (HDFS) is assumed. Seller Inventory # AAV9781784392178

More information about this seller | Contact this seller

Buy New
US$ 35.77
Convert currency

Add to Basket

Shipping: FREE
From United Kingdom to U.S.A.
Destination, rates & speeds

2.

Steve Hoffman
Published by Packt Publishing Limited, United Kingdom (2015)
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Paperback Quantity Available: 10
Print on Demand
Seller:
Book Depository International
(London, United Kingdom)
Rating
[?]

Book Description Packt Publishing Limited, United Kingdom, 2015. Paperback. Condition: New. 2nd Revised edition. Language: English . Brand New Book ***** Print on Demand *****. If you are a Hadoop programmer who wants to learn about Flume to be able to move datasets into Hadoop in a timely and replicable manner, then this book is ideal for you. No prior knowledge about Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop File System (HDFS) is assumed. Seller Inventory # AAV9781784392178

More information about this seller | Contact this seller

Buy New
US$ 40.55
Convert currency

Add to Basket

Shipping: FREE
From United Kingdom to U.S.A.
Destination, rates & speeds

3.

Hoffman, Steve
Published by Packt Publishing Limited (2015)
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Quantity Available: > 20
Print on Demand
Seller:
Pbshop
(Wood Dale, IL, U.S.A.)
Rating
[?]

Book Description Packt Publishing Limited, 2015. PAP. Condition: New. New Book. Shipped from US within 10 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # IQ-9781784392178

More information about this seller | Contact this seller

Buy New
US$ 36.57
Convert currency

Add to Basket

Shipping: US$ 3.99
Within U.S.A.
Destination, rates & speeds

4.

Steve Hoffman
Published by Packt Publishing Limited (2015)
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Quantity Available: > 20
Print on Demand
Seller:
Books2Anywhere
(Fairford, GLOS, United Kingdom)
Rating
[?]

Book Description Packt Publishing Limited, 2015. PAP. Condition: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # LQ-9781784392178

More information about this seller | Contact this seller

Buy New
US$ 32.01
Convert currency

Add to Basket

Shipping: US$ 11.75
From United Kingdom to U.S.A.
Destination, rates & speeds

5.

Hoffman, Steve
Published by Packt Publishing 2/28/2015 (2015)
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Paperback or Softback Quantity Available: 10
Seller:
BargainBookStores
(Grand Rapids, MI, U.S.A.)
Rating
[?]

Book Description Packt Publishing 2/28/2015, 2015. Paperback or Softback. Condition: New. Apache Flume: Distributed Log Collection for Hadoop - Second Edition. Book. Seller Inventory # BBS-9781784392178

More information about this seller | Contact this seller

Buy New
US$ 47.54
Convert currency

Add to Basket

Shipping: FREE
Within U.S.A.
Destination, rates & speeds

6.

Hoffman, Steve
Published by Packt Publishing - ebooks Acco (2018)
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Paperback Quantity Available: 14
Print on Demand
Seller:
Murray Media
(NORTH MIAMI BEACH, FL, U.S.A.)
Rating
[?]

Book Description Packt Publishing - ebooks Acco, 2018. Paperback. Condition: New. Never used! This item is printed on demand. Seller Inventory # 1784392170

More information about this seller | Contact this seller

Buy New
US$ 48.09
Convert currency

Add to Basket

Shipping: FREE
Within U.S.A.
Destination, rates & speeds

7.

Steve Hoffman
Published by Packt Publishing (2015)
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Softcover Quantity Available: 1
Print on Demand
Seller:
Rating
[?]

Book Description Packt Publishing, 2015. Condition: New. This item is printed on demand for shipment within 3 working days. Seller Inventory # GM9781784392178

More information about this seller | Contact this seller

Buy New
US$ 45.92
Convert currency

Add to Basket

Shipping: US$ 3.45
From Germany to U.S.A.
Destination, rates & speeds

8.

Steve Hoffman
Published by Packt Publishing Limited, United Kingdom (2015)
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Paperback Quantity Available: 10
Seller:
Book Depository hard to find
(London, United Kingdom)
Rating
[?]

Book Description Packt Publishing Limited, United Kingdom, 2015. Paperback. Condition: New. 2nd Revised edition. Language: English. Brand new Book. If you are a Hadoop programmer who wants to learn about Flume to be able to move datasets into Hadoop in a timely and replicable manner, then this book is ideal for you. No prior knowledge about Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop File System (HDFS) is assumed. Seller Inventory # LIE9781784392178

More information about this seller | Contact this seller

Buy New
US$ 52.22
Convert currency

Add to Basket

Shipping: FREE
From United Kingdom to U.S.A.
Destination, rates & speeds

9.

Steve Hoffman
Published by Packt Publishing - ebooks Account
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Paperback Quantity Available: > 20
Seller:
BuySomeBooks
(Las Vegas, NV, U.S.A.)
Rating
[?]

Book Description Packt Publishing - ebooks Account. Paperback. Condition: New. 175 pages. Dimensions: 9.2in. x 7.5in. x 0.4in.Design and implement a series of Flume agents to send streamed data into Hadoop About This BookConstruct a series of Flume agents using the Apache Flume service to efficiently collect, aggregate, and move large amounts of event dataConfigure failover paths and load balancing to remove single points of failureUse this step-by-step guide to stream logs from application servers to Hadoops HDFSWho This Book Is ForIf you are a Hadoop programmer who wants to learn about Flume to be able to move datasets into Hadoop in a timely and replicable manner, then this book is ideal for you. No prior knowledge about Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop File System (HDFS) is assumed. In Detail Apache Flume is a distributed, reliable, and available service used to efficiently collect, aggregate, and move large amounts of log data. It is used to stream logs from application servers to HDFS for ad hoc analysis. This book starts with an architectural overview of Flume and its logical components. It explores channels, sinks, and sink processors, followed by sources and channels. By the end of this book, you will be fully equipped to construct a series of Flume agents to dynamically transport your stream data and logs from your systems into Hadoop. A step-by-step book that guides you through the architecture and components of Flume covering different approaches, which are then pulled together as a real-world, end-to-end use case, gradually going from the simplest to the most advanced features. This item ships from multiple locations. Your book may arrive from Roseburg,OR, La Vergne,TN. Paperback. Seller Inventory # 9781784392178

More information about this seller | Contact this seller

Buy New
US$ 52.96
Convert currency

Add to Basket

Shipping: FREE
Within U.S.A.
Destination, rates & speeds

10.

Steve Hoffman
Published by Packt Publishing - ebooks Account (2015)
ISBN 10: 1784392170 ISBN 13: 9781784392178
New Softcover Quantity Available: 1
Seller:
Irish Booksellers
(Portland, ME, U.S.A.)
Rating
[?]

Book Description Packt Publishing - ebooks Account, 2015. Condition: New. book. Seller Inventory # M1784392170

More information about this seller | Contact this seller

Buy New
US$ 50.82
Convert currency

Add to Basket

Shipping: US$ 3.27
Within U.S.A.
Destination, rates & speeds

There are more copies of this book

View all search results for this book