Perl & LWP: Fetching Web Pages, Parsing HTML, Writing Spiders & More - Softcover

 
9780596001780: Perl & LWP: Fetching Web Pages, Parsing HTML, Writing Spiders & More

Perl soared to popularity as a language for creating and managing web content, but with LWP (Library for WWW in Perl), Perl is equally adept at consuming information on the Web. LWP is a suite of modules for fetching and processing web pages.

The Web is a vast data source that contains everything from stock prices to movie credits, and with LWP all that data is just a few lines of code away. Anything you do on the Web, whether it's buying or selling, reading or writing, uploading or downloading, from news to e-commerce, can be controlled with Perl and LWP. You can automate Web-based purchase orders as easily as you can set up a program to download MP3 files from a web site.

Perl & LWP covers:

  • Understanding LWP and its design
  • Fetching and analyzing URLs
  • Extracting information from HTML using regular expressions and tokens
  • Working with the structure of HTML documents using trees
  • Setting and inspecting HTTP headers and response codes
  • Managing cookies
  • Accessing information that requires authentication
  • Extracting links
  • Cooperating with proxy caches
  • Writing web spiders (also known as robots) in a safe fashion

Perl & LWP includes many step-by-step examples that show how to apply the various techniques. Programs to extract information from the web sites of BBC News, Altavista, ABEBooks.com, and the Weather Underground, to name just a few, are explained in detail, so that you understand how and why they work.

Perl programmers who want to automate and mine the web can pick up this book and be immediately productive. Written by a contributor to LWP, and with a foreword by one of LWP's creators, Perl & LWP is the authoritative guide to this powerful and popular toolkit.

"synopsis" may belong to another edition of this title.

Review:
Perl & LWP sets out to unwrap the Library for the Web in Perl (LWP), which is a collection of modules that make it easier to access and pick apart Web pages (and FTP-accessible files, and outgoing e-mail messages) from within your Perl programs. The book succeeds wonderfully, not only in conveying the technical aspects of LWP programming, but in making clear the fun of doing work that's very well suited to Perl. Sean Burke assumes that his readers know something about Perl, albeit not much, and a similar amount about HTML. He does a great job of explaining how LWP functions fit into Perl programs, and how you can use them to make reference to Internet resources far more easily than before.

Burke's narrative takes the form of a guided tour in which he introduces his readers to aspects of the LWP modules one by one. His tone is generally straightforward (sharp commentary alternates with brief code listings, with occasional passages of reference material), but there's sometimes an undercurrent of exuberance that makes the reader want to get going with his or her own programming right away. Overall, the emphasis is on teaching both LWP and Perl itself to the extent necessary to do LWP work. Because of the concise and nicely indexed code modules, though, you'll find this book useful as a reference after you're under way with LWP. --David Wall

Topics covered: How to program with LWP and Perl itself. All of LWP's strong points--including HTML parsing (with tokens and trees as well as with regular expressions), HTML generation and modification, manipulation of HTML forms, and the operation of spiders--are covered. This book has more of a tutorial tone than any similar reference material on the Internet.

About the Author:
Sean Burke is an active member in the Perl community and one of CPAN's most prolific module authors. He has been a columnist for The Perl Journal since 1998, and is an authority on markup languages. Trained as a linguist, he also develops tools for software internationalization and Native language preservation.

"About this title" may belong to another edition of this title.

  • Publisher: O'Reilly Media
  • Publication date: 2002
  • ISBN 10: 0596001789
  • ISBN 13: 9780596001780
  • Binding: Paperback
  • Edition number: 1
  • Number of pages: 260

Other Popular Editions of the Same Title

9789350230169: Perl and LWP

Featured Edition

ISBN 10:  935023016X ISBN 13:  9789350230169
Publisher: SHROFF
Softcover

  • 9788173665530: [(Perl and LWP)] [by: Sean M. Burke]

    O'..., 2002
    Softcover

Top Search Results from the AbeBooks Marketplace

Sean M. Burke
Published by O'Reilly Media (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Soft Cover Quantity: 10
Seller: booksXpress (Bayonne, NJ, U.S.A.)

Book Description: Soft Cover. Condition: new. Seller Inventory # 9780596001780

Buy New: US$ 22.61
Shipping: FREE within U.S.A.

Burke, Sean
Published by O'Reilly Media (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Paperback Quantity: 1
Seller: GoldenWavesOfBooks (Fayetteville, TX, U.S.A.)

Book Description: Paperback. Condition: new. New. Fast Shipping and good customer service. Seller Inventory # Holz_New_0596001789

Buy New: US$ 21.27
Shipping: US$ 4.00 within U.S.A.

Burke, Sean M.
Published by O'Reilly Media (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Softcover Quantity: 4
Seller: GreatBookPrices (Columbia, MD, U.S.A.)

Book Description: Condition: New. Seller Inventory # 716760-n

Buy New: US$ 27.58
Shipping: US$ 2.64 within U.S.A.

Burke, Sean M.
Published by O'Reilly Media 6/30/2002 (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Paperback or Softback Quantity: 5
Seller: BargainBookStores (Grand Rapids, MI, U.S.A.)

Book Description: Paperback or Softback. Condition: New. Perl & Lwp 0.92. Book. Seller Inventory # BBS-9780596001780

Buy New: US$ 30.23
Shipping: FREE within U.S.A.

Sean M. Burke
Published by O'Reilly Media (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Softcover Quantity: > 20
Seller: Lakeside Books (Benton Harbor, MI, U.S.A.)

Book Description: Condition: New. Brand New! Not Overstocks or Low Quality Book Club Editions! Direct From the Publisher! We're not a giant, faceless warehouse organization! We're a small town bookstore that loves books and loves its customers! Buy from Lakeside Books! Seller Inventory # OTF-S-9780596001780

Buy New: US$ 26.53
Shipping: US$ 3.99 within U.S.A.

Sean M. Burke
Published by O'Reilly Media (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Paperback Quantity: 1
Seller: GoldBooks (Denver, CO, U.S.A.)

Book Description: Paperback. Condition: new. New Copy. Customer Service Guaranteed. Seller Inventory # think0596001789

Buy New: US$ 27.47
Shipping: US$ 4.25 within U.S.A.

Burke, Sean
Published by O'Reilly Media (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Softcover Quantity: > 20
Seller: Lucky's Textbooks (Dallas, TX, U.S.A.)

Book Description: Condition: New. Seller Inventory # ABLIING23Feb2416190070675

Buy New: US$ 29.98
Shipping: US$ 3.99 within U.S.A.

Burke, Sean
Published by O'Reilly Media (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Softcover Quantity: > 20
Seller: California Books (Miami, FL, U.S.A.)

Book Description: Condition: New. Seller Inventory # I-9780596001780

Buy New: US$ 39.00
Shipping: FREE within U.S.A.

Burke, Sean
Published by O'Reilly Media (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Softcover Quantity: 1
Seller: GF Books, Inc. (Hawthorne, CA, U.S.A.)

Book Description: Condition: New. Book is in NEW condition. Seller Inventory # 0596001789-2-1

Buy New: US$ 39.99
Shipping: FREE within U.S.A.

Sean M. Burke
Published by O'Reilly Media, Inc, USA (2002)
ISBN 10: 0596001789 ISBN 13: 9780596001780
New Paperback Quantity: 1
Seller: THE SAINT BOOKSTORE (Southport, United Kingdom)

Book Description: Paperback. Condition: New. New copy - Usually dispatched within 4 working days. This text covers topics including: understanding LWP and its design; fetching and analyzing URLs; extracting information from HTML using regular expressions and tokens; working with the structure of HTML documents using trees; and setting and inspecting HTTP headers and response codes. Seller Inventory # B9780596001780

Buy New: US$ 34.04
Shipping: US$ 11.15 from United Kingdom to U.S.A.
