Using OpenRefine: The Essential OpenRefine Guide That Takes You from Data Analysis and Error Fixing to Linking Your Dataset to the Web - Softcover

  • 3.79 out of 5 stars
    28 ratings by Goodreads
 
Synopsis

With this book on OpenRefine, managing and cleaning your large datasets suddenly gets a lot easier! With a cookbook approach and free datasets included, you'll quickly and painlessly improve your data management capabilities.

Overview

  • Create links between your dataset and others in an instant
  • Effectively transform data with regular expressions and the General Refine Expression Language
  • Spot issues in your dataset and take effective action with just a few clicks

In Detail

Data is supposed to be the new gold, but how can you unlock the value in your data? Managing large datasets used to be a task for specialists, but you don't have to worry about inconsistencies or errors anymore. OpenRefine lets you clean, link, and publish your dataset with ease.

Using OpenRefine takes you on a practical tour of all the handy features of this well-known data transformation tool. It is a hands-on recipe book that teaches you data techniques by example. Starting from the basics, it gradually transforms you into an OpenRefine expert.

This book will teach you all the necessary skills to handle any large dataset and to turn it into high-quality data for the Web. After you learn how to analyze data and spot issues, we'll see how we can solve them to obtain a clean dataset. Messy and inconsistent data is recovered through advanced techniques such as automated clustering. We'll then show how to extract links from keyword and full-text fields using reconciliation and named-entity extraction.

Using OpenRefine is more than a manual: it's a guide stuffed with tips and tricks to get the best out of your data.

What you will learn from this book

  • Import data in various formats
  • Explore datasets in a matter of seconds
  • Apply basic and advanced cell transformations
  • Deal with cells that contain multiple values
  • Create instantaneous links between datasets
  • Filter and partition your data easily with regular expressions
  • Use named-entity extraction on full-text fields to automatically identify topics
  • Perform advanced data operations with the General Refine Expression Language

Approach

The book is styled as a cookbook: its recipes, combined with free datasets, will turn readers into proficient OpenRefine users in the fastest possible way.

Who this book is written for

This book is targeted at anyone who works on or handles a large amount of data. No prior knowledge of OpenRefine is required, as we start from the very beginning and gradually reveal more advanced features. You don't even need your own dataset, as we provide example data to try out the book's recipes.

About the Author

Ruben Verborgh

Ruben Verborgh is a PhD researcher in semantic hypermedia, and is fascinated by the Web's immense possibilities. He tries to contribute ideas that will maybe someday slightly influence the way the Web changes all of us. His degree in Computer Science Engineering convinced him more than ever that communication is the most crucial thing for IT-based solutions. This is why he really enjoys explaining things to those eager to learn. In 2011, he launched the Free Your Metadata project together with Seth van Hooland and Max De Wilde, which aims to evangelize the importance of putting your data on the Web. This book is one of the assets in this continuing quest.

Ruben currently works at Multimedia Lab, a research group of iMinds, Ghent University, Belgium, in the domains of the Semantic Web, web APIs, and adaptive hypermedia. Together with Seth van Hooland, he's currently writing Linked Data for Libraries, Archives, and Museums, a practical guide for metadata practitioners.

  • Publisher: Packt Pub Ltd
  • Publication date: 2013
  • ISBN 10: 1783289082
  • ISBN 13: 9781783289080
  • Binding: Paperback
  • Language: English
  • Number of pages: 95

Search results for Using OpenRefine: The Essential OpenRefine Guide That...

Verborgh, Ruben; De Wilde, Max
Published by Packt Publishing, 2013
ISBN 10: 1783289082 ISBN 13: 9781783289080
Used Softcover

Seller: SecondSale, Montgomery, IL, U.S.A.

Seller rating: 5 out of 5 stars

Condition: Good. Item in good condition. Textbooks may not include supplemental items, e.g. CDs, access codes, etc. Seller Inventory # 00029509529

Buy Used

US$ 4.97
Shipping: FREE
Within U.S.A.

Quantity: 2 available

Verborgh, Ruben; De Wilde, Max
Published by Packt Pub Ltd, 2013
ISBN 10: 1783289082 ISBN 13: 9781783289080
Used Softcover

Seller: More Than Words, Waltham, MA, U.S.A.

Seller rating: 5 out of 5 stars

Condition: Good. All orders guaranteed and ship within 24 hours. Before placing your order, please contact us for confirmation of the book's binding. Check out our other listings to add to your order for discounted shipping. Seller Inventory # BOS-E-09i-01494

Buy Used

US$ 1.00
Shipping: US$ 3.99
Within U.S.A.

Quantity: 1 available

Verborgh, Ruben; De Wilde, Max
Published by Packt Pub Ltd, 2013
ISBN 10: 1783289082 ISBN 13: 9781783289080
Used paperback

Seller: The Maryland Book Bank, Baltimore, MD, U.S.A.

Seller rating: 5 out of 5 stars

paperback. Condition: Very Good. Revised ed. Used - Very Good. Seller Inventory # 5-T-5-0262

Buy Used

US$ 1.75
Shipping: US$ 4.20
Within U.S.A.

Quantity: 1 available

Verborgh, Ruben; De Wilde, Max
Published by Packt Publishing
ISBN 10: 1783289082 ISBN 13: 9781783289080
Used Paperback

Seller: ThriftBooks-Dallas, Dallas, TX, U.S.A.

Seller rating: 5 out of 5 stars

Paperback. Condition: Very Good. No Jacket. May have limited writing in cover pages. Pages are unmarked. Seller Inventory # G1783289082I4N00

Buy Used

US$ 7.98
Shipping: FREE
Within U.S.A.

Quantity: 1 available

De Wilde, Max
Published by Packt Pub Ltd, 2013
ISBN 10: 1783289082 ISBN 13: 9781783289080
Used Paperback

Seller: WorldofBooks, Goring-By-Sea, WS, United Kingdom

Seller rating: 5 out of 5 stars

Paperback. Condition: Very Good. The book has been read, but is in excellent condition. Pages are intact and not marred by notes or highlighting. The spine remains undamaged. Seller Inventory # GOR006477184

Buy Used

US$ 1.45
Shipping: US$ 7.44
From United Kingdom to U.S.A.

Quantity: 1 available

Verborgh, Ruben; De Wilde, Max
Published by Packt Publishing, Limited, 2013
ISBN 10: 1783289082 ISBN 13: 9781783289080
Used Softcover

Seller: Better World Books, Mishawaka, IN, U.S.A.

Seller rating: 5 out of 5 stars

Condition: Good. Used book that is in clean, average condition without any missing pages. Seller Inventory # 39247975-6

Buy Used

US$ 9.42
Shipping: FREE
Within U.S.A.

Quantity: 1 available

Verborgh, Ruben; De Wilde, Max; Huynh, David (FRW)
Published by Packt Pub Ltd, 2013
ISBN 10: 1783289082 ISBN 13: 9781783289080
New Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating: 5 out of 5 stars

Condition: New. Seller Inventory # 20119044-n

Buy New

US$ 39.47
Shipping: US$ 2.64
Within U.S.A.

Quantity: Over 20 available

Verborgh, Ruben; De Wilde, Max
Published by Packt Pub Ltd, 2013
ISBN 10: 1783289082 ISBN 13: 9781783289080
New Softcover

Seller: Lucky's Textbooks, Dallas, TX, U.S.A.

Seller rating: 5 out of 5 stars

Condition: New. Seller Inventory # ABLIING23Mar2912160162956

Buy New

US$ 38.13
Shipping: US$ 3.99
Within U.S.A.

Quantity: Over 20 available

Verborgh, Ruben
Published by Packt Publishing, 9/10/2013
ISBN 10: 1783289082 ISBN 13: 9781783289080
New Paperback or Softback

Seller: BargainBookStores, Grand Rapids, MI, U.S.A.

Seller rating: 5 out of 5 stars

Paperback or Softback. Condition: New. Seller Inventory # BBS-9781783289080

Buy New

US$ 42.86
Shipping: FREE
Within U.S.A.

Quantity: 5 available

Verborgh, Ruben; De Wilde, Max
Published by Packt Pub Ltd, 2013
ISBN 10: 1783289082 ISBN 13: 9781783289080
New Softcover

Seller: California Books, Miami, FL, U.S.A.

Seller rating: 5 out of 5 stars

Condition: New. Seller Inventory # I-9781783289080

Buy New

US$ 44.00
Shipping: FREE
Within U.S.A.

Quantity: Over 20 available

There are 16 more copies of this book