Web Corpus Construction (Synthesis Lectures on Human Language Technologies) - Softcover

Schäfer, Roland; Bildhauer, Felix

9783031010248: Web Corpus Construction (Synthesis Lectures on Human Language Technologies)

Softcover

ISBN 10: 3031010248 ISBN 13: 9783031010248

Publisher: Springer, 2013

2 Used

From US$ 45.11

11 New

From US$ 45.51

The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. This approach has several advantages: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitude the size of language resources offered elsewhere. (iv) The data is locally available to the user, and it can be linguistically post-processed and queried with the tools preferred by her/him. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups, including boilerplate removal and removal of duplicated content. Linguistic processing, and the problems that the different kinds of noise in web corpora cause for it, are also covered. Finally, the authors show how web corpora can be evaluated and compared to other corpora (such as traditionally compiled corpora).

For additional material, please visit the companion website: sites.morganclaypool.com/wcc

Table of Contents: Preface / Acknowledgments / Web Corpora / Data Collection / Post-Processing / Linguistic Processing / Corpus Evaluation and Comparison / Bibliography / Authors' Biographies

"synopsis" may belong to another edition of this title.

About the Author

Roland Schäfer studied Theoretical and Indo-European Linguistics as well as Japanese Linguistics at Marburg and Bochum Universities. He completed his doctorate, "Arguments and Adjuncts at the Syntax-Semantics Interface," in 2008 at Göttingen University, supervised by Gert Webelhuth and Regine Eckardt. Since then, he has been working as a research assistant at Freie Universität Berlin, mainly doing corpus-based research on semantic and morpho-syntactic phenomena. In 2011, he started working on the COW ("Corpora from the Web") project with Felix Bildhauer. His teaching experience covers a wide range of topics, including Theoretical and Corpus Linguistics, English and German Linguistics, and Computational Linguistics.

"About this title" may belong to another edition of this title.

Publisher: Springer
Publication date: 2013
Language: English
ISBN 10: 3031010248
ISBN 13: 9783031010248
Binding: Paperback
Edition number: 1
Number of pages: 148

Buy Used

Condition: As New

Unread book in perfect condition...

US$ 45.11

US$ 2.64 shipping within U.S.A.

Buy New

US$ 45.51

US$ 15.98 shipping from United Kingdom to U.S.A.

Free 30-day returns

Other Popular Editions of the Same Title

9781608459834: Web Corpus Construction (Synthesis Lectures on Human Language Technologies, 22)

Featured Edition

ISBN 10: 1608459837 ISBN 13: 9781608459834
Publisher: Morgan & Claypool Publishers, 2013
Softcover

Springer, 2013 (Softcover)

Search results for Web Corpus Construction (Synthesis Lectures on Human...

Web Corpus Construction

Schäfer, Roland; Bildhauer, Felix

Published by Springer, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

Used Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars

Condition: As New. Unread book in perfect condition. Seller Inventory # 44545674

Buy Used

US$ 45.11

Shipping: US$ 2.64

Within U.S.A.

Quantity: Over 20 available

Web Corpus Construction (Synthesis Lectures on Human Language Technologies)

Schäfer, Roland; Bildhauer, Felix

Published by Springer, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

New Softcover

Seller: Ria Christie Collections, Uxbridge, United Kingdom

Seller rating 5 out of 5 stars

Condition: New. In. Seller Inventory # ria9783031010248_new

Buy New

US$ 45.51

Shipping: US$ 15.98

From United Kingdom to U.S.A.

Quantity: Over 20 available

Web Corpus Construction

Schäfer, Roland

Published by Springer 2013-07, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

New PF

Seller: Chiron Media, Wallingford, United Kingdom

Seller rating 4 out of 5 stars

PF. Condition: New. Seller Inventory # 6666-IUK-9783031010248

Buy New

US$ 41.01

Shipping: US$ 20.66

From United Kingdom to U.S.A.

Quantity: 10 available

Web Corpus Construction

Schäfer, Roland; Bildhauer, Felix

Published by Springer, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

New Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars

Condition: New. Seller Inventory # 44545674-n

Buy New

US$ 59.37

Shipping: US$ 2.64

Within U.S.A.

Quantity: Over 20 available

Web Corpus Construction (Synthesis Lectures on Human Language Technologies)

Schäfer, Roland; Bildhauer, Felix

Published by Springer, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

New Softcover

Seller: Books Puddle, New York, NY, U.S.A.

Seller rating 4 out of 5 stars

Condition: New. 1st edition NO-PA16APR2015-KAP. Seller Inventory # 26394683696

Buy New

US$ 58.33

Shipping: US$ 3.99

Within U.S.A.

Quantity: 4 available

Web Corpus Construction

Schäfer, Roland; Bildhauer, Felix

Published by Springer, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

New Softcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating 5 out of 5 stars

Condition: New. Seller Inventory # 44545674-n

Buy New

US$ 44.41

Shipping: US$ 20.00

From United Kingdom to U.S.A.

Quantity: Over 20 available

Web Corpus Construction (Synthesis Lectures on Human Language Technologies)

Schäfer, Roland; Bildhauer, Felix

Published by Springer, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

New Softcover

Print on Demand

Seller: Majestic Books, Hounslow, United Kingdom

Seller rating 5 out of 5 stars

Condition: New. Print on Demand. Seller Inventory # 401726191

Buy New

US$ 58.48

Shipping: US$ 8.67

From United Kingdom to U.S.A.

Quantity: 4 available

Web Corpus Construction

Felix Bildhauer

Published by Springer International Publishing Jul 2013, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

New Paperback

Print on Demand

Seller: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germany

Seller rating 5 out of 5 stars

Paperback. Condition: New. This item is printed on demand and takes 3-4 days longer. New stock. 148 pp. English. Seller Inventory # 9783031010248

Buy New

US$ 42.40

Shipping: US$ 26.82

From Germany to U.S.A.

Quantity: 2 available

Web Corpus Construction

Schäfer, Roland; Bildhauer, Felix

Published by Springer, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

Used Softcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating 5 out of 5 stars

Condition: As New. Unread book in perfect condition. Seller Inventory # 44545674

Buy Used

US$ 49.55

Shipping: US$ 20.00

From United Kingdom to U.S.A.

Quantity: Over 20 available

Web Corpus Construction (Synthesis Lectures on Human Language Technologies)

Schäfer, Roland; Bildhauer, Felix

Published by Springer, 2013

ISBN 10: 3031010248 ISBN 13: 9783031010248

New Softcover

Print on Demand

Seller: Biblios, Frankfurt am Main, Hesse, Germany

Seller rating 5 out of 5 stars

Condition: New. Print on Demand. Seller Inventory # 18394683706

Buy New

US$ 64.33

Shipping: US$ 11.60

From Germany to U.S.A.

Quantity: 4 available

There are 3 more copies of this book
