Items related to Similarity Joins in Relational Database Systems (Synthesis...

Similarity Joins in Relational Database Systems (Synthesis Lectures on Data Management) - Softcover

 
9783031007231: Similarity Joins in Relational Database Systems (Synthesis Lectures on Data Management)

Synopsis

State-of-the-art database systems manage and process a variety of complex objects, including strings and trees. For such objects equality comparisons are often not meaningful and must be replaced by similarity comparisons. This book describes the concepts and techniques to incorporate similarity into database systems. We start out by discussing the properties of strings and trees, and identify the edit distance as the de facto standard for comparing complex objects. Since the edit distance is computationally expensive, token-based distances have been introduced to speed up edit distance computations. The basic idea is to decompose complex objects into sets of tokens that can be compared efficiently. Token-based distances are used to compute an approximation of the edit distance and prune expensive edit distance calculations. A key observation when computing similarity joins is that many of the object pairs, for which the similarity is computed, are very different from each other. Filters exploit this property to improve the performance of similarity joins. A filter preprocesses the input data sets and produces a set of candidate pairs. The distance function is evaluated on the candidate pairs only. We describe the essential query processing techniques for filters based on lower and upper bounds. For token equality joins we describe prefix, size, positional and partitioning filters, which can be used to avoid the computation of small intersections that are not needed since the similarity would be too low.

"synopsis" may belong to another edition of this title.

About the Author

Nikolaus Augsten is a professor in the Department of Com puter Science at the University of Salzburg, Austria, where he heads the Database Group. He received his Ph.D. degree in computer science from Aalborg University, Denmark, in 2008, and holds a M.Sc. degree from Graz University of Technol ogy, Austria. Prior to joining the University of Salzburg in 2013, he was an assistant professor at the Free University of Bolzano, Italy. He was on leave at TU München, Germany, in 2010/2011 and visited Washington State University for six months in 2005/2006. His main research interests include sim ilarity search queries over massive data collections, approximate matching techniques for complex data structures, efficient in dex structures for distance computations, and top-k queries. For his work on top-k approximate subtree matching he received the Best Paper Award at the IEEE International Conference on Data Engineering in 2010. Currently, he serves as an Associate Editor for the VLDB Journal.Michael H. Böhlen is a professor of computer science at the University of Zürich where he heads the Database Technology Group. His research interests include various aspects of data management, and have focused on time-varying information, data warehousing and data analysis, and similarity search. He received his M.Sc. and Ph.D. degrees from ETH Zürich in 1990 and 1994, respectively. Before joining the University of Zürich he visited the University of Arizona for one year, and was a faculty member at Aalborg University for eight years and the Free University of Bozen-Bolzano for six years. He was pro gram co-chair of the 39th International Conference on Very Large Data Bases and served as an Associate Editor for the VLDB Journal. He served as a PC member for SIGMOD, VLDB, ICDE, and EDBT. Cur rently, he serves as an Associate Editor for ACM TODS, and he is a member of the VLDB Endowment’s Board of Trustees

"About this title" may belong to another edition of this title.

  • PublisherSpringer
  • Publication date2013
  • ISBN 10 3031007239
  • ISBN 13 9783031007231
  • BindingPaperback
  • LanguageEnglish
  • Edition number1
  • Number of pages128

Buy Used

Condition: As New
Unread book in perfect condition...
View this item

US$ 2.64 shipping within U.S.A.

Destination, rates & speeds

Other Popular Editions of the Same Title

9781627050289: Similarity Joins in Relational Database Systems (Synthesis Lectures on Data Management)

Featured Edition

ISBN 10:  1627050280 ISBN 13:  9781627050289
Publisher: Morgan & Claypool Publishers, 2013
Softcover

Search results for Similarity Joins in Relational Database Systems (Synthesis...

Seller Image

Augsten, Nikolaus; Bohlen, Michael
Published by Springer, 2013
ISBN 10: 3031007239 ISBN 13: 9783031007231
Used Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: As New. Unread book in perfect condition. Seller Inventory # 44544138

Contact seller

Buy Used

US$ 44.70
Convert currency
Shipping: US$ 2.64
Within U.S.A.
Destination, rates & speeds

Quantity: Over 20 available

Add to basket

Seller Image

Augsten, Nikolaus; Bohlen, Michael
Published by Springer, 2013
ISBN 10: 3031007239 ISBN 13: 9783031007231
New Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. Seller Inventory # 44544138-n

Contact seller

Buy New

US$ 56.78
Convert currency
Shipping: US$ 2.64
Within U.S.A.
Destination, rates & speeds

Quantity: Over 20 available

Add to basket

Stock Image

Augsten, Nikolaus
Published by Springer 2013-11, 2013
ISBN 10: 3031007239 ISBN 13: 9783031007231
New PF

Seller: Chiron Media, Wallingford, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

PF. Condition: New. Seller Inventory # 6666-IUK-9783031007231

Contact seller

Buy New

US$ 38.59
Convert currency
Shipping: US$ 20.84
From United Kingdom to U.S.A.
Destination, rates & speeds

Quantity: 10 available

Add to basket

Stock Image

Augsten, Nikolaus; Bohlen, Michael
Published by Springer, 2013
ISBN 10: 3031007239 ISBN 13: 9783031007231
New Softcover

Seller: Ria Christie Collections, Uxbridge, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. In. Seller Inventory # ria9783031007231_new

Contact seller

Buy New

US$ 45.93
Convert currency
Shipping: US$ 16.12
From United Kingdom to U.S.A.
Destination, rates & speeds

Quantity: Over 20 available

Add to basket

Seller Image

Augsten, Nikolaus; Bohlen, Michael
Published by Springer, 2013
ISBN 10: 3031007239 ISBN 13: 9783031007231
New Softcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. Seller Inventory # 44544138-n

Contact seller

Buy New

US$ 42.03
Convert currency
Shipping: US$ 20.19
From United Kingdom to U.S.A.
Destination, rates & speeds

Quantity: Over 20 available

Add to basket

Stock Image

Augsten, Nikolaus; Bohlen, Michael
Published by Springer, 2013
ISBN 10: 3031007239 ISBN 13: 9783031007231
New Softcover

Seller: Books Puddle, New York, NY, U.S.A.

Seller rating 4 out of 5 stars 4-star rating, Learn more about seller ratings

Condition: New. 1st edition NO-PA16APR2015-KAP. Seller Inventory # 26395061324

Contact seller

Buy New

US$ 60.58
Convert currency
Shipping: US$ 3.99
Within U.S.A.
Destination, rates & speeds

Quantity: 4 available

Add to basket

Seller Image

Michael Bohlen
ISBN 10: 3031007239 ISBN 13: 9783031007231
New Taschenbuch
Print on Demand

Seller: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Taschenbuch. Condition: Neu. This item is printed on demand - it takes 3-4 days longer - Neuware -State-of-the-art database systems manage and process a variety of complex objects, including strings and trees. For such objects equality comparisons are often not meaningful and must be replaced by similarity comparisons. This book describes the concepts and techniques to incorporate similarity into database systems. We start out by discussing the properties of strings and trees, and identify the edit distance as the de facto standard for comparing complex objects. Since the edit distance is computationally expensive, token-based distances have been introduced to speed up edit distance computations. The basic idea is to decompose complex objects into sets of tokens that can be compared efficiently. Token-based distances are used to compute an approximation of the edit distance and prune expensive edit distance calculations. A key observation when computing similarity joins is that many of the object pairs, for which the similarity is computed, are very different from each other. Filters exploit this property to improve the performance of similarity joins. A filter preprocesses the input data sets and produces a set of candidate pairs. The distance function is evaluated on the candidate pairs only. We describe the essential query processing techniques for filters based on lower and upper bounds. For token equality joins we describe prefix, size, positional and partitioning filters, which can be used to avoid the computation of small intersections that are not needed since the similarity would be too low. 128 pp. Englisch. Seller Inventory # 9783031007231

Contact seller

Buy New

US$ 41.91
Convert currency
Shipping: US$ 26.51
From Germany to U.S.A.
Destination, rates & speeds

Quantity: 2 available

Add to basket

Stock Image

Augsten, Nikolaus; Bohlen, Michael
Published by Springer, 2013
ISBN 10: 3031007239 ISBN 13: 9783031007231
New Softcover
Print on Demand

Seller: Majestic Books, Hounslow, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. Print on Demand. Seller Inventory # 402364307

Contact seller

Buy New

US$ 59.90
Convert currency
Shipping: US$ 8.75
From United Kingdom to U.S.A.
Destination, rates & speeds

Quantity: 4 available

Add to basket

Seller Image

Augsten, Nikolaus; Bohlen, Michael
Published by Springer, 2013
ISBN 10: 3031007239 ISBN 13: 9783031007231
Used Softcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: As New. Unread book in perfect condition. Seller Inventory # 44544138

Contact seller

Buy Used

US$ 49.99
Convert currency
Shipping: US$ 20.19
From United Kingdom to U.S.A.
Destination, rates & speeds

Quantity: Over 20 available

Add to basket

Seller Image

Michael Bohlen
ISBN 10: 3031007239 ISBN 13: 9783031007231
New Taschenbuch

Seller: AHA-BUCH GmbH, Einbeck, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Taschenbuch. Condition: Neu. Druck auf Anfrage Neuware - Printed after ordering - State-of-the-art database systems manage and process a variety of complex objects, including strings and trees. For such objects equality comparisons are often not meaningful and must be replaced by similarity comparisons. This book describes the concepts and techniques to incorporate similarity into database systems. We start out by discussing the properties of strings and trees, and identify the edit distance as the de facto standard for comparing complex objects. Since the edit distance is computationally expensive, token-based distances have been introduced to speed up edit distance computations. The basic idea is to decompose complex objects into sets of tokens that can be compared efficiently. Token-based distances are used to compute an approximation of the edit distance and prune expensive edit distance calculations. A key observation when computing similarity joins is that many of the object pairs, for which the similarity is computed, are very different from each other. Filters exploit this property to improve the performance of similarity joins. A filter preprocesses the input data sets and produces a set of candidate pairs. The distance function is evaluated on the candidate pairs only. We describe the essential query processing techniques for filters based on lower and upper bounds. For token equality joins we describe prefix, size, positional and partitioning filters, which can be used to avoid the computation of small intersections that are not needed since the similarity would be too low. Seller Inventory # 9783031007231

Contact seller

Buy New

US$ 41.91
Convert currency
Shipping: US$ 33.74
From Germany to U.S.A.
Destination, rates & speeds

Quantity: 1 available

Add to basket

There are 3 more copies of this book

View all search results for this book