About this Item
Hardcover, vi + 280 pages, NOT ex-library. Minor handling wear only; the book is fresh, clean and bright throughout, with unmarked text, free of inscriptions and stamps, and firmly bound. Issued without a dust jacket.

A collection of research in quantitative linguistics and text mining. It brings together international scholars to demonstrate how mathematical models and statistical data can be used to uncover the underlying structures of human language. The book is dedicated to the memory of Luděk Hřebíček, a pioneer in the field, and reflects his influence on treating language as a dynamic, self-organising system.

- Quantitative Models: The contributors apply various statistical laws (such as Zipf's Law and the Menzerath-Altmann Law) to analyse word frequency, sentence length, and the distribution of linguistic units.
- Textual Dynamics: Several chapters explore "textual space," examining how information is distributed throughout a document and how different parts of a text relate to one another mathematically.
- Information Theory: The book uses concepts like entropy and information density to measure the complexity of different languages and genres.
- Applied Linguistics: Beyond pure theory, the text demonstrates how these models can be applied to authorship attribution (identifying who wrote a text), stylometry, and machine translation.

The research is categorised into several specialised fields:

- Lexicology and Grammar: Statistical analysis of vocabulary growth and the hierarchical structure of grammatical units.
- Dialectology and Typology: Using data-driven methods to compare different languages and regional dialects.
- History of Linguistics: Essays reflecting on the evolution of quantitative methods from the mid-20th century to the era of Big Data.
- Digital Humanities: Demonstrations of how computational tools can process vast corpora of historical and contemporary texts to find patterns invisible to the naked eye.

The book is highly relevant for researchers in natural language processing (NLP) and corpus linguistics. It provides a rigorous theoretical foundation for the algorithms that power modern search engines and AI language models.
Seller Inventory # 011694