Robust Automatic Speech Recognition: A Bridge to Practical Applications

9780128023983: Robust Automatic Speech Recognition: A Bridge to Practical Applications

Robust Automatic Speech Recognition: A Bridge to Practical Applications establishes a solid foundation for automatic speech recognition that is robust against acoustic environmental distortion. It provides a thorough overview of classical and modern noise- and reverberation-robust techniques developed over the past thirty years, with an emphasis on practical methods that have proven successful and are likely to be developed further for future applications. The strengths and weaknesses of robustness-enhancing speech recognition techniques are carefully analyzed. The book covers noise-robust techniques designed for acoustic models based on both Gaussian mixture models and deep neural networks. In addition, it provides a guide to selecting the best methods for practical applications. The reader will:

  • Gain a unified, deep and systematic understanding of the state-of-the-art technologies for robust speech recognition
  • Learn the links and relationships between alternative technologies for robust speech recognition
  • Be able to use the technology analysis and categorization detailed in the book to guide future technology development
  • Be able to develop new noise-robust methods in the current era of deep learning for acoustic modeling in speech recognition

Key Features:

  • The first book to provide a comprehensive review of noise- and reverberation-robust speech recognition methods in the era of deep neural networks
  • Connects robust speech recognition techniques to machine learning paradigms with rigorous mathematical treatment
  • Provides elegant and structured ways to categorize and analyze noise-robust speech recognition techniques
  • Written by leading researchers who have worked actively on the subject in both industrial and academic organizations for many years

From the Back Cover:

Learn how automatic speech recognition can be made robust in real-world applications

Robust Automatic Speech Recognition establishes a solid foundation for automatic speech recognition that is robust against acoustic environmental distortion. It provides a thorough overview of classical and modern noise- and reverberation-robust techniques developed over the past thirty years, with an emphasis on practical methods that have proven successful and are likely to be developed further for future applications.

The strengths and weaknesses of robustness-enhancing speech recognition techniques are carefully analyzed, and a guide to selecting the best methods for practical applications is provided. The book covers noise-robust techniques designed for acoustic models based on both Gaussian mixture models and deep neural networks.

Key Features:

  • This is the first book to provide a comprehensive review of noise- and reverberation-robust speech recognition methods in the era of deep neural networks
  • It connects robust speech recognition techniques to machine learning paradigms with rigorous mathematical treatment
  • It provides elegant and structured ways to categorize and analyze noise-robust speech recognition techniques
  • It is written by leading researchers who have worked actively on the subject in both industrial and academic organizations for many years

The reader will:

  • Gain a unified, deep and systematic understanding of the state-of-the-art technologies for robust speech recognition
  • Learn the links and relationships between alternative technologies for robust speech recognition
  • Be able to use the technology analysis and categorization detailed in the book to guide future technology development
  • Be able to develop new noise-robust methods in the current era of deep learning for acoustic modeling in speech recognition

About the Author:

Jinyu Li received a Ph.D. degree from Georgia Institute of Technology, U.S. From 2000 to 2003, he was a Researcher at Intel China Research Center and a Research Manager at iFlytek, China. Currently, he is a Principal Applied Scientist at Microsoft, working as a technical lead to design and improve speech modeling algorithms and technologies that ensure industry state-of-the-art speech recognition accuracy for Microsoft products. His major research interests cover several topics in speech recognition and machine learning, including noise robustness, deep learning, discriminative training, and feature extraction. He has authored over 60 papers and been awarded over 10 patents.

Li Deng received a Ph.D. degree from the University of Wisconsin-Madison, US. He was a professor (1989-1999) at the University of Waterloo, Canada. In 1999, he joined Microsoft Research, where he currently leads R&D of application-focused deep learning as Partner Research Manager of its Deep Learning Technology Center. He is also an Affiliate Professor at the University of Washington. He is a Fellow of the Acoustical Society of America, Fellow of the IEEE, and Fellow of the International Speech Communication Association. He served as Editor-in-Chief for the IEEE Signal Processing Magazine and for the IEEE/ACM Transactions on Audio, Speech and Language Processing (2009-2014). His technical work has focused on deep learning for speech, language, image, and multimodal processing, and for other areas of machine intelligence involving big data. He has received numerous awards, including the IEEE SPS Best Paper Awards, IEEE Outstanding Engineer Award, and APSIPA Industrial Distinguished Leader Award.

Reinhold Haeb-Umbach is a professor at the University of Paderborn, Germany. His main research interests are in the fields of statistical signal processing and pattern recognition, with applications to speech enhancement, acoustic beamforming and source separation, as well as automatic speech recognition. After working in industrial research laboratories for more than 10 years, he joined academia as a full professor of Communications Engineering in 2001. He has published more than 150 papers in peer-reviewed journals and conferences.

Yifan Gong served the National Scientific Research Center (CNRS) and INRIA, France, as Research Engineer and then joined CNRS as Senior Research Scientist. He was a Visiting Research Fellow at the Communications Research Center of Canada. As Senior Member of Technical Staff, he worked for Texas Instruments at the Speech Technologies Lab, where he developed speech modeling technologies robust against noisy environments, designed systems, algorithms, and software for speech and speaker recognition, and delivered memory- and CPU-efficient recognizers for mobile devices. Yifan joined Microsoft in 2004, and is currently a Principal Science Manager in the areas of speech modeling, computing infrastructure, and speech model development for speech products. His research interests include automatic speech recognition/interpretation, signal processing, algorithm development, and engineering process/infrastructure and management. He has authored over 130 publications and been awarded over 30 patents. Specific contributions include stochastic trajectory modeling, source normalization HMM training, joint compensation of additive and convolutional noises, and variable parameter HMM. In these areas, he has given tutorials and presentations at international conferences. He has served as a technical committee member and session chair for many international conferences, and with the IEEE Signal Processing Spoken Language Technical Committees from 1998 to 2002 and since 2013.

Buy New
List Price: US$ 135.00
US$ 88.77

Shipping: FREE
From United Kingdom to U.S.A.

Top Search Results from the AbeBooks Marketplace

1.

Jinyu Li, Li Deng, Reinhold Haeb-Umbach
Published by Elsevier Science Publishing Co Inc, United States (2015)
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Hardcover Quantity Available: 1
Seller
The Book Depository US
(London, United Kingdom)

Book Description Elsevier Science Publishing Co Inc, United States, 2015. Hardback. Book Condition: New. 235 x 190 mm. Language: English. Brand New Book. Bookseller Inventory # AAZ9780128023983

Buy New
US$ 88.77
Shipping: FREE
From United Kingdom to U.S.A.

2.

Jinyu Li, Li Deng, Reinhold Haeb-Umbach
Published by Elsevier Science Publishing Co Inc, United States (2015)
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Hardcover Quantity Available: 1
Seller
The Book Depository
(London, United Kingdom)

Book Description Elsevier Science Publishing Co Inc, United States, 2015. Hardback. Book Condition: New. 235 x 190 mm. Language: English. Brand New Book. Bookseller Inventory # AAZ9780128023983

Buy New
US$ 93.43
Shipping: FREE
From United Kingdom to U.S.A.

3.

Li, Jinyu
Published by Academic Press (2015)
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Quantity Available: 1
Seller
Books2Anywhere
(Fairford, GLOS, United Kingdom)

Book Description Academic Press, 2015. HRD. Book Condition: New. New Book. Shipped from UK in 4 to 14 days. Established seller since 2000. Bookseller Inventory # GB-9780128023983

Buy New
US$ 84.45
Shipping: US$ 11.65
From United Kingdom to U.S.A.

4.

LI
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Quantity Available: 1
Seller
firstbookstore
(New Delhi, India)

Book Description Book Condition: Brand New. Brand New Original US Edition, Perfect Condition. Printed in English. Excellent quality, service and customer satisfaction guaranteed! Bookseller Inventory # AIND-93535

Buy New
US$ 97.11
Shipping: FREE
From India to U.S.A.

5.

Li, Jinyu
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Quantity Available: 1
Seller
Bookshub
(Karol Bagh, India)

Book Description Book Condition: New. New. US edition. Perfect condition. Customer satisfaction our priority. Bookseller Inventory # ABE-FEB-18311

Buy New
US$ 98.22
Shipping: FREE
From India to U.S.A.

6.

Li, Jinyu
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Quantity Available: 1
Seller
Bookshub
(Karol Bagh, India)

Book Description Book Condition: New. New. US edition. Perfect condition. Customer satisfaction our priority. Bookseller Inventory # ABE-FEB-447

Buy New
US$ 98.22
Shipping: FREE
From India to U.S.A.

7.

Li, Jinyu
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Quantity Available: 1
Seller
EBOOKSTORE2010
(New Delhi, ND, India)

Book Description Book Condition: Brand New. New. US edition. Customer satisfaction guaranteed! Bookseller Inventory # SHUB18311

Buy New
US$ 98.27
Shipping: FREE
From India to U.S.A.

8.

Li, Jinyu
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Quantity Available: 1
Seller
EBOOKSTORE2010
(New Delhi, ND, India)

Book Description Book Condition: Brand New. New. US edition. Customer satisfaction guaranteed! Bookseller Inventory # SHUB447

Buy New
US$ 98.27
Shipping: FREE
From India to U.S.A.

9.

LI
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Quantity Available: 1
Seller
Romtrade Corp.
(STERLING HEIGHTS, MI, U.S.A.)

Book Description Book Condition: New. Brand New Original US Edition. We also ship to PO Box addresses. Expedited shipping option also available for faster delivery. Bookseller Inventory # AUSBNEW-93535

Buy New
US$ 102.17
Shipping: FREE
Within U.S.A.

10.

Li, Jinyu
ISBN 10: 0128023988 ISBN 13: 9780128023983
New Quantity Available: 1
Seller
Basi6 International
(Irving, TX, U.S.A.)

Book Description Book Condition: Brand New. New, US edition. Excellent Customer Service. Bookseller Inventory # ABEUSA-18311

Buy New
US$ 102.18
Shipping: FREE
Within U.S.A.
