Text to Speech Synthesis: New Paradigms and Advances (Prentice Hall Imsc Press Multimedia Series)

9780131456617: Text to Speech Synthesis: New Paradigms and Advances (Prentice Hall Imsc Press Multimedia Series)

Text to speech synthesis (TTS) is a critical research and application area in the field of multimedia interfaces. Recent advances in TTS will affect a wide range of disciplines, from education, business, and entertainment applications to medical aids. Until recently, speech synthesis relied on models and rule-based approaches. While these yielded intelligible speech, the voice quality was unacceptable for widespread adoption. Fortunately, there has been a major paradigm shift in how speech synthesis is done: from rule-based to explicitly data-driven methods. Advances in computing and corpus-driven methodologies have opened exciting possibilities for research and development in this domain, yielding highly natural-sounding speech. The book focuses on recent advances and new paradigms in text to speech synthesis, contributed by leading experts from academia and industry around the world. No other book of this nature documents the recent research trends in a comprehensive way. This matters not only to researchers and students in the field but also to potential customers and other beneficiaries of the results. The book's chapters address key current topic areas in TTS: data-driven systems and unit selection; hybrid schemes (the interplay between data-driven and knowledge-based techniques); prosody models and generation; and expressive speech synthesis.

From the Back Cover:

Recent advances in speech synthesis will enable the development of high-quality natural voice systems with broad application in education, business, entertainment, and medicine. Text to Speech Synthesis is the first book to comprehensively document these new research trends and paradigms, balancing coverage of research and applications. It brings together seminal research by leaders in the field, drawn from both academic and industrial laboratories worldwide.

The authors and editors offer broad coverage of several key areas, including new unit selection approaches, speech representations and modeling, data-driven synthesis schemes, and expressive speech synthesis.

Coverage includes:

  • Unit Selection Methods: Reducing discontinuities at synthesis time in corpus-based speech processing, voice quality variation, and join costs
  • Hidden Markov Model (HMM)-Based Synthesis: Advanced uses of speech recognition technology, HMM-based multilingual speech synthesis, and new prosody control techniques
  • Expressive Speech Synthesis: Challenges, questions, and avenues of research, including diphone transplantation and minimization of pitch modification
  • Speech Representation and Models: A new articulatory modeling paradigm for controlling synthesis quality

This is an essential resource for all researchers working in speech synthesis and related areas such as multimedia signal processing, linguistics, and spoken user interfaces. It will also be valuable to any engineer, developer, or manager who must evaluate the latest speech technologies or integrate them into practical applications.



About the Author:

Dr. Shrikanth Narayanan is an associate professor at the Signal and Image Processing Institute of USC's Electrical Engineering Department. He founded and directs USC's Speech Analysis and Interpretation Laboratory, and serves as research area director of the Integrated Media Systems Center, an NSF Engineering Research Center. He is an associate editor of IEEE Transactions on Speech and Audio Processing, serves on the speech communication technical committee of the Acoustical Society of America, and was a Principal Member of Technical Staff at AT&T Laboratories.

Dr. Abeer Alwan, a professor of electrical engineering at UCLA, established and directs the Speech Processing and Auditory Perception Laboratory there. Her research interests include modeling human speech production and perception mechanisms and applying these models to speech-processing applications such as noise-robust automatic speech recognition, compression, and synthesis. She is a Fellow of the Acoustical Society of America and recently served as editor-in-chief of the journal Speech Communication.



List Price: US$ 95.00

Top Search Results from the AbeBooks Marketplace

1.

Narayanan, Shrikanth; Alwan, Abeer
Published by Prentice Hall PTR (2004)
ISBN 10: 013145661X ISBN 13: 9780131456617
New Hardcover Quantity Available: 1
Seller
Ergodebooks
(RICHMOND, TX, U.S.A.)

Book Description Prentice Hall PTR, 2004. Hardcover. Book Condition: New. Bookseller Inventory # DADAX013145661X

Buy New
US$ 170.26

Shipping: US$ 3.99
Within U.S.A.

2.

Narayanan, Shrikanth; Alwan, Abeer
Published by Prentice Hall PTR (2004)
ISBN 10: 013145661X ISBN 13: 9780131456617
New Hardcover Quantity Available: 1
Seller
Irish Booksellers
(Rumford, ME, U.S.A.)

Book Description Prentice Hall PTR, 2004. Hardcover. Book Condition: New. Bookseller Inventory # 013145661X

Buy New
US$ 328.77

Shipping: FREE
Within U.S.A.