Quality of Synthetic Speech [electronic resource] : Perceptual Dimensions, Influencing Factors, and Instrumental Assessment / by Florian Hinterleitner.

By: Hinterleitner, Florian [author.].
Contributor(s): SpringerLink (Online service).
Material type: Book
Series: T-Labs Series in Telecommunication Services
Publisher: Singapore : Springer Nature Singapore : Imprint: Springer, 2017
Edition: 1st ed. 2017
Description: XVI, 157 p. 29 illus. online resource
Content type: text
Media type: computer
Carrier type: online resource
ISBN: 9789811037344
Subject(s): Signal processing | User interfaces (Computer systems) | Human-computer interaction | Signal, Speech and Image Processing | User Interfaces and Human Computer Interaction
Additional physical formats: Printed edition
DDC classification: 621.382
Contents:
Introduction -- Speech Synthesis -- Auditory and Instrumental Quality Evaluation Metrics -- Perceptual Quality Dimensions -- Influencing Factors on Perceptual Quality -- Instrumental Quality Assessment -- Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System -- Conclusions.
In: Springer Nature eBook
Summary: This book reviews research on the perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient identification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed, and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.