A Methodology for Generated Text Annotation for High Quality Speech Synthesis

TitleA Methodology for Generated Text Annotation for High Quality Speech Synthesis
Publication TypeConference Paper
Year of Publication2019
AuthorsSpiliotopoulos D, Vassilakis C, Margaris D, Kotis K
Conference NameProceedings of the 10th International Conference on Information, Intelligence, Systems and Applications (IISA2019)
Date Publishedjul
KeywordsNatural language generation, Natural language processing, Prosody enrichment, Semantic feature annotation, Text-to-speech
AbstractNatural Language Generators may produce linguistically en-riched text descriptions that can lead to significantly improved quality of speech synthesis. There are cases, however, where either the generator modules produce pieces of non-analysed, non-annotated plain text, or such modules are not available at all. Moreover, the language analysis is restricted by the usually limited domain coverage of the generator. For those cases the enriched input to the speech synthesizer needs to be produced from plain text in order to maintain speech quality. This work reports on the method for automatic domain dependent annota-tion of plain text, through the utilisation of the linguistic infor-mation from rich generated text. The synthetic speech from the resulting prosody models is evaluated by human participants showing annotation results for plain text quite on par with the rich generated text. This leads to improved perceived natural-ness of the synthesized speech.