Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

Whats the tradeoff between improving accuracy using speaker-adaptation and by improving the pronunciation dictionary?

April 26, 2017dictionary improving pronunciation tradeoff

0

Posted

Whats the tradeoff between improving accuracy using speaker-adaptation and by improving the pronunciation dictionary?

1 Answer

0

Posted

Solution: Adaptation will shift the acoustics for each phoneme from the global, speaker-independent distributions to the particular distribution of the individual speaker being adapted to. This significantly improves accuracy at a general level. However, adaptation doesn’t effect the phoneme patterns in the dictionary for individual words. If you have an error or dialect miscorrespondence in the dictionary, then even after adaptation it can remain the source of repeated systematic errors when that word is (mis-)recognized. On the other hand, if you fix the dictionary without doing adaptation, you will have entirely removed that source of error. The global benefits of adaptation are basically an independent issue. If there is an interaction between dictionary entry adjustment and adaption, it is that adaptation on data using bad pronunciations will adapt phones that should be quite different in the direction that makes them overlap each other, so that they are harder to recognize as bei