Whats the tradeoff between improving accuracy using speaker-adaptation and by improving the pronunciation dictionary?
Solution: Adaptation will shift the acoustics for each phoneme from the global, speaker-independent distributions to the particular distribution of the individual speaker being adapted to. This significantly improves accuracy at a general level. However, adaptation doesn’t effect the phoneme patterns in the dictionary for individual words. If you have an error or dialect miscorrespondence in the dictionary, then even after adaptation it can remain the source of repeated systematic errors when that word is (mis-)recognized. On the other hand, if you fix the dictionary without doing adaptation, you will have entirely removed that source of error. The global benefits of adaptation are basically an independent issue. If there is an interaction between dictionary entry adjustment and adaption, it is that adaptation on data using bad pronunciations will adapt phones that should be quite different in the direction that makes them overlap each other, so that they are harder to recognize as bei