Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

Can you elaborate on the difference between diphone synthesis and unit selection synthesis?

April 26, 2017difference elaborate selection synthesis

0

Posted

Can you elaborate on the difference between diphone synthesis and unit selection synthesis?

1 Answer

0

Posted

In unit selection the size of the synthesis unit chosen for a particular system may be one of many choices: half-phones, phones, diphones, syllables, etc. The key idea is that we get multiple examples of the same unit in different contexts (where context may be some combination of adjoining phonemes and maybe prosody features, e.g. emphasized or non-emphasized). We can cluster the examples to find representative units for acoustically different units. With diphone synthesis, we only use one example of each phone-phone transition and do not have different versions of the diphone depending on context. Unit selection systems are more difficult to build and do require more labelled data, but may produce much better quality than diphone (though not necessarily).