Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions?

0
Posted

Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions?

0

Elizabeth Shriberg (SRI International) Sachin Kajarekar (SRI International) Nicolas Scheffer (SRI International) Intersession variability (ISV) compensation in speaker recognition is well studied with respect to extrinsic variation, but little is known about its ability to model intrinsic variation. We find that ISV compensation is remarkably successful on a corpus of intrinsic variation that is highly controlled for channel (a dominant component of ISV). The results are particularly surprising because the ISV training data come from a different corpus than do speaker train and test data. We further find that relative improvements are (1) inversely related to uncompensated performance, (2) reduced more by vocal effort train/test mismatch than by speaking style mismatch, and (3) reduced additionally for mismatches in both style and level. Results demonstrate that intersession variability compensation does model intrinsic variation, and suggest that mismatched data may be more useful tha

Related Questions

What is your question?

*Sadly, we had to bring back ads too. Hopefully more targeted.

Experts123