Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/89623
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHuang, Dong-Yanen
dc.contributor.authorXie, Leien
dc.contributor.authorZhang, Shaofeien
dc.contributor.authorLee, Yvonne Siu Waen
dc.contributor.authorWu, Jieen
dc.contributor.authorMing, Huaipingen
dc.contributor.authorTian, Xiaohaien
dc.contributor.authorDing, Chuangen
dc.contributor.authorLi, Meien
dc.contributor.authorNguyen, Quy Hyen
dc.contributor.authorDong, Minghuien
dc.contributor.authorChng, Eng Siongen
dc.contributor.authorLi, Haizhouen
dc.date.accessioned2019-08-20T04:39:20Zen
dc.date.accessioned2019-12-06T17:29:47Z-
dc.date.available2019-08-20T04:39:20Zen
dc.date.available2019-12-06T17:29:47Z-
dc.date.copyright2016-09-01en
dc.date.issued2016en
dc.identifier.citationHuang, D.-Y., Xie, L., Lee, Y. S. W., Wu, J., Ming, H., Tian, X., … Li, H. (2016). An automatic voice conversion evaluation strategy based on perceptual background noise distortion and speaker similarity. 9th ISCA Speech Synthesis Workshop. doi:10.21437/SSW.2016-8en
dc.identifier.urihttps://hdl.handle.net/10356/89623-
dc.description.abstractVoice conversion aims to modify the characteristics of one speaker to make it sound like spoken by another speaker without changing the language content. This task has attracted considerable attention and various approaches have been proposed since two decades ago. The evaluation of voice conversion approaches, usually through time-intensive subject listening tests, requires a huge amount of human labor. This paper proposes an automatic voice conversion evaluation strategy based on perceptual background noise distortion and speaker similarity. Experimental results show that our automatic evaluation results match the subjective listening results quite well. We further use our strategy to select best converted samples from multiple voice conversion systems and our submission achieves promising results in the voice conversion challenge (VCC2016).en
dc.format.extent8 p.en
dc.language.isoenen
dc.rights© 2016 International Speech Communication Association (ISCA). All rights reserved. This paper was published in 9th ISCA Speech Synthesis Workshop and is made available with permission of International Speech Communication Association (ISCA).en
dc.subjectEngineering::Computer science and engineeringen
dc.subjectVoice Conversionen
dc.subjectObjective Measuresen
dc.titleAn automatic voice conversion evaluation strategy based on perceptual background noise distortion and speaker similarityen
dc.typeConference Paperen
dc.contributor.schoolSchool of Computer Science and Engineeringen
dc.contributor.conference9th ISCA Speech Synthesis Workshopen
dc.identifier.doi10.21437/SSW.2016-8en
dc.description.versionPublished versionen
dc.identifier.rims200458en
item.fulltextWith Fulltext-
item.grantfulltextopen-
Appears in Collections:SCSE Conference Papers
Files in This Item:
File Description SizeFormat 
ssw9_2016.pdf845.15 kBAdobe PDFThumbnail
View/Open

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.