Unsupervised Cross-Domain Singing Voice Conversion

Online Supplement

Adam Polyak, Lior Wolf, Yossef Mordechay Adi, Yaniv Taigman

Contents

LJS - Voice learned from a single-speaker speech dataset
LCSING - Voice learned from a single-speaker singing dataset
VCTK - Voices learned from a multi-speaker speech dataset
NUS-48E - Voices learned from a multi-speaker singing dataset



LJS

Reference speech samples from the LJS dataset.


Conversions of singing samples from the NUS-48E dataset to the LJS voice. The top row shows the name of the source singer, the middle row the audio sample to be converted, and the bottom row the conversion generated by our method.

Source Speaker ZHIY VKOW SAMF PMAR NJAT MPUR
Original
Conversion


Source Speaker MPOL MCUR KENN JTAN JLEE ADIZ
Original
Conversion



LCSING

Reference singing samples from the LCSING dataset.


Conversions of singing samples from the NUS-48E dataset to the LCSING voice. The top row shows the name of the source singer, the middle row the audio sample to be converted, and the bottom row the conversion generated by our method.

Source Speaker ZHIY VKOW SAMF PMAR NJAT MPUR
Original
Conversion


Source Speaker MPOL MCUR KENN JTAN JLEE ADIZ
Original
Conversion



VCTK

Conversions of singing samples from the NUS-48E dataset to VCTK voices. The top row shows the name of the source singer, and the Original row the audio sample to be converted. The Conversion row shows the conversion generated by our method, and the bottom row the name of the target speaker.

Source Speaker ZHIY VKOW SAMF PMAR NJAT MPUR
Original
Reference
Conversion
Target Speaker p307 p259 p335 p311 p243 p248


Source Speaker MPOL MCUR KENN JTAN JLEE ADIZ
Original
Reference
Conversion
Target Speaker p256 p233 p248 p282 p304 p258



NUS-48E

Conversions between singers from the NUS-48E dataset. Each table presents the conversion of a single sample to all 12 singers in the dataset (including reconstruction). The upper row shows the name of the target singer, and the bottom row shows the conversion generated by our method.

Source Speaker Original
ADIZ

Target Speaker ZHIY VKOW SAMF PMAR NJAT MPUR
Conversion


Target Speaker MPOL MCUR KENN JTAN JLEE ADIZ
Conversion



Source Speaker Original
VKOW

Target Speaker ZHIY VKOW SAMF PMAR NJAT MPUR
Conversion


Target Speaker MPOL MCUR KENN JTAN JLEE ADIZ
Conversion