Unsupervised Cross-Domain Singing Voice Conversion

Online Supplement

Adam Polyak, Lior Wolf, Yossef Mordechay Adi, Yaniv Taigman

LJS - voice learned from single speaker speech dataset
LCSING - voice learned from single speaker singing dataset
VCTK - Voices learned from multi-speaker speech dataset
NUS-48E - Voices learned from multi-speaker singing dataset

LJS

Reference speech samples from LJS dataset.

Conversions of singing samples from the NUS-48E dataset to LJS voice. Upper row, shows the name of source singer. Middle row, the audio sample to be converted. The bottom row shows the conversion generated by our method.

Source Speaker	ZHIY	VKOW	SAMF	PMAR	NJAT	MPUR
Original
Conversion

Source Speaker	MPOL	MCUR	KENN	JTAN	JLEE	ADIZ
Original
Conversion

LCSING

Refernce singing samples from LCSING dataset.

Conversions of singing samples from the NUS-48E dataset to LCSING voice. Upper row, shows the name of source singer. Middle row, the audio sample to be converted. The bottom row shows the conversion generated by our method.

Source Speaker	ZHIY	VKOW	SAMF	PMAR	NJAT	MPUR
Original
Conversion

Source Speaker	MPOL	MCUR	KENN	JTAN	JLEE	ADIZ
Original
Conversion

VCTK

Conversions of singing samples from the NUS-48E dataset to VCTK voices. Upper row, shows the name of source singer. Second row, the audio sample to be converted. Third row shows the conversion generated by our method. Bottom row, shows the name of the target speaker.

Source Speaker	ZHIY	VKOW	SAMF	PMAR	NJAT	MPUR
Original
Reference
Conversion
Target Speaker	p307	p259	p335	p311	p243	p248

Source Speaker	MPOL	MCUR	KENN	JTAN	JLEE	ADIZ
Original
Reference
Conversion
Target Speaker	p256	p233	p248	p282	p304	p258

NUS48E

Conversions of between singers from the NUS-48E dataset. Each table presents the conversion of a single sample to all 12 singers in the dataset (including reconstruction). Upper row, shows the name of target singer. Bottom row shows the conversion generated by our method.

Source Speaker	Original
ADIZ

Target Speaker	ZHIY	VKOW	SAMF	PMAR	NJAT	MPUR
Conversion

Source Speaker	MPOL	MCUR	KENN	JTAN	JLEE	ADIZ
Conversion

Source Speaker	Original
VKOW

Target Speaker	ZHIY	VKOW	SAMF	PMAR	NJAT	MPUR
Conversion

Source Speaker	MPOL	MCUR	KENN	JTAN	JLEE	ADIZ
Conversion