Unified model for voice conversion of speech and singing voice using adaptive pitch constraints

Shogo Fukawa, Takashi Nose, Shuhei Imai, Akinori Ito

Research output: Contribution to journal › Article › peer-review

Abstract

This paper proposes SpSiVC, a voice conversion method that appropriately converts both speech and singing voices with a single model. Because pitch distributions across speakers differ significantly between speech and singing voices, voice conversion has mainly been evaluated as two separate tasks: speech conversion and singing voice conversion. SpSiVC introduces an adaptive F0 loss that enables the model to implicitly switch the shift width of the log F0 according to the type of input voice. We examine the effectiveness of the F0 constraints through objective and subjective evaluations.

Original language: English
Pages (from-to): 120-123
Number of pages: 4
Journal: Acoustical Science and Technology
Volume: 46
Issue number: 1
Publication status: Published - 2025 Jan

Keywords

  • CycleGAN
  • Singing voice conversion (SVC)
  • Unified model
  • Voice conversion (VC)
