본문 바로가기
  • Home

Singing Voice Synthesis Using HMM Based TTS and MusicXML

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2015, 20(5), pp.53-63
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science

Najeeb Ullah Khan 1 Jungchul Lee 1

1울산대학교

Accredited

ABSTRACT

Singing voice synthesis is the generation of a song using a computer given its lyrics and musical notes. Hidden Markov models (HMM) have been proved to be the models of choice for text to speech synthesis. HMMs have also been used for singing voice synthesis research, however, a huge database is needed for the training of HMMs for singing voice synthesis. And commercially available singing voice synthesis systems which use the piano roll music notation, needs to adopt the easy to read standard music notation which make it suitable for singing learning applications. To overcome this problem, we use a speech database for training context dependent HMMs, to be used for singing voice synthesis. Pitch and duration control methods have been devised to modify the parameters of the HMMs trained on speech, to be used as the synthesis units for the singing voice. This work describes a singing voice synthesis system which uses a MusicXML based music score editor as the front-end interface for entry of the notes and lyrics to be synthesized and a hidden Markov model based text to speech synthesis system as the back-end synthesizer. A perceptual test shows the feasibility of our proposed system.

Citation status

* References for papers published after 2023 are currently being built.