TY - GEN
T1 - Lyrics recognition from a singing voice based on finite state automaton for music information retrieval
AU - Hosoya, Toru
AU - Suzuki, Motoyuki
AU - Ito, Akinori
AU - Makino, Shozo
PY - 2005
Y1 - 2005
N2 - Recently, several music information retrieval (MIR) systems have been developed which retrieve musical pieces by the user's singing voice. All of these systems use only the melody information for retrieval. Although the lyrics information is useful for retrieval, there have been few attempts to exploit lyrics in the user's input. In order to develop a MIR system that uses lyrics and melody information, lyrics recognition is needed. Lyrics recognition from a singing voice is achieved by similar technology to that of speech recognition. The difference between lyrics recognition and general speech recognition is that the input lyrics are a part of the lyrics of songs in a database. To exploit linguistic constraints maximally, we described the recognition grammar using a finite state automaton (FSA) that accepts only lyrics in the database. In addition, we carried out a "singing voice adaptation" using a speaker adaptation technique. In our experimental results, about 86% retrieval accuracy was obtained.
AB - Recently, several music information retrieval (MIR) systems have been developed which retrieve musical pieces by the user's singing voice. All of these systems use only the melody information for retrieval. Although the lyrics information is useful for retrieval, there have been few attempts to exploit lyrics in the user's input. In order to develop a MIR system that uses lyrics and melody information, lyrics recognition is needed. Lyrics recognition from a singing voice is achieved by similar technology to that of speech recognition. The difference between lyrics recognition and general speech recognition is that the input lyrics are a part of the lyrics of songs in a database. To exploit linguistic constraints maximally, we described the recognition grammar using a finite state automaton (FSA) that accepts only lyrics in the database. In addition, we carried out a "singing voice adaptation" using a speaker adaptation technique. In our experimental results, about 86% retrieval accuracy was obtained.
KW - FSA
KW - Lyrics recognition
KW - MIR
UR - http://www.scopus.com/inward/record.url?scp=84873550660&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84873550660&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84873550660
SN - 9780955117909
T3 - ISMIR 2005 - 6th International Conference on Music Information Retrieval
SP - 532
EP - 535
BT - ISMIR 2005 - 6th International Conference on Music Information Retrieval
T2 - 6th International Conference on Music Information Retrieval, ISMIR 2005
Y2 - 11 September 2005 through 15 September 2005
ER -