A query-by-humming music information retrieval from audio signals based on multiple F0 candidates

Akinori Ito, Yu Kosugi, Shozo Makino, Masashi Ito

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

In this paper, we propose a query-by-humming (QbH) system that retrieves musical pieces given as audio signals. Most conventional QbH systems assume that the symbolic melody information is given a priori, which is not always true. In our system, the database for retrieval is generated from 1ch audio signal that contains many sounds. We generate the database by estimating fundamental frequencies (F0) of the audio signals frame by frame. To improve the retrieval accuracy, we exploit multiple F0 candidates to absorb the impact of F0 estimation errors. From the experiment, we obtained about 15 points of improvement by using multiple F0 candidates, compared with the QbH system with only one F0 candidate .

Original languageEnglish
Title of host publicationICALIP 2010 - 2010 International Conference on Audio, Language and Image Processing, Proceedings
Pages1-5
Number of pages5
DOIs
Publication statusPublished - 2010 Dec 1
Event2010 International Conference on Audio, Language and Image Processing, ICALIP 2010 - Shanghai, China
Duration: 2010 Nov 232010 Nov 25

Publication series

NameICALIP 2010 - 2010 International Conference on Audio, Language and Image Processing, Proceedings

Other

Other2010 International Conference on Audio, Language and Image Processing, ICALIP 2010
Country/TerritoryChina
CityShanghai
Period10/11/2310/11/25

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Vision and Pattern Recognition
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'A query-by-humming music information retrieval from audio signals based on multiple F0 candidates'. Together they form a unique fingerprint.

Cite this