A Quantitative Comparison of Different Approaches for Melody Extraction from Polyphonic Audio Recordings



This paper provides an overview of current state-of-the-art
approaches for melody extraction from polyphonic audio recordings and
proposes a methodology for the quantitative evaluation of melody
extraction algorithms. We first define a general architecture for melody
extraction systems and discuss the difficulties of the problem at hand. We
then review the approaches that represent the current state of the art in
this area. Finally, we propose and discuss a methodology
for evaluating these approaches and present some results
and conclusions of the comparison.


Cited by

Year 2011 : 3 citations

 Fonseca N. (2011). “Singing voice resynthesis using concatenative-based techniques”, PhD Thesis, University of Porto, Portugal

 Serrà J. J. (2011). “Identification of versions of the same musical composition by processing audio descriptions”. PhD Thesis, Universitat Pompeu Fabra, Barcelona, Spain.

 ???. "?? ?? ??? ?? ??? ?? ??? ?? ??." ?????? 16.4 (2011): 84-92.

Year 2010 : 1 citation

 Durrieu J.-L. (2010). “Transcription et séparation automatique de la mélodie principale dans les signaux de musique polyphoniques”. PhD Thesis, Télécom ParisTech.

Year 2008 : 4 citations

 Misra (2008). “Technical report on audio and speech processing”. Technical Report FP6-027026, K-Space D3.7

 Oudtshoorn B. (2008). “Investigating the Feasibility of Near Real-Time Music Transcription on Mobile Devices”. Internal Report, University of Western Australia.

 Rao V. and Rao P. (2008). “Vocal Melody Detection in the Presence of Pitched Accompaniment using Harmonic Matching Methods”. Proceedings of the International Conference on Digital Audio Effects – DAFx’08, Espoo, Finland

 Salamon J. (2008). “Chroma-based Predominant Melody and Bass Line Extraction from Music Audio Signals”. MSc Thesis, Universitat Pompeu Fabra, Barcelona.

Year 2007 : 3 citations

 Demopoulos R. J. and Katchabaw M. J. (2007). “Investigating the Feasibility of Near Real-Time Music Transcription on Mobile Devices”. Technical Report #677, University of Western Ontario, Canada.

 Poliner G., Ellis D., Ehmann A., Gomez E., Streich S. and Ong B. (2007). “Melody Transcription from Music Audio: Approaches and Evaluation”. IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, No. 4, pp. 1247-1256.

 Reis G. and Fernández de Vega F. (2007). “Electronic synthesis using genetic algorithms for automatic music transcription”. Proceedings of the 9th Annual Conference on Genetic and Evolutionary Computation, London, England.

Year 2006 : 2 citations

 de Cheveigné A. (2006). “Multiple F0 Estimation”, in Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, edited by DeLiang Wang and Guy J. Brown, John Wiley and Sons.

 Lemvigh M. B. (2006). “Automatisk transskribering af musik”. Technical Report, University of Copenhagen, Denmark.