Difference between revisions of "Audio Melody Extraction"
From MIREX Wiki
Latest revision as of 11:12, 17 January 2012
Description
The aim of the MIREX audio melody extraction evaluation is to identify the melody pitch contour from polyphonic musical audio. Pitch is expressed as the fundamental frequency of the main melodic voice and is reported frame by frame on an evenly spaced time grid.
The task consists of two parts:
- Voicing detection (deciding whether a particular time frame contains a "melody pitch" or not),
- Pitch detection (deciding the most likely melody pitch for each time frame).
We structure the submission so that these two parts can be done independently within a single output file. That is, it is possible (via a negative pitch value) to guess a pitch even for frames that were judged unvoiced. Algorithms that do not discriminate between melodic and non-melodic parts are also welcome!
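The signed-pitch convention described above can be sketched in Python as follows. This is only an illustration, not the official submission code: the function names, the 10 ms frame spacing, the tab-separated two-column layout, and the use of 0 for "no pitch guess at all" are assumptions made for the example. The only part taken from the text is that a negative value carries a pitch guess for a frame judged unvoiced.

```python
from typing import List, Optional, Sequence, Tuple

# Assumed frame spacing in seconds; the text only says the grid is evenly spaced.
HOP_SECONDS = 0.01


def encode_frame(pitch_hz: Optional[float], voiced: bool) -> float:
    """Fold a pitch guess and a voicing decision into one signed value."""
    if pitch_hz is None or pitch_hz <= 0:
        # Assumed convention: 0 means the algorithm offers no pitch guess.
        return 0.0
    # Positive = voiced; negative = unvoiced, but a pitch guess is still given.
    return pitch_hz if voiced else -pitch_hz


def decode_frame(value: float) -> Tuple[bool, Optional[float]]:
    """Recover (voiced, pitch guess in Hz) from a signed value."""
    if value == 0:
        return False, None
    return value > 0, abs(value)


def format_output(
    pitches: Sequence[Optional[float]],
    voicing: Sequence[bool],
    hop: float = HOP_SECONDS,
) -> List[str]:
    """Build one 'time<TAB>signed pitch' line per frame (hypothetical layout)."""
    return [
        f"{i * hop:.2f}\t{encode_frame(p, v):.2f}"
        for i, (p, v) in enumerate(zip(pitches, voicing))
    ]


if __name__ == "__main__":
    # Frame 0: voiced at 220 Hz; frame 1: unvoiced, guess kept; frame 2: no guess.
    for line in format_output([220.0, 220.0, None], [True, False, False]):
        print(line)
```

With this encoding, voicing detection can be scored from the sign alone and pitch detection from the absolute value, so the two parts stay independent. An algorithm that makes no voicing decision could simply mark every frame as voiced.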