2007:Query by Singing/Humming
Status
This is only a very basic draft version of a task proposal. Once more people show interest we can fill in the details.
The goal of the Query-by-Singing/Humming (QBSH) task is the evaluation of MIR systems that take as query input queries sung or hummed by real-world users. More information can be found in:
Please feel free to edit this page.
Query Data
1. Roger Jang's corpus (MIREX2006 QBSH corpus) which is comprised of 2797 queries along with 48 ground-truth MIDI files. All queries are from the beginning of references.
2. ThinkIT corpus comprised of 355 queries and 106 monophonic ground-truth midi files (with MIDI 0 or 1 format). There are no "singing from beginning" gurantee. This corpus will be published after the task running.
3. Noise MIDI will be the 5000+ Essen collection(can be accessed from http://www.esac-data.org/).
To build a large test set which can reflect real-world queries, it is suggested that every participant makes a contribution to the evaluation corpus.
Task description
Classic QBSH evaluation:
- Input: human singing/humming snippets (.wav)
- Database: ground-truth and noise midi files(which are monophonic)
- Output: candidate list.
- Evaluation: Mean Reciprocal Rank (MMR) and Top-X hit rate.
Rainer Typke also suggests a hybrid symbolic/audio query by humming task which combines a few different algorithm modules (like mono/poly phonic transcriber and rhythm/melody matcher) and evaluates them in a more complex database composed of polyphonic audio files. It could be further discussed.
Participants
If you think there is a slight chance that you might want to participate, please add your name and e-mail address to this list
- Xiao Wu (xwu at hccl dot ioa dot ac dot cn)
- Maarten Grachten (maarten dot grachten at jku dot at)
- Jiang Danning (jiangdn at cn dot ibm dot com)
- Niko Mikkila (mikkila at cs dot helsinki dot fi)