2007:Symbolic Melodic Similarity

From MIREX Wiki
Revision as of 23:09, 27 May 2007 by Carlosg (talk | contribs)

Overview

This page is devoted to discussions of the MIREX 07 Symbolic Melodic Similarity contest. Discussions on the MIREX 07 Symbolic Melodic Similarity contest planning list will be briefly digested on this page. A full digest of the discussions is available to subscribers from the MIREX 07 Symbolic Melodic Similarity contest planning list archives. You can subscribe to this list to participate in the discussion.

You can additionaly read information about the Symbolic Melodic Similarity tasks that were run in the 2005 and 2006 MIREX editions.

Task description

Retrieve the most similar items from a collection of symbolic documents, given a query, and rank them by similarity. The following tasks could be defined this year:

Task 1: Monophonic to monophonic. Both the query and the documents in the collection will be monophonic.

Task 2: Monophonic to polyphonic. The documents will be polyphonic (i.e. can have simultaneous notes), but the query will still be monophonic.

Task 3: Polyphonic to polyphonic. Both the query and documents will be polyphonic.

For now, the description of these tasks is intentionally open; the details can be found in the discussion section. Also, the realization of these tasks is subject to the numbers of participants interested in each task.

Evaluation and ground truth

The same method for building the ground truth as last year can be used. This method has the advantage that no ground truth needs to be built in advance. After the algorithms have been submitted, their results are pooled for every query, and human evaluators are asked to judge the relevance of the matches for some queries. To make this evaluation feasible, it is important that the algorithms do not only return the names of the matching MIDI files for task 2 and 3, but also where the matching fragment starts and ends in the matching MIDI file.

Potencial participants

If you think there is a slight chance that you might consider participating, please add your name here. Please indicate as well in which tasks you wish to participate.

  • Carlos G├│mez (monophonic-to-monophonic)

Discussion

Comments from Carlos G├│mez

For the monophonic task, an interesting variation this year could be to use a collection of different source than RISM. As RISM snippets have been used in the two previous competitions, past participants can have more interest in participating this year if data that is different in some aspect is used. I have though of the following possible sources for the collection this year:

  • The Themefinder database (www.themefinder.org) is comprised of three collections, which consist of classical themes, folksong themes and renaissance incipits. Incipits in the renaissance collection are tagged with their RISM number, so this collection has in common with the previously ones used, but the other two collections could be used. The difference lies in that the music is from different periods or genres, but the format is still the same (around 15 notes fragments).
  • The NZDL (Meldex) "folkfull" collection contains 9354 melodies, which appear to be longer on average than the Themefinder and RISM snippets.
  • A perhaps less feasible option is that there exists a digital version of the Barlow and Morgenstern dictionary of musical themes, at the website www.multimedialibrary.com. This database is copyrighted by the publishers of the book and the authors of the website, but we could try to ask permission to use it.

References