Difference between revisions of "2007:Symbolic Melodic Similarity"

From MIREX Wiki
Line 3:
 
This page is devoted to discussions of the MIREX 07 Symbolic Melodic Similarity contest. Discussions on the <span class="plainlinks">[https://mail.lis.uiuc.edu/mailman/listinfo/mrx-com02 MIREX 07 Symbolic Melodic Similarity contest planning list]</span> will be briefly digested on this page. A full digest of the discussions is available to subscribers from the [https://mail.lis.uiuc.edu/mailman/private/mrx-com02/ MIREX 07 Symbolic Melodic Similarity contest planning list archives]. You can <span class="plainlinks">[https://mail.lis.uiuc.edu/mailman/listinfo/mrx-com02 subscribe]</span> to this list to participate in the discussion.
 
  
You can additionaly read information about the Symbolic Melodic Similarity tasks that were run in the [[2005:Symbolic Melodic Similarity|2005]] and [[2006:Symbolic Melodic Similarity|2006]] MIREX editions.
+
Additionally, you can read information about the Symbolic Melodic Similarity tasks that were run in the [[2005:Symbolic Melodic Similarity|2005]] and [[2006:Symbolic Melodic Similarity|2006]] MIREX editions.
  
 
== Task description ==
 
Line 15:
 
Task 3: Polyphonic to polyphonic. Both the query and documents will be polyphonic.
 
  
For now, the description of these tasks is intentionally open; the details can be found in the [[#Discussion|discussion]] section. Also, the realization of these tasks is subject to the numbers of participants interested in each task.
+
For now, the description of these tasks is intentionally open; the details are to be determined in the [[#Discussion|discussion]] section. Also note that the realization of these tasks is subject to the number of participants interested in each task.
  
 
== Evaluation and ground truth ==
 
Line 21:
 
The same method for building the ground truth as last year can be used. This method has the advantage that no ground truth needs to be built in advance. After the algorithms have been submitted, their results are pooled for every query, and human evaluators are asked to judge the relevance of the matches for some queries. To make this evaluation feasible, it is important that, for tasks 2 and 3, the algorithms return not only the names of the matching MIDI files but also where the matching fragment starts and ends in each matching MIDI file.
  
== Potencial participants ==
+
== Potential participants ==
  
 
If you think there is a slight chance that you might consider participating, please add your name here. Please also indicate which tasks you wish to participate in.
Line 31:
 
=== Comments from Carlos Gómez ===
  
For the monophonic task, an interesting variation this year could be to use a collection of different source than RISM. As RISM snippets have been used in the two previous competitions, past participants can have more interest in participating this year if data that is different in some aspect is used. I have though of the following possible sources for the collection this year:
+
For the monophonic task, an interesting variation this year would be to use a collection from a source other than RISM. Because RISM snippets were used in the two previous competitions, past participants may be more interested in participating this year if the data differs in some respect. These are some possible sources for the collection this year:
* The Themefinder database ([http://www.themefinder.org www.themefinder.org]) is comprised of three collections, which consist of classical themes, folksong themes and renaissance incipits. Incipits in the renaissance collection are tagged with their RISM number, so this collection has in common with the previously ones used, but the other two collections could be used. The difference lies in that the music is from different periods or genres, but the format is still the same (around 15 notes fragments).  
+
* The Themefinder database ([http://www.themefinder.org www.themefinder.org]) comprises three collections: classical themes, folksong themes and renaissance incipits. Incipits in the renaissance collection are tagged with their RISM number, so this collection is probably similar to the ones used previously, but the other two collections could be tried. The music in those collections comes from other periods or genres, but the format is still the same (fragments of around 15 notes).
* The NZDL (Meldex) "folkfull" collection contains 9354 melodies, which appear to be longer on average than the Themefinder and RISM snippets.  
+
* The Meldex (NZDL) database contains around 10,000 melodies [http://www.informatics.indiana.edu/donbyrd/MusicTestCollections.HTML], which appear to be longer on average than the Themefinder and RISM snippets.  
* A perhaps less feasible option is that there exists a digital version of the Barlow and Morgenstern dictionary of musical themes, at the website [http://www.multimedialibrary.com www.multimedialibrary.com]. This database is copyrighted by the publishers of the book and the authors of the website, but we could try to ask permission to use it.
+
* A perhaps less feasible option is the digital version of the Barlow and Morgenstern dictionary of musical themes, which can be browsed at [http://www.multimedialibrary.com www.multimedialibrary.com]. This database is copyrighted by the publishers of the book and the authors of the website, but we could try to ask permission to use it.
 +
* Another large monophonic database is HymnQuest.
  
== References ==
+
These references were taken from the list of [http://www.informatics.indiana.edu/donbyrd/MusicTestCollections.HTML candidate music IR test collections] maintained by Donald Byrd.

Revision as of 10:10, 28 May 2007

Overview

This page is devoted to discussions of the MIREX 07 Symbolic Melodic Similarity contest. Discussions on the MIREX 07 Symbolic Melodic Similarity contest planning list will be briefly digested on this page. A full digest of the discussions is available to subscribers from the MIREX 07 Symbolic Melodic Similarity contest planning list archives. You can subscribe to this list to participate in the discussion.

Additionally, you can read information about the Symbolic Melodic Similarity tasks that were run in the 2005 and 2006 MIREX editions.

Task description

Retrieve the most similar items from a collection of symbolic documents, given a query, and rank them by similarity. The following tasks could be defined this year:

Task 1: Monophonic to monophonic. Both the query and the documents in the collection will be monophonic.

Task 2: Monophonic to polyphonic. The documents will be polyphonic (i.e. can have simultaneous notes), but the query will still be monophonic.

Task 3: Polyphonic to polyphonic. Both the query and documents will be polyphonic.

For now, the description of these tasks is intentionally open; the details are to be determined in the discussion section. Also note that the realization of these tasks is subject to the number of participants interested in each task.

Evaluation and ground truth

The same method for building the ground truth as last year can be used. This method has the advantage that no ground truth needs to be built in advance. After the algorithms have been submitted, their results are pooled for every query, and human evaluators are asked to judge the relevance of the matches for some queries. To make this evaluation feasible, it is important that, for tasks 2 and 3, the algorithms return not only the names of the matching MIDI files but also where the matching fragment starts and ends in each matching MIDI file.
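The pooling step described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the procedure actually used by the organizers: the `Match` record, its field names, the time unit of `start` and `end`, and the `depth` cutoff are all hypothetical, since the page does not specify an output format or a pool depth.

```python
from collections import defaultdict
from typing import NamedTuple


class Match(NamedTuple):
    """One retrieved item. Field names and units are illustrative assumptions."""
    midi_file: str  # name of the matching MIDI file
    start: float    # where the matching fragment starts
    end: float      # where the matching fragment ends


def pool_results(runs, depth=10):
    """Merge the top `depth` matches of every algorithm into one pool per query.

    `runs` maps algorithm name -> {query id -> ranked list of Match}.
    Returns {query id -> sorted list of unique Match records}, i.e. the set
    of candidates that human evaluators would be asked to judge.
    """
    pools = defaultdict(set)
    for ranked_by_query in runs.values():
        for query_id, ranked in ranked_by_query.items():
            # A set removes matches returned by more than one algorithm.
            pools[query_id].update(ranked[:depth])
    return {query_id: sorted(matches) for query_id, matches in pools.items()}


# Two hypothetical submissions that overlap on one match for query "q1".
runs = {
    "algo_a": {"q1": [Match("x.mid", 0.0, 4.0), Match("y.mid", 8.0, 12.0)]},
    "algo_b": {"q1": [Match("y.mid", 8.0, 12.0), Match("z.mid", 2.0, 6.0)]},
}
pool = pool_results(runs, depth=2)
for match in pool["q1"]:
    print(match.midi_file, match.start, match.end)
```

Because each pooled record carries the fragment boundaries, an evaluator could be pointed directly at the relevant passage of a long polyphonic file instead of having to search the whole document.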

Potential participants

If you think there is a slight chance that you might consider participating, please add your name here. Please also indicate which tasks you wish to participate in.

  • Carlos Gómez (monophonic-to-monophonic)

Discussion

Comments from Carlos Gómez

For the monophonic task, an interesting variation this year would be to use a collection from a source other than RISM. Because RISM snippets were used in the two previous competitions, past participants may be more interested in participating this year if the data differs in some respect. These are some possible sources for the collection this year:

  • The Themefinder database (www.themefinder.org) comprises three collections: classical themes, folksong themes and renaissance incipits. Incipits in the renaissance collection are tagged with their RISM number, so this collection is probably similar to the ones used previously, but the other two collections could be tried. The music in those collections comes from other periods or genres, but the format is still the same (fragments of around 15 notes).
  • The Meldex (NZDL) database contains around 10,000 melodies [1], which appear to be longer on average than the Themefinder and RISM snippets.
  • A perhaps less feasible option is the digital version of the Barlow and Morgenstern dictionary of musical themes, which can be browsed at www.multimedialibrary.com. This database is copyrighted by the publishers of the book and the authors of the website, but we could try to ask permission to use it.
  • Another large monophonic database is HymnQuest.

These references were taken from the list of candidate music IR test collections maintained by Donald Byrd.