Difference between revisions of "2024:Cover Song Identification"

From MIREX Wiki
(Created page with "==Description== This task requires that algorithms identify, for a query audio track, other recordings of the same composition, or "cover songs". Within the a collection of...")
 
(Description)
Line 10: Line 10:
 
=== Task specific mailing list ===
 
=== Task specific mailing list ===
 
In the past we have use a specific mailing list for the discussion of this task and related tasks. This year, however, we are asking that all discussions take place on the MIREX  [https://mail.lis.illinois.edu/mailman/listinfo/evalfest "EvalFest" list]. If you have an question or comment, simply include the task name in the subject heading.
 
In the past we have use a specific mailing list for the discussion of this task and related tasks. This year, however, we are asking that all discussions take place on the MIREX  [https://mail.lis.illinois.edu/mailman/listinfo/evalfest "EvalFest" list]. If you have an question or comment, simply include the task name in the subject heading.
 +
 +
== Data ==
 +
Two datasets will be used to evaluate cover song identification:
 +
 +
===US Pop Music Collection Cover Song (aka Mixed Collection)===
 +
This is the "original" ACS collection. Within the 1000 pieces in the Audio Cover Song database, there are embedded 30 different "cover songs" each represented by 11 different "versions" for a total of 330 audio files.
 +
 +
Using each of these cover song files in turn as as the "seed/query" file, we will examine the returned lists of items for the presence of the other 10 versions of the "seed/query" file.
 +
 +
Collection statistics:
 +
* 16bit, monophonic, 22.05khz, wav
 +
* The "cover songs" represent a variety of genres (e.g., classical, jazz, gospel, rock, folk-rock, etc.) and the variations span a variety of styles and orchestrations.
 +
* Size: 1000 tracks
 +
* Queries: 330 tracks
 +
 +
=== Sapp's Mazurka Collection Information ===
 +
In addition to our original ACS dataset, we used the  [http://www.mazurka.org.uk/ Mazurka.org dataset] put together by Craig Sapp. We randomly chose 11 versions from 49 mazurkas and ran it as a separate ACS subtask. Systems should return a distance matrix of 539x539 from which we located the ranks of each of the associated cover versions.
 +
 +
Collection statistics:
 +
* 16bit, monophonic, 22.05khz, wav
 +
* Size: 539 tracks
 +
* Queries: 539 tracks

Revision as of 22:46, 25 August 2024

Description

This task requires that algorithms identify, for a query audio track, other recordings of the same composition, or "cover songs".

Within the a collection of pieces in the cover song datasets, there are embedded a number of different "original songs" or compositions each represented by a number of different "versions". The "cover songs" or "versions" represent a variety of genres (e.g., classical, jazz, gospel, rock, folk-rock, etc.) and the variations span a variety of styles and orchestrations.

Using each of these version files in turn as as the "seed/query" file, we examine the returned ranked lists of items from each algorithm for the presence of the other versions of the "seed/query" file.

Two datasets are used in this task, the MIREX 2006 US Pop Music Cover Song dataset Audio Cover Song dataset the Mazurka dataset.

Task specific mailing list

In the past we have use a specific mailing list for the discussion of this task and related tasks. This year, however, we are asking that all discussions take place on the MIREX "EvalFest" list. If you have an question or comment, simply include the task name in the subject heading.

Data

Two datasets will be used to evaluate cover song identification:

US Pop Music Collection Cover Song (aka Mixed Collection)

This is the "original" ACS collection. Within the 1000 pieces in the Audio Cover Song database, there are embedded 30 different "cover songs" each represented by 11 different "versions" for a total of 330 audio files.

Using each of these cover song files in turn as as the "seed/query" file, we will examine the returned lists of items for the presence of the other 10 versions of the "seed/query" file.

Collection statistics:

  • 16bit, monophonic, 22.05khz, wav
  • The "cover songs" represent a variety of genres (e.g., classical, jazz, gospel, rock, folk-rock, etc.) and the variations span a variety of styles and orchestrations.
  • Size: 1000 tracks
  • Queries: 330 tracks

Sapp's Mazurka Collection Information

In addition to our original ACS dataset, we used the Mazurka.org dataset put together by Craig Sapp. We randomly chose 11 versions from 49 mazurkas and ran it as a separate ACS subtask. Systems should return a distance matrix of 539x539 from which we located the ranks of each of the associated cover versions.

Collection statistics:

  • 16bit, monophonic, 22.05khz, wav
  • Size: 539 tracks
  • Queries: 539 tracks