2014:Audio Fingerprinting
From MIREX Wiki
Revision as of 19:13, 29 June 2014
Description
In this task, each query is an exact but noisy recording of a song, and the system must find the matching song in the database.
Data
Database
- 589 audio files (*.mp3 and *.wav) corresponding to 573 different songs
- Languages: English, Chinese, Japanese, ...
- 35 mono, 554 stereo
- 40.97 hours long
Query
- 305 recordings (*.wav) corresponding to 304 different songs in DB
- 166 mono, 139 stereo
- 16.11 hours long
- Recorded with various brands of smartphones; the recordings contain noise
- Various sampling rates and bit resolutions
- Divided into 5800 10-second clips
Evaluation Procedures
Top-10 hit rate (1 point is scored for a hit in the top 10 and 0 is scored otherwise, as in the Query by Singing/Humming task)
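The scoring rule above can be sketched in a few lines. This is only an illustration, not the official evaluation script: the function name `top10_hit_rate` and the dictionary-based input layout (query id mapped to a ranked list of song keys, and query id mapped to the true song key) are assumptions made for the example.

```python
from typing import Dict, Sequence


def top10_hit_rate(
    ranked: Dict[str, Sequence[str]],
    truth: Dict[str, str],
    k: int = 10,
) -> float:
    """Return the fraction of queries whose true song is in the first k candidates.

    ranked: hypothetical mapping from query id to candidates, best match first.
    truth:  hypothetical mapping from query id to the correct song key.
    """
    if not truth:
        raise ValueError("no queries to score")
    hits = 0
    for query_id, true_song in truth.items():
        # A query with no returned candidates simply scores 0.
        candidates = list(ranked.get(query_id, []))[:k]
        if true_song in candidates:
            hits += 1  # 1 point for a hit in the top k, 0 otherwise
    return hits / len(truth)
```

Under this sketch, the position of the correct song within the top 10 does not matter; only its presence does.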
Submission Format
Participants are required to submit their algorithm broken down into two parts. The two parts are:
- Fingerprinting extractor
Command format:
extractor %file.list% %dir% %database_or_query%
where %file.list% is the input list of database audio files named as uniq_key.mp3 or uniq_key.wav. For example:
./AFP/database/00001.wav
./AFP/database/00002.wav
./AFP/database/00003.mp3
./AFP/database/00004.wav
...
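As a rough illustration of how an extractor might read such a list, the sketch below takes the list contents, one path per line as in the example, and derives the unique key from each file name. The function name `read_file_list` and the decision to reject other extensions are assumptions for this example, not part of the task specification.

```python
from pathlib import Path
from typing import List, Tuple

# Only the two formats named in the task description are accepted (assumption).
ALLOWED_SUFFIXES = {".mp3", ".wav"}


def read_file_list(text: str) -> List[Tuple[str, str]]:
    """Parse file-list contents into (uniq_key, path) pairs.

    The key is taken to be the file name without its extension,
    e.g. "./AFP/database/00001.wav" gives the key "00001".
    """
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        path = Path(line)
        if path.suffix.lower() not in ALLOWED_SUFFIXES:
            raise ValueError(f"unexpected audio format: {line}")
        entries.append((path.stem, line))
    return entries
```

A caller would typically pass `Path(list_file).read_text()` to this function and then process each returned path in turn.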
Time and hardware limits
Due to the potentially high number of participants in this and other audio tasks, hard limits on the runtime of submissions are specified. The time/storage limits of different steps are shown in the following table:
Steps | Time limit | Storage (hard disk) limit |
---|---|---|
extractor (for database) | 72 hours in total | 40 KB for every 10 seconds of audio + 250 MB extra overhead (if needed) |
extractor (for query) | (included in the 72-hour total) | 40 KB for each query clip |
matcher | (included in the 72-hour total) | N/A |
Submissions that exceed these limitations may not receive a result.
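To get a feel for the storage limits, the sketch below turns the table into a small calculation. It rests on several assumptions that the table does not settle: that a partial 10-second segment counts as a full one, that the 250 MB overhead applies once to the whole database, and that 1 MB is 1024 KB (adjustable through `kb_per_mb`). The function names are hypothetical.

```python
import math

KB_PER_10_SECONDS = 40   # database fingerprint allowance per 10 s of audio
EXTRA_OVERHEAD_MB = 250  # extra overhead, assumed to apply once per database
KB_PER_QUERY_CLIP = 40   # allowance for each query clip


def database_budget_kb(duration_seconds: float, kb_per_mb: int = 1024) -> int:
    """Storage allowance in KB for a database of the given total duration."""
    # Assumption: a trailing partial segment is rounded up to a full one.
    segments = math.ceil(duration_seconds / 10)
    return segments * KB_PER_10_SECONDS + EXTRA_OVERHEAD_MB * kb_per_mb


def query_budget_kb(num_clips: int) -> int:
    """Storage allowance in KB for all query fingerprints together."""
    return num_clips * KB_PER_QUERY_CLIP
```

For example, `query_budget_kb(5800)` gives 232000 KB for the 5800 query clips, and `database_budget_kb(40.97 * 3600)` gives the allowance for the 40.97-hour database under the assumptions stated above.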
Potential Participants
Discussion
name / email