Difference between revisions of "2006:Symbolic Melodic Similarity Results"
(→Team ID) |
(→Team ID) |
||
(49 intermediate revisions by 5 users not shown) | |||
Line 1: | Line 1: | ||
[[Category: Results]] | [[Category: Results]] | ||
− | ==Introduction== | + | ==Introduction== |
− | These are the results for the 2006 running of the Symbolic Melodic Similarity task set. For background information about this task set please refer to the [[Symbolic Melodic Similarity]] page. | + | These are the results for the 2006 running of the Symbolic Melodic Similarity task set. For background information about this task set please refer to the [[2006:Symbolic Melodic Similarity]] page. |
+ | Each system was given a query and returned the 10 most melodically similar songs from a given collection where the collections were RISM (monophonic; 10,000), Karoke (polyphonic; 1,000), Mixed (polyphonic; 15,741). Then, for each query, the returned results from all participants were grouped and were evaluated by human graders, each query being evaluated by 3 different graders with two scores (using the Evalutron 6000 system). Graders were asked to provide 1 categorical score with 3 categories: NS,SS,VS as explained below, and one fine score (in the range from 0 to 10). | ||
+ | |||
+ | ====Evalutron 6000 Summary Data==== | ||
+ | '''Number of evaluators''' = 20<br /> | ||
+ | '''Number of evaluations per query/candidate pair''' = 3<br /> | ||
+ | '''Number of queries per grader''' = 15<br /> | ||
+ | '''Ave. size of the candidate lists''' = 15<br /> | ||
+ | '''Ave. number of query/candidate pairs evaluated per grader: 225<br /> | ||
+ | '''Number of queries (across all subtasks''' = 17<br /> | ||
+ | |||
===General Legend=== | ===General Legend=== | ||
====Team ID==== | ====Team ID==== | ||
Prefix '''R''' = RISM collection, '''K''' = Karaoke collection, '''M''' = Polyphonic collection<br /> | Prefix '''R''' = RISM collection, '''K''' = Karaoke collection, '''M''' = Polyphonic collection<br /> | ||
− | '''FH''' = [https://www.music-ir.org/ | + | '''FH''' = [https://www.music-ir.org/mirex/abstracts/2006/SMS_hanna.pdf Pascal Ferraro and Pierre Hanna]<br /> |
− | '''NM''' = [https://www.music-ir.org/ | + | '''NM''' = [https://www.music-ir.org/mirex/abstracts/2006/SMS_mikkila.pdf Kjell Lemström, Niko Mikkilä, Veli Mäkinen and Esko Ukkonen]<br /> |
− | '''RT''' = [https://www.music-ir.org/ | + | '''RT''' = [https://www.music-ir.org/mirex/abstracts/2006/SMS_QBSH_typke.pdf Rainer Typke, Frans Wiering and Remco C. Veltkamp]<br /> |
− | '''KF''' = [https://www.music-ir.org/ | + | '''KF''' = [https://www.music-ir.org/mirex/abstracts/2006/SMS_frieler.pdf Klaus Frieler]<br /> |
− | '''AU''' = [https://www.music-ir.org/ | + | '''AU''' = [https://www.music-ir.org/mirex/abstracts/2006/SMS_uitdenbogerd.pdf Alexandra Uitdenbogerd]<br /> |
====Broad Categories==== | ====Broad Categories==== | ||
Line 17: | Line 27: | ||
'''SS''' = Somewhat Similar<br /> | '''SS''' = Somewhat Similar<br /> | ||
'''VS''' = Very Similar<br /> | '''VS''' = Very Similar<br /> | ||
+ | |||
+ | ====Table Headings==== | ||
+ | '''ADR''' = Average Dynamic Recall <br /> | ||
+ | '''NRGB''' = Normalize Recall at Group Boundaries <br /> | ||
+ | '''AP''' = Average Precision (non-interpolated) <br /> | ||
+ | '''PND''' = Precision at N Documents <br /> | ||
===Calculating Summary Measures=== | ===Calculating Summary Measures=== | ||
− | ''' | + | '''Fine'''<sup>(1)</sup> = Sum of fine-grained human similarity decisions (0-10). <br /> |
− | + | '''PSum'''<sup>(1)</sup> = Sum of human broad similarity decisions: NS=0, SS=1, VS=2. <br /> | |
− | + | '''WCsum'''<sup>(1)</sup> = 'World Cup' scoring: NS=0, SS=1, VS=3 (rewards Very Similar). <br /> | |
− | + | '''SDsum'''<sup>(1)</sup> = 'Stephen Downie' scoring: NS=0, SS=1, VS=4 (strongly rewards Very Similar). <br /> | |
− | + | '''Greater0'''<sup>(1)</sup> = NS=0, SS=1, VS=1 (binary relevance judgement).<br /> | |
− | '''PSum''' = Sum of human broad similarity decisions: NS=0, SS=1, VS=2. <br /> | + | '''Greater1'''<sup>(1)</sup> = NS=0, SS=0, VS=1 (binary relevance judgement using only Very Similar).<br /> |
− | '''WCsum''' = 'World Cup' scoring: NS=0, SS=1, VS=3 (rewards Very Similar). <br /> | + | |
− | '''SDsum''' = 'Stephen Downie' scoring: NS=0, SS=1, VS=4 (strongly rewards Very Similar. <br /> | + | <sup>(1)</sup>Normalized to the range 0 to 1. |
− | '''Greater0''' = NS=0, SS=1, VS=1 (binary relevance judgement).<br /> | ||
− | '''Greater1''' = NS=0, SS=0, VS=1 (binary relevance judgement using only Very Similar).<br /> | ||
==Overall Summary Results== | ==Overall Summary Results== | ||
+ | ===Visualizations=== | ||
+ | Rainer Typke has created a series of [[2006:Symbolic Melodic Similarity Graphs]] that help us visualize the results. <br /> | ||
+ | Rainer Typke has also created a set of detailed representations of the results that is definitely with exploring at [[http://rainer.typke.org/mirex06.0.html http://rainer.typke.org/mirex06.0.html]]. | ||
+ | |||
+ | ===Task I: RISM Overall Summary=== | ||
+ | |||
+ | <csv>2006/sms06_rism_sum.csv</csv> | ||
+ | |||
+ | ===Task I: RISM Runtime Data=== | ||
− | + | <csv>2006/sms06_rism_runtime.csv</csv> | |
− | + | ===Task IIa: Karaoke Overall Summary=== | |
− | + | <csv>2006/sms06_karaoke_sum.csv</csv> | |
− | + | ===Task IIa: Karaoke Runtime Data=== | |
− | + | <csv>2006/sms06_karaoke_runtime.csv</csv> | |
− | <csv>sms06_mixed_sum.csv</csv> | + | ===Task IIb: Mixed Polyphonic Overall Summary=== |
+ | |||
+ | <csv>2006/sms06_mixed_sum.csv</csv> | ||
+ | |||
+ | ===Task IIb: Mixed Polyphonic Runtime Data=== | ||
+ | |||
+ | <csv>2006/sms06_mixed_runtime.csv</csv> | ||
==Task I: RISM Collection Summary Results== | ==Task I: RISM Collection Summary Results== | ||
− | + | There is an error with this data set...please stand by. | |
− | <csv> | + | <csv>2006/sms06_rism_results3.csv</csv> |
==Task IIa: Karaoke Collection Summary Results== | ==Task IIa: Karaoke Collection Summary Results== | ||
− | <csv> | + | <csv>2006/sms06_kar_results3.csv</csv> |
==Task IIb: Mixed Polyphonic Collection Summary Results== | ==Task IIb: Mixed Polyphonic Collection Summary Results== | ||
− | <csv> | + | <csv>2006/sms06_mix_results3.csv</csv> |
==Raw Scores== | ==Raw Scores== | ||
− | The raw data derived from the Evalutron 6000 human evaluations are located on the [[Symbolic Melodic Similarity Raw Data]] page. | + | The raw data derived from the Evalutron 6000 human evaluations are located on the [[2006:Symbolic Melodic Similarity Raw Data]] page. |
Latest revision as of 11:07, 26 July 2010
Contents
Introduction
These are the results for the 2006 running of the Symbolic Melodic Similarity task set. For background information about this task set please refer to the 2006:Symbolic Melodic Similarity page.
Each system was given a query and returned the 10 most melodically similar songs from a given collection where the collections were RISM (monophonic; 10,000), Karoke (polyphonic; 1,000), Mixed (polyphonic; 15,741). Then, for each query, the returned results from all participants were grouped and were evaluated by human graders, each query being evaluated by 3 different graders with two scores (using the Evalutron 6000 system). Graders were asked to provide 1 categorical score with 3 categories: NS,SS,VS as explained below, and one fine score (in the range from 0 to 10).
Evalutron 6000 Summary Data
Number of evaluators = 20
Number of evaluations per query/candidate pair = 3
Number of queries per grader = 15
Ave. size of the candidate lists = 15
Ave. number of query/candidate pairs evaluated per grader: 225
Number of queries (across all subtasks = 17
General Legend
Team ID
Prefix R = RISM collection, K = Karaoke collection, M = Polyphonic collection
FH = Pascal Ferraro and Pierre Hanna
NM = Kjell Lemström, Niko Mikkilä, Veli Mäkinen and Esko Ukkonen
RT = Rainer Typke, Frans Wiering and Remco C. Veltkamp
KF = Klaus Frieler
AU = Alexandra Uitdenbogerd
Broad Categories
NS = Not Similar
SS = Somewhat Similar
VS = Very Similar
Table Headings
ADR = Average Dynamic Recall
NRGB = Normalize Recall at Group Boundaries
AP = Average Precision (non-interpolated)
PND = Precision at N Documents
Calculating Summary Measures
Fine(1) = Sum of fine-grained human similarity decisions (0-10).
PSum(1) = Sum of human broad similarity decisions: NS=0, SS=1, VS=2.
WCsum(1) = 'World Cup' scoring: NS=0, SS=1, VS=3 (rewards Very Similar).
SDsum(1) = 'Stephen Downie' scoring: NS=0, SS=1, VS=4 (strongly rewards Very Similar).
Greater0(1) = NS=0, SS=1, VS=1 (binary relevance judgement).
Greater1(1) = NS=0, SS=0, VS=1 (binary relevance judgement using only Very Similar).
(1)Normalized to the range 0 to 1.
Overall Summary Results
Visualizations
Rainer Typke has created a series of 2006:Symbolic Melodic Similarity Graphs that help us visualize the results.
Rainer Typke has also created a set of detailed representations of the results that is definitely with exploring at [http://rainer.typke.org/mirex06.0.html].
Task I: RISM Overall Summary
RFH | RRT | RKF2 | RAU | RKF3 | RNM2 | RNM1 | RKF1 | |
---|---|---|---|---|---|---|---|---|
ADR | 0.707 | 0.715 | 0.670 | 0.577 | 0.555 | 0.541 | 0.268 | 0.000 |
NRGB | 0.622 | 0.626 | 0.568 | 0.515 | 0.466 | 0.484 | 0.277 | 0.000 |
AP | 0.623 | 0.607 | 0.594 | 0.461 | 0.391 | 0.393 | 0.167 | 0.000 |
PND | 0.526 | 0.547 | 0.504 | 0.441 | 0.426 | 0.403 | 0.250 | 0.000 |
Fine | 0.490 | 0.488 | 0.478 | 0.433 | 0.418 | 0.292 | 0.208 | 0.145 |
Psum | 0.594 | 0.592 | 0.567 | 0.522 | 0.500 | 0.364 | 0.258 | 0.172 |
WCsum | 0.544 | 0.544 | 0.519 | 0.457 | 0.450 | 0.344 | 0.250 | 0.119 |
SDsum | 0.519 | 0.521 | 0.494 | 0.425 | 0.425 | 0.335 | 0.246 | 0.092 |
Greater0 | 0.744 | 0.733 | 0.711 | 0.717 | 0.650 | 0.422 | 0.283 | 0.333 |
Greater1 | 0.444 | 0.450 | 0.422 | 0.328 | 0.333 | 0.306 | 0.233 | 0.011 |
Task I: RISM Runtime Data
Team ID | Machine | Run-time(seconds) | |
---|---|---|---|
AU | indexing | beer 4 | 33 |
AU | query | beer 4 | 31 |
FH | query | beer 4 | 807 |
KF | indexing | black | 210 |
KF1 | query | black | 2880 |
KF2 | query | black | 2220 |
KF3 | optip | black | 3960 |
NM1 | query | beer 6 | 68 |
NM2 | query | beer 6 | 188 |
RT | query | beer 4 | 59 |
Task IIa: Karaoke Overall Summary
KRT | KAU | KFH | KNM2 | KNM1 | |
---|---|---|---|---|---|
ADR | 0.819 | 0.378 | 0.150 | 0.000 | 0.000 |
NRGB | 0.764 | 0.333 | 0.150 | 0.000 | 0.000 |
AP | 0.875 | 0.363 | 0.100 | 0.000 | 0.000 |
PND | 0.833 | 0.333 | 0.100 | 0.000 | 0.000 |
Fine | 0.267 | 0.207 | 0.153 | 0.112 | 0.105 |
Psum | 0.327 | 0.260 | 0.200 | 0.147 | 0.120 |
WCsum | 0.289 | 0.211 | 0.151 | 0.098 | 0.082 |
SDsum | 0.270 | 0.187 | 0.127 | 0.073 | 0.063 |
Greater0 | 0.440 | 0.407 | 0.347 | 0.293 | 0.233 |
Greater1 | 0.213 | 0.113 | 0.053 | 0.007 | 0.000 |
Task IIa: Karaoke Runtime Data
Team ID | Machine | Run-time(seconds) | |
---|---|---|---|
AU | indexing | beer 4 | 397 |
AU | query | beer 4 | 5 |
FH | query | beer 4 | 1338 |
NM1 | query | beer 6 | 386 |
NM2 | query | beer 6 | 1875 |
RT | query | beer 4 | 32 |
Task IIb: Mixed Polyphonic Overall Summary
MRT | MAU | MFH | MNM1 | MNH2 | |
---|---|---|---|---|---|
ADR | 0.784 | 0.587 | 0.218 | 0.070 | 0.000 |
NRGB | 0.806 | 0.516 | 0.174 | 0.040 | 0.000 |
AP | 0.903 | 0.517 | 0.170 | 0.032 | 0.004 |
PND | 0.903 | 0.549 | 0.187 | 0.044 | 0.000 |
Fine | 0.663 | 0.489 | 0.283 | 0.128 | 0.114 |
Psum | 0.772 | 0.558 | 0.342 | 0.153 | 0.153 |
WCsum | 0.752 | 0.535 | 0.294 | 0.115 | 0.113 |
SDsum | 0.742 | 0.524 | 0.271 | 0.096 | 0.093 |
Greater0 | 0.833 | 0.628 | 0.483 | 0.267 | 0.272 |
Greater1 | 0.711 | 0.489 | 0.200 | 0.039 | 0.033 |
Task IIb: Mixed Polyphonic Runtime Data
Team ID | Machine | Run-time(seconds) | |
---|---|---|---|
AU | indexing | beer 4 | 2785 |
AU | query | beer 4 | 51 |
FH | query | beer 4 | 14440 |
NM1 | both | beer 6 | 3271 |
NM2 | both | beer 6 | 16314 |
RT | query | beer 4 | 108 |
Task I: RISM Collection Summary Results
There is an error with this data set...please stand by.
ADR: | ||||||||
---|---|---|---|---|---|---|---|---|
queryID | RFH | RRT | RKF2 | RAU | RKF3 | RNM2 | RNM1 | RKF1 |
qr000001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
qr000002 | 0.768 | 0.802 | 0.518 | 0.144 | 0.298 | 0.572 | 0.103 | 0.000 |
qr000003 | 0.786 | 0.840 | 0.707 | 0.740 | 0.642 | 0.000 | 0.555 | 0.000 |
qr000004 | 0.930 | 0.795 | 0.866 | 0.707 | 0.609 | 0.817 | 0.222 | 0.000 |
qr000005 | 0.870 | 0.870 | 0.960 | 0.960 | 0.870 | 0.870 | 0.613 | 0.000 |
qr000006 | 0.887 | 0.984 | 0.967 | 0.912 | 0.912 | 0.984 | 0.113 | 0.000 |
Ave. ADR score: | 0.707 | 0.715 | 0.670 | 0.577 | 0.555 | 0.541 | 0.268 | 0.000 |
NRGB: | ||||||||
queryID | RFH | RRT | RKF2 | RAU | RKF3 | RNM2 | RNM1 | RKF1 |
qr000001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
qr000002 | 0.630 | 0.630 | 0.296 | 0.296 | 0.333 | 0.370 | 0.185 | 0.000 |
qr000003 | 0.625 | 0.688 | 0.563 | 0.625 | 0.500 | 0.000 | 0.375 | 0.000 |
qr000004 | 0.917 | 0.615 | 0.771 | 0.531 | 0.406 | 0.708 | 0.260 | 0.000 |
qr000005 | 0.840 | 0.840 | 0.920 | 0.920 | 0.840 | 0.840 | 0.560 | 0.000 |
qr000006 | 0.719 | 0.984 | 0.859 | 0.719 | 0.719 | 0.984 | 0.281 | 0.000 |
Ave. NRGB score: | 0.622 | 0.626 | 0.568 | 0.515 | 0.466 | 0.484 | 0.277 | 0.000 |
AP: | ||||||||
queryID | RFH | RRT | RKF2 | RAU | RKF3 | RNM2 | RNM1 | RKF1 |
qr000001 | 0.500 | 0.500 | 0.500 | 0.333 | 0.000 | 0.000 | 0.000 | 0.000 |
qr000002 | 0.604 | 0.514 | 0.222 | 0.059 | 0.137 | 0.383 | 0.050 | 0.000 |
qr000003 | 0.500 | 0.604 | 0.375 | 0.446 | 0.384 | 0.000 | 0.250 | 0.000 |
qr000004 | 0.750 | 0.548 | 0.683 | 0.375 | 0.375 | 0.500 | 0.186 | 0.000 |
qr000005 | 0.600 | 0.600 | 0.800 | 0.800 | 0.600 | 0.600 | 0.419 | 0.000 |
qr000006 | 0.785 | 0.875 | 0.986 | 0.750 | 0.847 | 0.875 | 0.098 | 0.000 |
Ave. AP score: | 0.623 | 0.607 | 0.594 | 0.461 | 0.391 | 0.393 | 0.167 | 0.000 |
PND: | ||||||||
queryID | RFH | RRT | RKF2 | RAU | RKF3 | RNM2 | RNM1 | RKF1 |
qr000001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
qr000002 | 0.556 | 0.556 | 0.222 | 0.222 | 0.333 | 0.444 | 0.222 | 0.000 |
qr000003 | 0.500 | 0.625 | 0.375 | 0.500 | 0.500 | 0.000 | 0.250 | 0.000 |
qr000004 | 0.750 | 0.625 | 0.750 | 0.375 | 0.375 | 0.500 | 0.375 | 0.000 |
qr000005 | 0.600 | 0.600 | 0.800 | 0.800 | 0.600 | 0.600 | 0.400 | 0.000 |
qr000006 | 0.750 | 0.875 | 0.875 | 0.750 | 0.750 | 0.875 | 0.250 | 0.000 |
Ave. PND score: | 0.526 | 0.547 | 0.504 | 0.441 | 0.426 | 0.403 | 0.250 | 0.000 |
Fine: | ||||||||
queryID | RFH | RRT | RKF2 | RAU | RKF3 | RNM2 | RNM1 | RKF1 |
qr000001 | 0.322 | 0.265 | 0.359 | 0.309 | 0.230 | 0.042 | 0.007 | 0.154 |
qr000002 | 0.566 | 0.569 | 0.407 | 0.423 | 0.452 | 0.358 | 0.153 | 0.214 |
qr000003 | 0.401 | 0.489 | 0.352 | 0.396 | 0.355 | 0.017 | 0.191 | 0.076 |
qr000004 | 0.581 | 0.544 | 0.554 | 0.357 | 0.419 | 0.390 | 0.341 | 0.111 |
qr000005 | 0.387 | 0.386 | 0.475 | 0.483 | 0.379 | 0.276 | 0.278 | 0.140 |
qr000006 | 0.681 | 0.675 | 0.720 | 0.632 | 0.672 | 0.671 | 0.278 | 0.178 |
Ave. Fine Score: | 0.490 | 0.488 | 0.478 | 0.433 | 0.418 | 0.292 | 0.208 | 0.145 |
Psum: | ||||||||
queryID | RFH | RRT | RKF2 | RAU | RKF3 | RNM2 | RNM1 | RKF1 |
qr000001 | 0.417 | 0.317 | 0.433 | 0.383 | 0.267 | 0.067 | 0.033 | 0.200 |
qr000002 | 0.717 | 0.700 | 0.483 | 0.533 | 0.533 | 0.467 | 0.200 | 0.250 |
qr000003 | 0.500 | 0.617 | 0.417 | 0.483 | 0.450 | 0.050 | 0.250 | 0.100 |
qr000004 | 0.667 | 0.650 | 0.633 | 0.417 | 0.517 | 0.450 | 0.400 | 0.100 |
qr000005 | 0.483 | 0.450 | 0.550 | 0.567 | 0.450 | 0.350 | 0.350 | 0.183 |
qr000006 | 0.783 | 0.817 | 0.883 | 0.750 | 0.783 | 0.800 | 0.317 | 0.200 |
Ave. Psum Score: | 0.594 | 0.592 | 0.567 | 0.522 | 0.500 | 0.364 | 0.258 | 0.172 |
WCsum: | ||||||||
queryID | RFH | RRT | RKF2 | RAU | RKF3 | RNM2 | RNM1 | RKF1 |
qr000001 | 0.322 | 0.244 | 0.344 | 0.289 | 0.189 | 0.044 | 0.022 | 0.133 |
qr000002 | 0.689 | 0.667 | 0.422 | 0.456 | 0.467 | 0.444 | 0.200 | 0.178 |
qr000003 | 0.444 | 0.556 | 0.378 | 0.433 | 0.422 | 0.033 | 0.233 | 0.067 |
qr000004 | 0.633 | 0.600 | 0.600 | 0.378 | 0.456 | 0.433 | 0.400 | 0.067 |
qr000005 | 0.433 | 0.411 | 0.500 | 0.489 | 0.400 | 0.333 | 0.333 | 0.122 |
qr000006 | 0.744 | 0.789 | 0.867 | 0.700 | 0.767 | 0.778 | 0.311 | 0.144 |
Ave. WCsum Score: | 0.544 | 0.544 | 0.519 | 0.457 | 0.450 | 0.344 | 0.250 | 0.119 |
SDsum: | ||||||||
queryID | RRT | RFH | RKF2 | RAU | RKF3 | RNM2 | RNM1 | RKF1 |
qr000001 | 0.208 | 0.275 | 0.300 | 0.242 | 0.150 | 0.033 | 0.017 | 0.100 |
qr000002 | 0.650 | 0.675 | 0.392 | 0.417 | 0.433 | 0.433 | 0.200 | 0.142 |
qr000003 | 0.525 | 0.417 | 0.358 | 0.408 | 0.408 | 0.025 | 0.225 | 0.050 |
qr000004 | 0.575 | 0.617 | 0.583 | 0.358 | 0.425 | 0.425 | 0.400 | 0.050 |
qr000005 | 0.392 | 0.408 | 0.475 | 0.450 | 0.375 | 0.325 | 0.325 | 0.092 |
qr000006 | 0.775 | 0.725 | 0.858 | 0.675 | 0.758 | 0.767 | 0.308 | 0.117 |
Ave. SDsum Score: | 0.521 | 0.519 | 0.494 | 0.425 | 0.425 | 0.335 | 0.246 | 0.092 |
Greater0: | ||||||||
queryID | RFH | RRT | RAU | RKF2 | RKF3 | RNM2 | RNM1 | RKF1 |
qr000001 | 0.700 | 0.533 | 0.667 | 0.700 | 0.500 | 0.133 | 0.067 | 0.400 |
qr000002 | 0.800 | 0.800 | 0.767 | 0.667 | 0.733 | 0.533 | 0.200 | 0.467 |
qr000003 | 0.667 | 0.800 | 0.633 | 0.533 | 0.533 | 0.100 | 0.300 | 0.200 |
qr000004 | 0.767 | 0.800 | 0.533 | 0.733 | 0.700 | 0.500 | 0.400 | 0.200 |
qr000005 | 0.633 | 0.567 | 0.800 | 0.700 | 0.600 | 0.400 | 0.400 | 0.367 |
qr000006 | 0.900 | 0.900 | 0.900 | 0.933 | 0.833 | 0.867 | 0.333 | 0.367 |
Ave. greater0 Score: | 0.744 | 0.733 | 0.717 | 0.711 | 0.650 | 0.422 | 0.283 | 0.333 |
Greater1: | ||||||||
queryID | RRT | RFH | RKF2 | RKF3 | RAU | RNM2 | RNM1 | RKF1 |
qr000001 | 0.100 | 0.133 | 0.167 | 0.033 | 0.100 | 0.000 | 0.000 | 0.000 |
qr000002 | 0.600 | 0.633 | 0.300 | 0.233 | 0.300 | 0.400 | 0.200 | 0.033 |
qr000003 | 0.433 | 0.333 | 0.300 | 0.367 | 0.333 | 0.000 | 0.200 | 0.000 |
qr000004 | 0.500 | 0.567 | 0.533 | 0.333 | 0.300 | 0.400 | 0.400 | 0.000 |
qr000005 | 0.333 | 0.333 | 0.400 | 0.300 | 0.333 | 0.300 | 0.300 | 0.000 |
qr000006 | 0.733 | 0.667 | 0.833 | 0.733 | 0.600 | 0.733 | 0.300 | 0.033 |
Ave. greater1 Score: | 0.450 | 0.444 | 0.422 | 0.333 | 0.328 | 0.306 | 0.233 | 0.011 |
Task IIa: Karaoke Collection Summary Results
ADR: | |||||
---|---|---|---|---|---|
queryID | KRT | KAU | KFH | KNM2 | KNM1 |
qk000001 | 1.000 | 0.889 | 0.000 | 0.000 | 0.000 |
qk000002 | 0.542 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000003 | 0.556 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000004 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000005 | 1.000 | 1.000 | 0.750 | 0.000 | 0.000 |
Ave. ADR score: | 0.820 | 0.378 | 0.150 | 0.000 | 0.000 |
NRGB: | |||||
queryID | KRT | KAU | KFH | KNM2 | KNM1 |
qk000001 | 1.000 | 0.667 | 0.000 | 0.000 | 0.000 |
qk000002 | 0.375 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000003 | 0.444 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000004 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000005 | 1.000 | 1.000 | 0.750 | 0.000 | 0.000 |
Ave. NRGB score: | 0.764 | 0.333 | 0.150 | 0.000 | 0.000 |
AP: | |||||
queryID | KRT | KAU | KFH | KNM2 | KNM1 |
qk000001 | 1.000 | 0.667 | 0.000 | 0.000 | 0.000 |
qk000002 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000003 | 0.667 | 0.067 | 0.000 | 0.000 | 0.000 |
qk000004 | 1.000 | 0.083 | 0.000 | 0.000 | 0.000 |
qk000005 | 1.000 | 1.000 | 0.500 | 0.000 | 0.000 |
Ave. AP score: | 0.875 | 0.363 | 0.100 | 0.000 | 0.000 |
PND: | |||||
queryID | KRT | KAU | KFH | KNM2 | KNM1 |
qk000001 | 1.000 | 0.667 | 0.000 | 0.000 | 0.000 |
qk000002 | 0.500 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000003 | 0.667 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000004 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000005 | 1.000 | 1.000 | 0.500 | 0.000 | 0.000 |
Ave. PND score: | 0.833 | 0.333 | 0.100 | 0.000 | 0.000 |
Fine | |||||
queryID | KRT | KAU | KFH | KNM2 | KNM1 |
qk000001 | 0.337 | 0.268 | 0.108 | 0.079 | 0.116 |
qk000002 | 0.297 | 0.094 | 0.106 | 0.048 | 0.109 |
qk000003 | 0.222 | 0.189 | 0.169 | 0.177 | 0.073 |
qk000004 | 0.267 | 0.199 | 0.152 | 0.088 | 0.078 |
qk000005 | 0.214 | 0.283 | 0.230 | 0.168 | 0.149 |
Ave. Fine Score: | 0.267 | 0.207 | 0.153 | 0.112 | 0.105 |
Psum: | |||||
queryID | KRT | KAU | KFH | KNM2 | KNM1 |
qk000001 | 0.417 | 0.317 | 0.150 | 0.100 | 0.133 |
qk000002 | 0.350 | 0.117 | 0.117 | 0.067 | 0.133 |
qk000003 | 0.300 | 0.250 | 0.233 | 0.233 | 0.050 |
qk000004 | 0.317 | 0.267 | 0.217 | 0.117 | 0.117 |
qk000005 | 0.250 | 0.350 | 0.283 | 0.217 | 0.167 |
Ave. Psum Score: | 0.327 | 0.260 | 0.200 | 0.147 | 0.120 |
WCsum: | |||||
queryID | KRT | KAU | KFH | KNM2 | KNM1 |
qk000001 | 0.378 | 0.278 | 0.100 | 0.067 | 0.089 |
qk000002 | 0.322 | 0.078 | 0.078 | 0.044 | 0.089 |
qk000003 | 0.244 | 0.189 | 0.189 | 0.156 | 0.033 |
qk000004 | 0.278 | 0.211 | 0.167 | 0.078 | 0.078 |
qk000005 | 0.222 | 0.300 | 0.222 | 0.144 | 0.122 |
Ave. WCsum Score: | 0.289 | 0.211 | 0.151 | 0.098 | 0.082 |
SDsum: | |||||
queryID | KRT | KAU | KFH | KNM2 | KNM1 |
qk000001 | 0.358 | 0.258 | 0.075 | 0.050 | 0.067 |
qk000002 | 0.308 | 0.058 | 0.058 | 0.033 | 0.067 |
qk000003 | 0.217 | 0.158 | 0.167 | 0.117 | 0.025 |
qk000004 | 0.258 | 0.183 | 0.142 | 0.058 | 0.058 |
qk000005 | 0.208 | 0.275 | 0.192 | 0.108 | 0.100 |
Ave. SDsum Score: | 0.270 | 0.187 | 0.127 | 0.073 | 0.063 |
Greater0: | |||||
queryID | KRT | KAU | KFH | KNM2 | KNM1 |
qk000001 | 0.533 | 0.433 | 0.300 | 0.200 | 0.267 |
qk000002 | 0.433 | 0.233 | 0.233 | 0.133 | 0.267 |
qk000003 | 0.467 | 0.433 | 0.367 | 0.467 | 0.100 |
qk000004 | 0.433 | 0.433 | 0.367 | 0.233 | 0.233 |
qk000005 | 0.333 | 0.500 | 0.467 | 0.433 | 0.300 |
Ave. greater0 Score: | 0.440 | 0.407 | 0.347 | 0.293 | 0.233 |
Greater1: | |||||
queryID | KRT | KAU | KFH | KNM1 | KNM2 |
qk000001 | 0.300 | 0.200 | 0.000 | 0.000 | 0.000 |
qk000002 | 0.267 | 0.000 | 0.000 | 0.000 | 0.000 |
qk000003 | 0.133 | 0.067 | 0.100 | 0.000 | 0.000 |
qk000004 | 0.200 | 0.100 | 0.067 | 0.000 | 0.000 |
qk000005 | 0.167 | 0.200 | 0.100 | 0.033 | 0.000 |
Ave. greater1 Score: | 0.213 | 0.113 | 0.053 | 0.007 | 0.000 |
Task IIb: Mixed Polyphonic Collection Summary Results
ADR: | |||||
---|---|---|---|---|---|
queryID | MRT | MAU | MFH | MNM1 | MNM2 |
qm000001 | 0.569 | 0.158 | 0.000 | 0.000 | 0.000 |
qm000002 | 0.592 | 0.322 | 0.000 | 0.408 | 0.000 |
qm000003 | 0.793 | 0.663 | 0.000 | 0.000 | 0.000 |
qm000004 | 0.893 | 0.602 | 0.012 | 0.000 | 0.000 |
qm000005 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 |
qm000006 | 0.857 | 0.776 | 0.293 | 0.010 | 0.000 |
Ave. ADR score: | 0.784 | 0.587 | 0.218 | 0.070 | 0.000 |
NRGB: | |||||
queryID | MRT | MAU | MFH | MNM1 | MNM2 |
qm000001 | 0.583 | 0.139 | 0.000 | 0.000 | 0.000 |
qm000002 | 0.772 | 0.517 | 0.000 | 0.228 | 0.000 |
qm000003 | 0.857 | 0.510 | 0.000 | 0.000 | 0.000 |
qm000004 | 0.836 | 0.383 | 0.012 | 0.000 | 0.000 |
qm000005 | 0.909 | 0.909 | 0.909 | 0.000 | 0.000 |
qm000006 | 0.879 | 0.637 | 0.121 | 0.010 | 0.000 |
Ave. NRGB score: | 0.806 | 0.516 | 0.174 | 0.040 | 0.000 |
AP: | |||||
queryID | MRT | MAU | MFH | MNM1 | MNM2 |
qm000001 | 1.000 | 0.167 | 0.000 | 0.000 | 0.000 |
qm000002 | 0.833 | 0.403 | 0.000 | 0.167 | 0.021 |
qm000003 | 1.000 | 0.524 | 0.000 | 0.000 | 0.000 |
qm000004 | 0.778 | 0.397 | 0.012 | 0.000 | 0.000 |
qm000005 | 0.909 | 0.909 | 0.909 | 0.000 | 0.000 |
qm000006 | 0.900 | 0.700 | 0.100 | 0.025 | 0.000 |
Ave. AP score: | 0.903 | 0.517 | 0.170 | 0.032 | 0.004 |
PND: | |||||
queryID | MRT | MAU | MFH | MNM1 | MNM2 |
qm000001 | 1.000 | 0.167 | 0.000 | 0.000 | 0.000 |
qm000002 | 0.833 | 0.500 | 0.000 | 0.167 | 0.000 |
qm000003 | 1.000 | 0.571 | 0.000 | 0.000 | 0.000 |
qm000004 | 0.778 | 0.444 | 0.111 | 0.000 | 0.000 |
qm000005 | 0.909 | 0.909 | 0.909 | 0.000 | 0.000 |
qm000006 | 0.900 | 0.700 | 0.100 | 0.100 | 0.000 |
Ave. AP score: | 0.903 | 0.549 | 0.187 | 0.044 | 0.000 |
Fine: | |||||
queryID | MRT | MAU | MFH | MNM1 | MNM2 |
qm000001 | 0.502 | 0.162 | 0.130 | 0.097 | 0.101 |
qm000002 | 0.479 | 0.306 | 0.108 | 0.203 | 0.088 |
qm000003 | 0.670 | 0.438 | 0.087 | 0.105 | 0.073 |
qm000004 | 0.621 | 0.430 | 0.226 | 0.097 | 0.235 |
qm000005 | 0.866 | 0.880 | 0.870 | 0.147 | 0.078 |
qm000006 | 0.838 | 0.719 | 0.279 | 0.122 | 0.110 |
Ave. Fine Score: | 0.663 | 0.489 | 0.283 | 0.128 | 0.114 |
Psum: | |||||
queryID | MRT | MAU | MFH | MNM1 | MNM2 |
qm000001 | 0.600 | 0.217 | 0.167 | 0.083 | 0.150 |
qm000002 | 0.583 | 0.367 | 0.133 | 0.233 | 0.150 |
qm000003 | 0.783 | 0.483 | 0.133 | 0.117 | 0.083 |
qm000004 | 0.717 | 0.450 | 0.267 | 0.117 | 0.250 |
qm000005 | 1.000 | 1.000 | 1.000 | 0.167 | 0.100 |
qm000006 | 0.950 | 0.833 | 0.350 | 0.200 | 0.183 |
Ave. Psum Score: | 0.772 | 0.558 | 0.342 | 0.153 | 0.153 |
WCsum: | |||||
queryID | MRT | MAU | MFH | MNM1 | MNM2 |
qm000001 | 0.567 | 0.167 | 0.122 | 0.056 | 0.100 |
qm000002 | 0.533 | 0.344 | 0.089 | 0.189 | 0.122 |
qm000003 | 0.767 | 0.456 | 0.089 | 0.078 | 0.056 |
qm000004 | 0.700 | 0.422 | 0.189 | 0.078 | 0.200 |
qm000005 | 1.000 | 1.000 | 1.000 | 0.122 | 0.067 |
qm000006 | 0.944 | 0.822 | 0.278 | 0.167 | 0.133 |
Ave. WCsum Score: | 0.752 | 0.535 | 0.294 | 0.115 | 0.113 |
SDsum: | |||||
queryID | MRT | MAU | MFH | MNM1 | MNM2 |
qm000001 | 0.550 | 0.142 | 0.100 | 0.042 | 0.075 |
qm000002 | 0.508 | 0.333 | 0.067 | 0.167 | 0.108 |
qm000003 | 0.758 | 0.442 | 0.067 | 0.058 | 0.042 |
qm000004 | 0.692 | 0.408 | 0.150 | 0.058 | 0.175 |
qm000005 | 1.000 | 1.000 | 1.000 | 0.100 | 0.050 |
qm000006 | 0.942 | 0.817 | 0.242 | 0.150 | 0.108 |
Ave. SDsum Score: | 0.742 | 0.524 | 0.271 | 0.096 | 0.093 |
Greater0: | |||||
queryID | MRT | MAU | MFH | MNM2 | MNM1 |
qm000001 | 0.700 | 0.367 | 0.300 | 0.300 | 0.167 |
qm000002 | 0.733 | 0.433 | 0.267 | 0.233 | 0.367 |
qm000003 | 0.833 | 0.567 | 0.267 | 0.167 | 0.233 |
qm000004 | 0.767 | 0.533 | 0.500 | 0.400 | 0.233 |
qm000005 | 1.000 | 1.000 | 1.000 | 0.200 | 0.300 |
qm000006 | 0.967 | 0.867 | 0.567 | 0.333 | 0.300 |
Ave. greater0 Score: | 0.833 | 0.628 | 0.483 | 0.272 | 0.267 |
Greater1: | |||||
queryID | MRT | MAU | MFH | MNM1 | MNM2 |
qm000001 | 0.500 | 0.067 | 0.033 | 0.000 | 0.000 |
qm000002 | 0.433 | 0.300 | 0.000 | 0.100 | 0.067 |
qm000003 | 0.733 | 0.400 | 0.000 | 0.000 | 0.000 |
qm000004 | 0.667 | 0.367 | 0.033 | 0.000 | 0.100 |
qm000005 | 1.000 | 1.000 | 1.000 | 0.033 | 0.000 |
qm000006 | 0.933 | 0.800 | 0.133 | 0.100 | 0.033 |
Ave. greater1 Score: | 0.711 | 0.489 | 0.200 | 0.039 | 0.033 |
Raw Scores
The raw data derived from the Evalutron 6000 human evaluations are located on the 2006:Symbolic Melodic Similarity Raw Data page.