2008:Audio Chord Detection Results
Introduction
These are the results for the 2008 running of the Audio Chord Detection task set. For background information about this task set please refer to the 2008:Audio Chord Detection page.
Task Descriptions
Task 1 (Pretrained Systems; results below): Systems were submitted pretrained and were tested against 176 Beatles songs.
Task 2 (Train-Test Systems; results below): Systems were trained on roughly 2/3 of the Beatles dataset and tested on the remaining 1/3. Album filtering was applied to each train-test fold so that songs from the same album could not appear in both the train and test sets, as sketched below.
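The album filtering itself is simple to reproduce. Below is a minimal MATLAB sketch (an illustration only, not the organisers' actual partitioning script); the variable `song_album`, holding one album name per song, is a hypothetical input.

```matlab
% Album-filtered 3-fold split: every song of a given album is assigned to the
% same fold, so a fold's test songs never share an album with its training songs.
% 'song_album' is a hypothetical cell array with one album name per song.
albums = unique(song_album);
fold_of_album = mod(randperm(numel(albums)) - 1, 3) + 1;   % spread whole albums over folds 1..3
song_fold = zeros(numel(song_album), 1);
for k = 1:numel(albums)
    song_fold(strcmp(song_album, albums{k})) = fold_of_album(k);
end
test_songs  = find(song_fold == 1);    % roughly 1/3 of the songs for testing
train_songs = find(song_fold ~= 1);    % remaining roughly 2/3 for training
```

Holding out each of the three folds in turn gives the three train-test runs summarised in the Task 2 results below.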
The overlap score was calculated, per song, as the ratio between the duration over which the ground-truth and detected chords overlap and the total ground-truth duration. A secondary overlap score was also calculated by ignoring the major/minor quality of the detected chord (e.g., C major is counted as matching C minor).
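As an illustration of the metric (not the evaluation script actually used for the contest), the MATLAB sketch below approximates the overlap score for one song by sampling both annotations on a 10 ms grid; all variable and function names are hypothetical.

```matlab
% Approximate overlap score for one song, sampled on a 10 ms grid: the fraction
% of ground-truth time during which the detected chord label equals the
% ground-truth label. For the merged maj/min variant, each label would first be
% reduced to its root (e.g. 'C:maj' and 'C:min' both become 'C').
function score = overlap_score(gt_int, gt_lab, det_int, det_lab)
    % gt_int/det_int: N-by-2 [onset offset] matrices in seconds;
    % gt_lab/det_lab: cell arrays of chord labels, one per interval.
    step = 0.01;                                   % 10 ms sampling grid
    t = (gt_int(1, 1):step:gt_int(end, 2))';       % span of the ground truth
    gt_idx  = segment_at(t, gt_int);               % ground-truth interval per sample
    det_idx = segment_at(t, det_int);              % detected interval per sample
    both = gt_idx > 0 & det_idx > 0;               % samples covered by both annotations
    g = gt_lab(gt_idx(both));
    d = det_lab(det_idx(both));
    hit = false(size(t));
    hit(both) = strcmp(g(:), d(:));                % label match per sample
    score = sum(hit) / sum(gt_idx > 0);            % matched time / ground-truth time
end

function idx = segment_at(t, intervals)
    % Index of the interval covering each time in t (0 where none does).
    idx = zeros(size(t));
    for k = 1:size(intervals, 1)
        idx(t >= intervals(k, 1) & t < intervals(k, 2)) = k;
    end
end
```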
Note that 4 songs were excluded from the original Beatles dataset because of problems aligning the ground truth to the audio. The ground-truth-to-audio alignment was done automatically; the script used to perform the alignment will be released by Chris Harte.
General Legend
Team ID for ChordPreTrained (Task 1)
BP = J. P. Bello, J. Pickens (https://www.music-ir.org/mirex/abstracts/2008/CD_bello.pdf)
KO = M. Khadkevich, M. Omologo (https://www.music-ir.org/mirex/abstracts/2008/khadkevich_omologo_final.pdf)
KL1 = K. Lee 1
KL2 = K. Lee 2
MM = M. Mehnert (https://www.music-ir.org/mirex/abstracts/2008/mirex08__mehnert_et_al__cps_based_chord_analysis.pdf)
PP = H. Papadopoulos, G. Peeters (https://www.music-ir.org/mirex/abstracts/2008/mirex08_chord_papadopoulos.pdf)
PVM = J. Pauwels, M. Varewyck, J-P. Martens (https://www.music-ir.org/mirex/abstracts/2008/mirex2008-audio_chord_detection-ghent_university-johan_pauwels.pdf)
RK = M. Ryynänen, A. Klapuri (https://www.music-ir.org/mirex/abstracts/2008/CD_ryynanen.pdf)
Team ID for ChordTrainTest (Task 2)
DE = D. Ellis (https://www.music-ir.org/mirex/abstracts/2008/Ellis08-chordid.pdf)
ZL = X. Jhang, C. Lash (https://www.music-ir.org/mirex/abstracts/2008/Abstract_xinglin.pdf)
KO = M. Khadkevich, M. Omologo (https://www.music-ir.org/mirex/abstracts/2008/khadkevich_omologo_final.pdf)
KL = K. Lee (withtrain)
UMS = Y. Uchiyama, K. Miyamoto, S. Sagayama (https://www.music-ir.org/mirex/abstracts/2008/uchiyamamirex2008.pdf)
WD1 = J. Weil (https://www.music-ir.org/mirex/abstracts/2008/Mirex08_AudioChordDetection_Weil_Durrieu.pdf)
WD2 = J. Weil, J-L. Durrieu (https://www.music-ir.org/mirex/abstracts/2008/Mirex08_AudioChordDetection_Weil_Durrieu.pdf)
Overall Summary Results
Task 1 Results
Task 1 Overall Results
TeamID | BP | KL1 | KL2 | KO | MM | PP | PVM | RK |
---|---|---|---|---|---|---|---|---|
Avg. Overlap Score | 0.66 | 0.56 | 0.59 | 0.62 | 0.65 | 0.63 | 0.59 | 0.64 |
Avg. Overlap Score (Merged maj/min) | 0.69 | 0.6 | 0.65 | 0.65 | 0.68 | 0.66 | 0.62 | 0.69 |
TeamID | BP | KL1 | KL2 | KO | MM | PP | PVM | RK |
---|---|---|---|---|---|---|---|---|
Run Time (s) | 2059 | 853 | 827 | 2832 | 5580 | 6100 | 7411 | 1634 |
Task 1 Summary Data for Download
File Name Set (Pretrained runs): https://www.music-ir.org/mirex/results/2008/chord/task1_results/pretraineed_filenames.csv
Summary Overlap Data (Pretrained runs): https://www.music-ir.org/mirex/results/2008/chord/task1_results/ACD.task1.results.overlapScores.csv
Summary Overlap Data (Pretrained runs, merged maj/min): https://www.music-ir.org/mirex/results/2008/chord/task1_results/ACD.task1.results.overlapScores.major_minor.csv
Task 1 Friedman's Test for Significant Differences
The Friedman test was run in MATLAB against the Task 1 Overlap Score data over the 176 ground truth songs.
Source | SS | df | MS | Chi-sq | Prob>Chi-sq |
---|---|---|---|---|---|
Columns | 1180.83 | 7 | 168.69 | 196.83 | 0 |
Error | 6210.17 | 1225 | 5.07 | ||
Total | 7391 | 1407 |
The Tukey-Kramer HSD multi-comparison data below was generated using the following MATLAB instruction.
Command: [c,m,h,gnames] = multcompare(stats, 'ctype', 'tukey-kramer', 'estimate', 'friedman', 'alpha', 0.05);
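A minimal MATLAB sketch of the assumed workflow behind the quoted command and the table below; the matrix name `overlap` is hypothetical and stands for the per-song overlap scores (one row per song, one column per system, ordered as in the legend).

```matlab
% Friedman test on the per-song overlap scores, followed by the quoted
% Tukey-Kramer multiple comparison. 'overlap' is assumed to be a 176-by-8
% matrix (rows = songs, columns = systems).
[p, tbl, stats] = friedman(overlap, 1, 'off');      % 1 observation per cell, no ANOVA figure
[c, m, h, gnames] = multcompare(stats, 'ctype', 'tukey-kramer', ...
                                'estimate', 'friedman', 'alpha', 0.05);
% Each row of c gives a pair of systems with the lower bound, estimated rank
% difference and upper bound; the difference is significant at alpha = 0.05
% when the interval does not contain zero.
```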
TeamID | TeamID | Lowerbound | Mean | Upperbound | Significance |
---|---|---|---|---|---|
BP | KL1 | 2.1575 | 2.9489 | 3.7402 | TRUE |
BP | KL2 | 1.4814 | 2.2727 | 3.0641 | TRUE |
BP | KO | 0.7598 | 1.5511 | 2.3425 | TRUE |
BP | MM | -0.3681 | 0.4233 | 1.2147 | FALSE |
BP | PP | 0.4842 | 1.2756 | 2.0669 | TRUE |
BP | PVM | 1.2996 | 2.0909 | 2.8823 | TRUE |
BP | RK | 0.2371 | 1.0284 | 1.8198 | TRUE |
KL1 | KL2 | -1.4675 | -0.6761 | 0.1152 | FALSE |
KL1 | KO | -2.1891 | -1.3977 | -0.6064 | TRUE |
KL1 | MM | -3.3169 | -2.5256 | -1.7342 | TRUE |
KL1 | PP | -2.4647 | -1.6733 | -0.8819 | TRUE |
KL1 | PVM | -1.6493 | -0.8580 | -0.0666 | TRUE |
KL1 | RK | -2.7118 | -1.9205 | -1.1291 | TRUE |
KL2 | KO | -1.5129 | -0.7216 | 0.0698 | FALSE |
KL2 | MM | -2.6408 | -1.8494 | -1.0581 | TRUE |
KL2 | PP | -1.7885 | -0.9972 | -0.2058 | TRUE |
KL2 | PVM | -0.9732 | -0.1818 | 0.6095 | FALSE |
KL2 | RK | -2.0357 | -1.2443 | -0.4530 | TRUE |
KO | MM | -1.9192 | -1.1278 | -0.3365 | TRUE |
KO | PP | -1.0669 | -0.2756 | 0.5158 | FALSE |
KO | PVM | -0.2516 | 0.5398 | 1.3311 | FALSE |
KO | RK | -1.3141 | -0.5227 | 0.2686 | FALSE |
MM | PP | 0.0609 | 0.8523 | 1.6436 | TRUE |
MM | PVM | 0.8763 | 1.6676 | 2.4590 | TRUE |
MM | RK | -0.1862 | 0.6051 | 1.3965 | FALSE |
PP | PVM | 0.0240 | 0.8153 | 1.6067 | TRUE |
PP | RK | -1.0385 | -0.2472 | 0.5442 | FALSE |
PVM | RK | -1.8539 | -1.0625 | -0.2711 | TRUE |

Figure: Tukey-Kramer HSD multiple-comparison plot for Task 1 (2008_task1.friedman.png).
Task 2 Results
Task 2 Overall Results
TeamID | DE | KL | KO | UMS | WD1 | WD2 | ZL |
---|---|---|---|---|---|---|---|
Avg. Overlap Score | 0.66 | 0.58 | 0.55 | 0.72 | 0.60 | 0.62 | 0.36 |
Avg. Overlap Score (Merged maj/min) | 0.70 | 0.65 | 0.58 | 0.77 | 0.66 | 0.69 | 0.46 |
TeamID | DE | KL | KO | UMS | WD1 | WD2 | ZL |
---|---|---|---|---|---|---|---|
Run Time (s) | 6294 | 2745 | 4824 | 22593 | 3816 | 145812 | 7020 |
Task 2 Summary Data for Download
File Name Set (Train-test runs): https://www.music-ir.org/mirex/results/2008/chord/task2_results/all_filenames.csv
Summary Overlap Data (Train-test runs): https://www.music-ir.org/mirex/results/2008/chord/task2_results/all3folds_overlap_scores.csv
Summary Overlap Data (Train-test runs, merged maj/min): https://www.music-ir.org/mirex/results/2008/chord/task2_results/all3folds_overlap_scores_majorMinor.csv
Per Fold Summary Data (Train-test runs, ZIP archive): https://www.music-ir.org/mirex/results/2008/chord/task2_results/individualFriedmansForEachFold.zip
Task 2 Friedman's Test for Significant Differences
The Friedman test was run in MATLAB against the Task 2 Overlap Score data over the 176 ground truth songs.
Source | SS | df | MS | Chi-sq | Prob>Chi-sq |
---|---|---|---|---|---|
Columns | 2596.95 | 6 | 432.825 | 562.85 | 0 |
Error | 2607.55 | 1122 | 2.324 | ||
Total | 5204.5 | 1315 |
The Tukey-Kramer HSD multi-comparison data below was generated using the following MATLAB instruction.
Command: [c,m,h,gnames] = multcompare(stats, 'ctype', 'tukey-kramer', 'estimate', 'friedman', 'alpha', 0.05);
TeamID | TeamID | Lowerbound | Mean | Upperbound | Significance |
---|---|---|---|---|---|
DE | KL | 0.8069 | 1.4601 | 2.1133 | TRUE |
DE | KO | 1.3840 | 2.0372 | 2.6904 | TRUE |
DE | UMS | -1.6851 | -1.0319 | -0.3787 | TRUE |
DE | WD1 | 0.6074 | 1.2606 | 1.9138 | TRUE |
DE | WD2 | 0.1287 | 0.7819 | 1.4351 | TRUE |
DE | ZL | 3.0862 | 3.7394 | 4.3926 | TRUE |
KL | KO | -0.0761 | 0.5771 | 1.2303 | FALSE |
KL | UMS | -3.1452 | -2.4920 | -1.8388 | TRUE |
KL | WD1 | -0.8527 | -0.1995 | 0.4537 | FALSE |
KL | WD2 | -1.3314 | -0.6782 | -0.0250 | TRUE |
KL | ZL | 1.6261 | 2.2793 | 2.9325 | TRUE |
KO | UMS | -3.7223 | -3.0691 | -2.4159 | TRUE |
KO | WD1 | -1.4298 | -0.7766 | -0.1234 | TRUE |
KO | WD2 | -1.9085 | -1.2553 | -0.6021 | TRUE |
KO | ZL | 1.0489 | 1.7021 | 2.3553 | TRUE |
UMS | WD1 | 1.6394 | 2.2926 | 2.9458 | TRUE |
UMS | WD2 | 1.1606 | 1.8138 | 2.4670 | TRUE |
UMS | ZL | 4.1181 | 4.7713 | 5.4245 | TRUE |
WD1 | WD2 | -1.1319 | -0.4787 | 0.1745 | FALSE |
WD1 | ZL | 1.8255 | 2.4787 | 3.1319 | TRUE |
WD2 | ZL | 2.3042 | 2.9574 | 3.6106 | TRUE |

Figure: Tukey-Kramer HSD multiple-comparison plot for Task 2, all folds (2008_task2.allfolds_friedman.png).