Difference between revisions of "2025:Audio Chord Estimation Results"
From MIREX Wiki
|  (→Test Sets) | |||
| Line 51: | Line 51: | ||
| ====Main Test Sets==== | ====Main Test Sets==== | ||
| + | |||
| + | The following datasets are served as pure test sets. No system is allowed to train on them. | ||
| + | |||
| * '''Billboard 2013''': The held-out portion of the McGill Billboard dataset, containing mainly western pop songs from the Billboard chart. | * '''Billboard 2013''': The held-out portion of the McGill Billboard dataset, containing mainly western pop songs from the Billboard chart. | ||
| * '''Yamaha_JPOP''': A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs. | * '''Yamaha_JPOP''': A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs. | ||
| Line 58: | Line 61: | ||
| These are datasets that may not be strictly held-out test sets. Some models might have been trained on these datasets; for specific details, please refer to the extended abstracts of each model. | These are datasets that may not be strictly held-out test sets. Some models might have been trained on these datasets; for specific details, please refer to the extended abstracts of each model. | ||
| + | |||
| * '''Billboard 2012''': The public portion of the McGill Billboard dataset. | * '''Billboard 2012''': The public portion of the McGill Billboard dataset. | ||
| * '''RWC Popular''': 100 pop songs from the RWC (Real World Computing) Music Database. 20% songs with English lyrics and 80% songs with Japanese lyrics. | * '''RWC Popular''': 100 pop songs from the RWC (Real World Computing) Music Database. 20% songs with English lyrics and 80% songs with Japanese lyrics. | ||
Revision as of 04:03, 8 September 2025
This page is still WIP. More submissions and descriptions may appear.
Contents
Submissions
| Submission | Title | Authors | |
|---|---|---|---|
| Baseline: Chordino | NNLS Chroma v1.1 | Link | |
| Baseline: ISMIR2019 | Large-Vocabulary Chord Transcription via Chord Structure Decomposition | Link | |
| MD1 | Degree-Based Automatic Chord Recognition with Enharmonic Distinction | TBA | Muhammad Waseem Akram et al. [*] | 
| wu-ensemble | wu-ensemble | TBA | Yiwei Ding, Christof Weiß | 
| wu-single | wu-single | TBA | Yiwei Ding, Christof Weiß | 
| YK1 | Semi-Supervised Audio Chord Estimator Based on Disentangled Generative Modeling | TBA | Yiming Wu, Kento Yoshida | 
| BMACE | A Mamba-Based Model for Automatic Chord Recognition | TBA | Chunyu Yuan, Jiyeoung Sim, Johanna Devaney | 
[*] Please submit an extended abstract containing the full author list.
Test Sets
Main Test Sets
The following datasets are served as pure test sets. No system is allowed to train on them.
- Billboard 2013: The held-out portion of the McGill Billboard dataset, containing mainly western pop songs from the Billboard chart.
- Yamaha_JPOP: A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs.
- Yamaha_Balanced: A private dataset annotated by Yamaha Corporation. The dataset contains 241 songs. While it is still biased towards JPOP songs, the dataset covers a wider range of genres: J.Pop (10.37%), Rock (10.37%), J.Enka (10.37%), J.Kayoukyoku (10.37%), Soundtrack (10.37%), Western Pop (10.37%), Children's Song (10.37%), R&B (6.22%), Hiphop (4.56%), Jazz (2.49%), Dance (2.49%), World (2.07%), Techno (1.24%), Easy listening (1.24%), J.Minyou (1.24%), Others (5.81%).
Additional Test Sets
These are datasets that may not be strictly held-out test sets. Some models might have been trained on these datasets; for specific details, please refer to the extended abstracts of each model.
- Billboard 2012: The public portion of the McGill Billboard dataset.
- RWC Popular: 100 pop songs from the RWC (Real World Computing) Music Database. 20% songs with English lyrics and 80% songs with Japanese lyrics.
Main Results
The following datasets are served as pure test sets. No system is allowed to train on them.
Billboard2013
| Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg | 
|---|---|---|---|---|---|---|---|---|
| Baseline: Chordino | 71.06 | 67.18 | 65.09 | 48.88 | 47.06 | 0.82 | 0.83 | 0.83 | 
| Baseline: ISMIR2019 | 78.61 | 76.39 | 74.72 | 64.15 | 62.65 | 0.83 | 0.79 | 0.93 | 
| MD1 | 81.35 | 79.15 | 77.91 | 66.40 | 65.33 | 0.86 | 0.85 | 0.89 | 
| wu-ensemble | 74.64 | 71.97 | 70.72 | 55.06 | 53.96 | 0.83 | 0.86 | 0.82 | 
| wu-single | 75.77 | 73.14 | 71.74 | 55.41 | 54.15 | 0.83 | 0.85 | 0.83 | 
| YK1 | 81.01 | 78.10 | 75.41 | 64.53 | 62.05 | 0.86 | 0.85 | 0.87 | 
YAMAHA_Balanced
| Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg | 
|---|---|---|---|---|---|---|---|---|
| Baseline: Chordino | 77.57 | 74.64 | 71.59 | 56.38 | 53.90 | 0.87 | 0.87 | 0.87 | 
| Baseline: ISMIR2019 | 82.00 | 81.16 | 79.69 | 66.97 | 65.77 | 0.89 | 0.86 | 0.93 | 
| MD1 | 81.83 | 80.22 | 78.87 | 64.13 | 63.14 | 0.88 | 0.89 | 0.88 | 
| wu-ensemble | 82.54 | 81.29 | 78.99 | 62.84 | 60.84 | 0.87 | 0.89 | 0.87 | 
| wu-single | 81.37 | 79.69 | 77.61 | 61.60 | 59.84 | 0.87 | 0.90 | 0.86 | 
| YK1 | 82.53 | 79.71 | 75.60 | 66.02 | 62.31 | 0.89 | 0.90 | 0.89 | 
YAMAHA_JPop
| Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg | 
|---|---|---|---|---|---|---|---|---|
| Baseline: Chordino | 74.49 | 71.99 | 69.24 | 52.40 | 49.97 | 0.87 | 0.86 | 0.88 | 
| Baseline: ISMIR2019 | 81.49 | 79.99 | 78.58 | 62.81 | 61.61 | 0.90 | 0.87 | 0.94 | 
| MD1 | 79.34 | 77.10 | 76.07 | 55.59 | 54.71 | 0.88 | 0.88 | 0.88 | 
| wu-ensemble | 79.58 | 77.58 | 75.57 | 54.36 | 52.58 | 0.87 | 0.88 | 0.87 | 
| wu-single | 78.87 | 76.56 | 74.66 | 55.35 | 53.60 | 0.87 | 0.89 | 0.86 | 
| YK1 | 80.13 | 77.03 | 72.85 | 61.24 | 57.26 | 0.89 | 0.90 | 0.89 | 
Additional Results
Below are results on datasets that may not be strictly held-out test sets. Some models might have been trained on these datasets; for specific details, please refer to the extended abstracts of each model.
Billboard2012
| Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg | 
|---|---|---|---|---|---|---|---|---|
| Baseline: Chordino | 74.04 | 72.11 | 70.05 | 55.24 | 53.28 | 0.84 | 0.85 | 0.83 | 
| MD1 | 85.11 | 83.98 | 82.76 | 74.12 | 73.12 | 0.89 | 0.89 | 0.90 | 
| wu-ensemble | 78.26 | 77.15 | 75.58 | 59.99 | 58.79 | 0.84 | 0.88 | 0.83 | 
| wu-single | 79.23 | 78.21 | 76.76 | 60.23 | 59.07 | 0.85 | 0.87 | 0.84 | 
| YK1 | 85.90 | 84.66 | 81.81 | 77.22 | 74.45 | 0.88 | 0.88 | 0.90 | 
RWC-Popular
| Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg | 
|---|---|---|---|---|---|---|---|---|
| Baseline: Chordino | 78.97 | 77.78 | 74.13 | 63.15 | 59.72 | 0.89 | 0.88 | 0.90 | 
| MD1 | 83.98 | 81.18 | 79.42 | 66.53 | 64.83 | 0.89 | 0.89 | 0.89 | 
| wu-ensemble | 81.87 | 80.30 | 77.58 | 62.65 | 60.25 | 0.88 | 0.90 | 0.86 | 
| wu-single | 82.48 | 81.35 | 78.48 | 62.86 | 60.28 | 0.88 | 0.89 | 0.87 | 
| YK1 | 88.76 | 87.27 | 81.14 | 76.88 | 70.90 | 0.92 | 0.92 | 0.92 | 
Task Captain's Note
- Results on Billboard & RWC Popular are competible with previous years.
- Evaluation tools: https://github.com/ismir-mirex/ace-task-captain-note
- Model Raw outputs: https://github.com/ismir-mirex/ace-output
- Detailed evaluation results: https://github.com/ismir-mirex/ace-results

