2025:Audio Chord Estimation Results
From MIREX Wiki
This page is still WIP. More submissions and descriptions may appear.
Contents
Submissions
Submission | Title | Authors | |
---|---|---|---|
Baseline: Chordino | NNLS Chroma v1.1 | Link | |
Baseline: ISMIR2019 | Large-Vocabulary Chord Transcription via Chord Structure Decomposition | Link | |
MD1 | Degree-Based Automatic Chord Recognition with Enharmonic Distinction | TBA | Muhammad Waseem Akram et al. [*] |
wu-ensemble | wu-ensemble | TBA | Yiwei Ding, Christof Weiß |
wu-single | wu-single | TBA | Yiwei Ding, Christof Weiß |
YK1 | Semi-Supervised Audio Chord Estimator Based on Disentangled Generative Modeling | TBA | Yiming Wu, Kento Yoshida |
BMACE | A Mamba-Based Model for Automatic Chord Recognition | TBA | Chunyu Yuan, Jiyeoung Sim, Johanna Devaney |
[*] Please submit an extended abstract containing the full author list.
Test Sets
Main Test Sets
The following datasets are served as pure test sets. No system is allowed to train on them.
- Billboard 2013: The held-out portion of the McGill Billboard dataset, containing mainly western pop songs from the Billboard chart.
- Yamaha_JPOP: A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs.
- Yamaha_Balanced: A private dataset annotated by Yamaha Corporation. The dataset contains 241 songs. While it is still biased towards JPOP songs, the dataset covers a wider range of genres: J.Pop (10.37%), Rock (10.37%), J.Enka (10.37%), J.Kayoukyoku (10.37%), Soundtrack (10.37%), Western Pop (10.37%), Children's Song (10.37%), R&B (6.22%), Hiphop (4.56%), Jazz (2.49%), Dance (2.49%), World (2.07%), Techno (1.24%), Easy listening (1.24%), J.Minyou (1.24%), Others (5.81%).
Additional Test Sets
These are datasets that may not be strictly held-out test sets. Some models might have been trained on these datasets; for specific details, please refer to the extended abstracts of each model.
- Billboard 2012: The public portion of the McGill Billboard dataset.
- RWC Popular: 100 pop songs from the RWC (Real World Computing) Music Database. 20% songs with English lyrics and 80% songs with Japanese lyrics.
Main Results
The following datasets are served as pure test sets. No system is allowed to train on them.
Billboard2013
Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg |
---|---|---|---|---|---|---|---|---|
Baseline: Chordino | 71.06 | 67.18 | 65.09 | 48.88 | 47.06 | 0.82 | 0.83 | 0.83 |
Baseline: ISMIR2019 | 78.61 | 76.39 | 74.72 | 64.15 | 62.65 | 0.83 | 0.79 | 0.93 |
MD1 | 81.35 | 79.15 | 77.91 | 66.40 | 65.33 | 0.86 | 0.85 | 0.89 |
wu-ensemble | 74.64 | 71.97 | 70.72 | 55.06 | 53.96 | 0.83 | 0.86 | 0.82 |
wu-single | 75.77 | 73.14 | 71.74 | 55.41 | 54.15 | 0.83 | 0.85 | 0.83 |
YK1 | 81.01 | 78.10 | 75.41 | 64.53 | 62.05 | 0.86 | 0.85 | 0.87 |
YAMAHA_Balanced
Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg |
---|---|---|---|---|---|---|---|---|
Baseline: Chordino | 77.57 | 74.64 | 71.59 | 56.38 | 53.90 | 0.87 | 0.87 | 0.87 |
Baseline: ISMIR2019 | 82.00 | 81.16 | 79.69 | 66.97 | 65.77 | 0.89 | 0.86 | 0.93 |
MD1 | 81.83 | 80.22 | 78.87 | 64.13 | 63.14 | 0.88 | 0.89 | 0.88 |
wu-ensemble | 82.54 | 81.29 | 78.99 | 62.84 | 60.84 | 0.87 | 0.89 | 0.87 |
wu-single | 81.37 | 79.69 | 77.61 | 61.60 | 59.84 | 0.87 | 0.90 | 0.86 |
YK1 | 82.53 | 79.71 | 75.60 | 66.02 | 62.31 | 0.89 | 0.90 | 0.89 |
YAMAHA_JPop
Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg |
---|---|---|---|---|---|---|---|---|
Baseline: Chordino | 74.49 | 71.99 | 69.24 | 52.40 | 49.97 | 0.87 | 0.86 | 0.88 |
Baseline: ISMIR2019 | 81.49 | 79.99 | 78.58 | 62.81 | 61.61 | 0.90 | 0.87 | 0.94 |
MD1 | 79.34 | 77.10 | 76.07 | 55.59 | 54.71 | 0.88 | 0.88 | 0.88 |
wu-ensemble | 79.58 | 77.58 | 75.57 | 54.36 | 52.58 | 0.87 | 0.88 | 0.87 |
wu-single | 78.87 | 76.56 | 74.66 | 55.35 | 53.60 | 0.87 | 0.89 | 0.86 |
YK1 | 80.13 | 77.03 | 72.85 | 61.24 | 57.26 | 0.89 | 0.90 | 0.89 |
Additional Results
Below are results on datasets that may not be strictly held-out test sets. Some models might have been trained on these datasets; for specific details, please refer to the extended abstracts of each model.
Billboard2012
Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg |
---|---|---|---|---|---|---|---|---|
Baseline: Chordino | 74.04 | 72.11 | 70.05 | 55.24 | 53.28 | 0.84 | 0.85 | 0.83 |
MD1 | 85.11 | 83.98 | 82.76 | 74.12 | 73.12 | 0.89 | 0.89 | 0.90 |
wu-ensemble | 78.26 | 77.15 | 75.58 | 59.99 | 58.79 | 0.84 | 0.88 | 0.83 |
wu-single | 79.23 | 78.21 | 76.76 | 60.23 | 59.07 | 0.85 | 0.87 | 0.84 |
YK1 | 85.90 | 84.66 | 81.81 | 77.22 | 74.45 | 0.88 | 0.88 | 0.90 |
RWC-Popular
Group | MirexRoot | MirexMajMin | MirexMajMinBass | MirexSevenths | MirexSeventhsBass | MeanSeg | UnderSeg | OverSeg |
---|---|---|---|---|---|---|---|---|
Baseline: Chordino | 78.97 | 77.78 | 74.13 | 63.15 | 59.72 | 0.89 | 0.88 | 0.90 |
MD1 | 83.98 | 81.18 | 79.42 | 66.53 | 64.83 | 0.89 | 0.89 | 0.89 |
wu-ensemble | 81.87 | 80.30 | 77.58 | 62.65 | 60.25 | 0.88 | 0.90 | 0.86 |
wu-single | 82.48 | 81.35 | 78.48 | 62.86 | 60.28 | 0.88 | 0.89 | 0.87 |
YK1 | 88.76 | 87.27 | 81.14 | 76.88 | 70.90 | 0.92 | 0.92 | 0.92 |
Task Captain's Note
- Results on Billboard & RWC Popular are competible with previous years.
- Evaluation tools: https://github.com/ismir-mirex/ace-task-captain-note
- Model Raw outputs: https://github.com/ismir-mirex/ace-output
- Detailed evaluation results: https://github.com/ismir-mirex/ace-results