2024:Lyrics-to-Audio Alignment Results
From MIREX Wiki
Revision as of 04:08, 12 November 2024
Submissions
Sub Code | Extended Abstract | Contributors | Methods |
---|---|---|---|
FZZ1 | | Wanpeng Fan, Jiaye Zhu, Peng Zhong | WavLM + Conformer |
NUS (baseline) | Link | Xiaoxue Gao, Chitralekha Gupta, Haizhou Li | Genre-informed Silence + Phone Model |
Results
Jamendo V1
Jamendo V1 refers to the 20 English songs in the Jamendo dataset (https://github.com/f90/jamendolyrics) with the old annotations. This dataset was used in previous MIREX editions and is kept here to allow a fair comparison with earlier submissions.
Group | Average absolute error | Median absolute error | Percentage of correct segments | Percentage of correct onsets with tolerance |
---|---|---|---|---|
FZZ1 | 0.547 | 0.047 | 0.686 | 0.912 |
NUS | 0.217 | 0.046 | 0.751 | 0.945 |
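The four columns reported in these tables can be illustrated with a small sketch. It is not the official MIREX evaluation code: the function names, the assumption that all times are in seconds, the one-to-one pairing of reference and estimated words, and the default tolerance of 0.3 are all assumptions made for illustration.

```python
from statistics import mean, median


def onset_metrics(ref_onsets, est_onsets, tolerance=0.3):
    """Return (average abs error, median abs error, fraction within tolerance).

    Both lists hold one onset time per word, paired by position.
    The tolerance value is an assumption, not taken from this page.
    """
    if not ref_onsets or len(ref_onsets) != len(est_onsets):
        raise ValueError("onset lists must be non-empty and of equal length")
    errors = [abs(est - ref) for ref, est in zip(ref_onsets, est_onsets)]
    within = sum(1 for err in errors if err <= tolerance) / len(errors)
    return mean(errors), median(errors), within


def correct_segment_fraction(ref_segments, est_segments):
    """Fraction of reference word duration overlapped by the paired estimate.

    Each segment is a (start, end) tuple; segments are paired by position.
    """
    if not ref_segments or len(ref_segments) != len(est_segments):
        raise ValueError("segment lists must be non-empty and of equal length")
    overlap = 0.0
    total = 0.0
    for (ref_start, ref_end), (est_start, est_end) in zip(ref_segments, est_segments):
        # Overlap of two intervals, clipped at zero when they are disjoint.
        overlap += max(0.0, min(ref_end, est_end) - max(ref_start, est_start))
        total += ref_end - ref_start
    return overlap / total


# Hypothetical toy data, not drawn from any of the evaluated datasets.
aae, mae, pco = onset_metrics([1.0, 2.0, 3.0, 4.0], [1.25, 2.0, 3.5, 4.125])
pcs = correct_segment_fraction([(0.0, 1.0), (1.0, 2.0)], [(0.0, 0.5), (1.0, 2.0)])
```

Under these assumptions a single badly misaligned song can raise the average error sharply while leaving the median almost unchanged, which may explain rows where the two differ by an order of magnitude.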
Jamendo V2 MultiLang
Jamendo V2 contains all 79 songs in the Jamendo dataset (https://github.com/f90/jamendolyrics) with new annotations.
Group | Average absolute error | Median absolute error | Percentage of correct segments | Percentage of correct onsets with tolerance |
---|---|---|---|---|
FZZ1 | 0.584 | 0.252 | 0.683 | 0.887 |
NUS | 0.651 | 0.136 | 0.502 | 0.729 |
Language-Specific Results
Jamendo V2 En
Group | Average absolute error | Median absolute error | Percentage of correct segments | Percentage of correct onsets with tolerance |
---|---|---|---|---|
FZZ1 | 0.619 | 0.143 | 0.698 | 0.896 |
NUS | 0.216 | 0.046 | 0.784 | 0.947 |
Jamendo V2 Fr
Group | Average absolute error | Median absolute error | Percentage of correct segments | Percentage of correct onsets with tolerance |
---|---|---|---|---|
FZZ1 | 0.371 | 0.045 | 0.661 | 0.897 |
NUS | 0.809 | 0.157 | 0.400 | 0.665 |
Jamendo V2 Gr
Group | Average absolute error | Median absolute error | Percentage of correct segments | Percentage of correct onsets with tolerance |
---|---|---|---|---|
FZZ1 | 0.371 | 0.045 | 0.661 | 0.897 |
NUS | 0.809 | 0.157 | 0.400 | 0.665 |
Jamendo V2 Es
Group | Average absolute error | Median absolute error | Percentage of correct segments | Percentage of correct onsets with tolerance |
---|---|---|---|---|
FZZ1 | 0.452 | 0.039 | 0.703 | 0.905 |
NUS | 0.969 | 0.230 | 0.392 | 0.613 |
Hansen's dataset a cappella
Note that Hansen's dataset and Mauch's dataset overlap with commonly used training sets (e.g., DALI). The results are shown for reference only.
Group | Average absolute error | Median absolute error | Percentage of correct segments | Percentage of correct onsets with tolerance |
---|---|---|---|---|
FZZ1 | 0.101 | 0.044 | 0.783 | 0.971 |
NUS | 0.132 | 0.031 | 0.791 | 0.965 |
Hansen's dataset
Note that Hansen's dataset and Mauch's dataset overlap with commonly used training sets (e.g., DALI). The results are shown for reference only.
Group | Average absolute error | Median absolute error | Percentage of correct segments | Percentage of correct onsets with tolerance |
---|---|---|---|---|
FZZ1 | 3.264 | 3.604 | 0.648 | 0.870 |
NUS | 0.107 | 0.052 | 0.764 | 0.972 |
Mauch's dataset
Results to be uploaded