Revision as of 12:00, 11 November 2024

Submissions

Sub Code	Extended Abstract	Contributors	Methods
FZZ1	PDF	Wanpeng Fan, Jiaye Zhu, Peng Zhong	WavLM + Conformer
NUS (baseline)	Link	Xiaoxue Gao,Chitralekha Gupta, Haizhou Li	Genre-informed Silence + Phone Model

Results

Jamendo V1

Jamendo V1 refers to the 20 English songs in the Jamendo dataset (https://github.com/f90/jamendolyrics) with old annotations. This is the dataset used in previous MIREXes to make a fair comparison with the previous submissions.

Group	Average absolute error	Median absolute error	Percentage of correct segments	Percentage of correct onsets with tolerance
FZZ1	0.547	0.047	0.686	0.912
NUS	0.217	0.046	0.751	0.945

Jamendo V2 MultiLang

Jamendo V2 contains all 79 songs in the Jamendo dataset (https://github.com/f90/jamendolyrics) with new annotations.

Group	Average absolute error	Median absolute error	Percentage of correct segments	Percentage of correct onsets with tolerance
FZZ1	0.584	0.252	0.683	0.887
NUS	0.651	0.136	0.502	0.729

Language-Specific Results

Jamendo V2 En

Group	Average absolute error	Median absolute error	Percentage of correct segments	Percentage of correct onsets with tolerance
FZZ1	0.619	0.143	0.698	0.896
NUS	0.216	0.046	0.784	0.947

Jamendo V2 Fr

Group	Average absolute error	Median absolute error	Percentage of correct segments	Percentage of correct onsets with tolerance
FZZ1	0.371	0.045	0.661	0.897
NUS	0.809	0.157	0.400	0.665

Jamendo V2 Gr

Group	Average absolute error	Median absolute error	Percentage of correct segments	Percentage of correct onsets with tolerance
FZZ1	0.371	0.045	0.661	0.897
NUS	0.809	0.157	0.400	0.665

Jamendo V2 Es

Group	Average absolute error	Median absolute error	Percentage of correct segments	Percentage of correct onsets with tolerance
FZZ1	0.452	0.039	0.703	0.905
NUS	0.969	0.230	0.392	0.613

Hansen's dataset a cappella

Notice that Hansen's dataset and Mauch's dataset overlap with commonly used training set (e.g, DALI). The results are shown for reference only.

Results to be uploaded

Hansen's dataset

Results to be uploaded

Mauch's dataset

Results to be uploaded

@@ Line 41: / Line 41: @@
 |- style="vertical-align:bottom;"
 | style="text-align:left;" | NUS
-| 0.217
+| '''0.217'''
-| 0.046
+| '''0.046'''
-| 0.751
+| '''0.751'''
-| 0.945
+| '''0.945'''
 |}
@@ Line 60: / Line 60: @@
 |- style="vertical-align:bottom;"
 | style="text-align:left;" | FZZ1
-| 0.584
+| '''0.584'''
 | 0.252
-| 0.683
+| '''0.683'''
-| 0.887
+| '''0.887'''
 |- style="vertical-align:bottom;"
 | style="text-align:left;" | NUS
 | 0.651
-| 0.136
+| '''0.136'''
 | 0.502
 | 0.729
@@ Line 166: / Line 166: @@
 == Hansen's dataset a cappella ==
+Notice that Hansen's dataset and Mauch's dataset overlap with commonly used training set (e.g, DALI). The results are shown for reference only.
 Results to be uploaded

Difference between revisions of "2024:Lyrics-to-Audio Alignment Results"

Revision as of 12:00, 11 November 2024

Contents

Submissions

Results

Jamendo V1

Jamendo V2 MultiLang

Language-Specific Results

Jamendo V2 En

Jamendo V2 Fr

Jamendo V2 Gr

Jamendo V2 Es

Hansen's dataset a cappella

Hansen's dataset

Mauch's dataset

Navigation menu

Views

Personal tools

MIREX by Year

Results by Year

Account Request

Search

Navigation

Tools