Revision as of 11:30, 11 November 2024

Submissions

Team	Extended Abstract	Methods	Methodology
Chart-Accompaniment		BART	A BART model generating piano accompaniments using beat-based tokenization.
AccoMontage (BL-1)	PDF	Style Transfer	A hybrid algorithm generating piano accompaniments by rule-based search and music representation learning.
Whole-Song-Gen (BL-2)	PDF	DDPM	A denoising diffusion probabilistic model (DDPM) generating piano accompaniments as piano-roll images
Compose-&-Embesslish (BL-3)	PDF	Transformer	A Transformer-based architecture generating piano performances in beat-based event sequences.

Results

Team	Subjective Evaluation				Objective Evaluation
Team	Coherecy ↑	Naturalness ↑	Creativity ↑	Musicality ↑	NLL ↓
Chart-Accompaniment	1.92 ± 0.11^d	1.87 ± 0.10^c	2.62 ± 0.13^c	2.01 ± 0.11^c	4.12 ± 0.12^c
AccoMontage (BL-1)	3.77 ± 0.11^a	3.59 ± 0.11^a	3.65 ± 0.11^a	3.63 ± 0.12^a	2.48 ± 0.07^a
Whole-Song-Gen (BL-2)	3.59 ± 0.11^b	3.24 ± 0.11^b	3.66 ± 0.10^a	3.47 ± 0.13^b	2.87 ± 0.08^b
Compose-&-Embesslish (BL-3)	3.39 ± 0.10^c	3.38 ± 0.12^b	3.13 ± 0.10^b	3.36 ± 0.11^b	7.41 ± 0.07^d

Note: Results are reported in the form of mean ± sem^s (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p < 0.05) based on a Wilcoxon signed rank test.

Objective Evaluation Details: Each model generates 16 samples for each of 6 test pieces. Negative Log Likelihood (NLL) is computed by inputing the molody and accompaniment into the MuseCoco 1B model.

Subjective Evaluation Details: One piece cherry-picked from 16 samples of each test piece, resulting in 6 pages of questions. We collect responses from 22 participants (18 complete submissions and 4 partial submissions). For complete submissions, the average completion time is 16min 59s.

@@ Line 74: / Line 74: @@
 '''Note''': Results are reported in the form of mean ± sem<sup>s</sup> (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p < 0.05) based on a Wilcoxon signed rank test.
 '''Objective Evaluation Details''': Each model generates 16 samples for each of 6 test pieces. Negative Log Likelihood (NLL) is computed by inputing the molody and accompaniment into the MuseCoco 1B model.
 '''Subjective Evaluation Details''': One piece cherry-picked from 16 samples of each test piece, resulting in 6 pages of questions. We collect responses from 22 participants (18 complete submissions and 4 partial submissions). For complete submissions, the average completion time is 16min 59s.

Difference between revisions of "2024:Symbolic Music Generation Results"

Revision as of 11:30, 11 November 2024

Submissions

Results

Navigation menu

Views

Personal tools

MIREX by Year

Results by Year

Account Request

Search

Navigation

Tools