<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://music-ir.org/mirex/w/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Zhaojw1998</id>
	<title>MIREX Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://music-ir.org/mirex/w/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Zhaojw1998"/>
	<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/wiki/Special:Contributions/Zhaojw1998"/>
	<updated>2026-05-20T19:09:07Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.31.1</generator>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14815</id>
		<title>2025:Audio Beat Tracking Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14815"/>
		<updated>2025-09-13T06:04:44Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Test Sets */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Submissions ==&lt;br /&gt;
&lt;br /&gt;
This page is still WIP. More submissions might appear later.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Submission&lt;br /&gt;
! Title&lt;br /&gt;
! PDF&lt;br /&gt;
! Authors&lt;br /&gt;
|-&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | Baseline: CD1&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | QM Tempo Tracker&lt;br /&gt;
| [https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Baseline: BeatThis&lt;br /&gt;
| Beat This! Accurate Beat Tracking Without DBN Postprocessing&lt;br /&gt;
| [https://github.com/CPJKU/beat_this Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| KG-ApolloBeats&lt;br /&gt;
| The 2025 KG Music Beats Tracking System&lt;br /&gt;
| TBA&lt;br /&gt;
| DingKun Xiao, Haijun Cai, Chuanyi Chen&lt;br /&gt;
|- &lt;br /&gt;
| BeatU&lt;br /&gt;
| Beat-U: Multi-Task Music Understanding with Hierarchical Timescales&lt;br /&gt;
| TBA&lt;br /&gt;
| YAMAHA Corporation&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Test Sets ==&lt;br /&gt;
&lt;br /&gt;
* '''GTZAN''': 999 songs from the GTZAN dataset (starting from next year, training on GTZAN will be disallowed)&lt;br /&gt;
* '''SMC''': 217 songs from the SMC collection&lt;br /&gt;
* '''Yamaha_JPOP''': A private dataset annotated by Yamaha Corporation. The dataset contains 250 JPOP songs.&lt;br /&gt;
* '''Yamaha_Balanced''': A private dataset annotated by Yamaha Corporation. The dataset contains 239 songs. While it is still biased towards JPOP songs, the dataset covers a wider range of genres: J.Pop (10.37%), Rock (10.37%), J.Enka (10.37%), J.Kayoukyoku (10.37%), Soundtrack (10.37%), Western Pop (10.37%), Children's Song (10.37%), R&amp;amp;B (6.22%), Hiphop (4.56%), Jazz (2.49%), Dance (2.49%), World (2.07%), Techno (1.24%), Easy listening (1.24%), J.Minyou (1.24%), Others (5.81%).&lt;br /&gt;
&lt;br /&gt;
== GTZAN ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 84.93&lt;br /&gt;
| 68.23&lt;br /&gt;
| 64.06&lt;br /&gt;
| 84.27&lt;br /&gt;
| 71.07&lt;br /&gt;
| 75.26&lt;br /&gt;
| 78.78&lt;br /&gt;
| 84.27&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 92.53&lt;br /&gt;
| 80.66&lt;br /&gt;
| 79.38&lt;br /&gt;
| 93.55&lt;br /&gt;
| 83.44&lt;br /&gt;
| 88.49&lt;br /&gt;
| 86.86&lt;br /&gt;
| 92.77&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 89.02&lt;br /&gt;
| 80.15&lt;br /&gt;
| 72.27&lt;br /&gt;
| 88.00&lt;br /&gt;
| 76.00&lt;br /&gt;
| 79.64&lt;br /&gt;
| 84.63&lt;br /&gt;
| 90.01&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 81.19&lt;br /&gt;
| 69.64&lt;br /&gt;
| 62.06&lt;br /&gt;
| 79.97&lt;br /&gt;
| 65.02&lt;br /&gt;
| 66.94&lt;br /&gt;
| 83.08&lt;br /&gt;
| 86.69&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
The baseline BeatThis reports different results compared to the paper because it uses a different number of test songs (999 vs. 993).&lt;br /&gt;
&lt;br /&gt;
== SMC ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU*&lt;br /&gt;
| 53.14&lt;br /&gt;
| 40.67&lt;br /&gt;
| 14.75&lt;br /&gt;
| 63.52&lt;br /&gt;
| 27.24&lt;br /&gt;
| 41.16&lt;br /&gt;
| 30.88&lt;br /&gt;
| 47.44&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 74.15&lt;br /&gt;
| 57.07&lt;br /&gt;
| 32.72&lt;br /&gt;
| 84.51&lt;br /&gt;
| 53.73&lt;br /&gt;
| 72.66&lt;br /&gt;
| 55.96&lt;br /&gt;
| 76.22&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis*&lt;br /&gt;
| 71.81&lt;br /&gt;
| 55.64&lt;br /&gt;
| 27.19&lt;br /&gt;
| 82.91&lt;br /&gt;
| 49.78&lt;br /&gt;
| 69.89&lt;br /&gt;
| 51.15&lt;br /&gt;
| 72.30&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 33.66&lt;br /&gt;
| 26.29&lt;br /&gt;
| 6.91&lt;br /&gt;
| 45.10&lt;br /&gt;
| 9.88&lt;br /&gt;
| 13.12&lt;br /&gt;
| 17.99&lt;br /&gt;
| 29.48&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_Balanced ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 92.55&lt;br /&gt;
| 82.28&lt;br /&gt;
| 89.12&lt;br /&gt;
| 93.57&lt;br /&gt;
| 83.62&lt;br /&gt;
| 90.15&lt;br /&gt;
| 85.84&lt;br /&gt;
| 93.11&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 91.97&lt;br /&gt;
| 81.29&lt;br /&gt;
| 88.28&lt;br /&gt;
| 92.54&lt;br /&gt;
| 79.91&lt;br /&gt;
| 88.30&lt;br /&gt;
| 82.26&lt;br /&gt;
| 91.79&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 90.59&lt;br /&gt;
| 79.43&lt;br /&gt;
| 81.59&lt;br /&gt;
| 91.42&lt;br /&gt;
| 64.94&lt;br /&gt;
| 84.52&lt;br /&gt;
| 66.87&lt;br /&gt;
| 87.73&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 76.43&lt;br /&gt;
| 67.85&lt;br /&gt;
| 64.44&lt;br /&gt;
| 74.42&lt;br /&gt;
| 55.13&lt;br /&gt;
| 59.47&lt;br /&gt;
| 71.86&lt;br /&gt;
| 83.63&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_JPop ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 96.58&lt;br /&gt;
| 88.94&lt;br /&gt;
| 95.20&lt;br /&gt;
| 96.73&lt;br /&gt;
| 92.32&lt;br /&gt;
| 94.46&lt;br /&gt;
| 94.65&lt;br /&gt;
| 97.05&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 95.39&lt;br /&gt;
| 86.66&lt;br /&gt;
| 93.20&lt;br /&gt;
| 94.63&lt;br /&gt;
| 82.87&lt;br /&gt;
| 90.54&lt;br /&gt;
| 84.72&lt;br /&gt;
| 93.48&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 94.00&lt;br /&gt;
| 84.08&lt;br /&gt;
| 86.80&lt;br /&gt;
| 93.15&lt;br /&gt;
| 69.66&lt;br /&gt;
| 86.64&lt;br /&gt;
| 71.39&lt;br /&gt;
| 89.57&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 77.38&lt;br /&gt;
| 70.76&lt;br /&gt;
| 64.40&lt;br /&gt;
| 73.55&lt;br /&gt;
| 54.98&lt;br /&gt;
| 58.71&lt;br /&gt;
| 77.28&lt;br /&gt;
| 85.29&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Comparison with Previous MIREXes ==&lt;br /&gt;
&lt;br /&gt;
Since this year we have switched to using mir_eval for evaluation, some results may differ from those in previous MIREX editions due to differences in implementation. We confirm that the following metrics remain comparable with previous MIREX results:&lt;br /&gt;
&lt;br /&gt;
* Comparable: F1, Goto, CMLc, CMLt, AMLc, AMLt.&lt;br /&gt;
* Not comparable: Cemgil, P-score.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14814</id>
		<title>2025:Audio Beat Tracking Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14814"/>
		<updated>2025-09-13T05:57:03Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* SMC */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Submissions ==&lt;br /&gt;
&lt;br /&gt;
This page is still WIP. More submissions might appear later.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Submission&lt;br /&gt;
! Title&lt;br /&gt;
! PDF&lt;br /&gt;
! Authors&lt;br /&gt;
|-&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | Baseline: CD1&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | QM Tempo Tracker&lt;br /&gt;
| [https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Baseline: BeatThis&lt;br /&gt;
| Beat This! Accurate Beat Tracking Without DBN Postprocessing&lt;br /&gt;
| [https://github.com/CPJKU/beat_this Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| KG-ApolloBeats&lt;br /&gt;
| The 2025 KG Music Beats Tracking System&lt;br /&gt;
| TBA&lt;br /&gt;
| DingKun Xiao, Haijun Cai, Chuanyi Chen&lt;br /&gt;
|- &lt;br /&gt;
| BeatU&lt;br /&gt;
| Beat-U: Multi-Task Music Understanding with Hierarchical Timescales&lt;br /&gt;
| TBA&lt;br /&gt;
| YAMAHA Corporation&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Test Sets ==&lt;br /&gt;
&lt;br /&gt;
* '''GTZAN''': 999 songs from the GTZAN dataset (starting from next year, training on GTZAN will be disallowed)&lt;br /&gt;
* '''SMC''': 217 songs from the SMC collection&lt;br /&gt;
* '''Yamaha_JPOP''': A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs.&lt;br /&gt;
* '''Yamaha_Balanced''': A private dataset annotated by Yamaha Corporation. The dataset contains 241 songs. While it is still biased towards JPOP songs, the dataset covers a wider range of genres: J.Pop (10.37%), Rock (10.37%), J.Enka (10.37%), J.Kayoukyoku (10.37%), Soundtrack (10.37%), Western Pop (10.37%), Children's Song (10.37%), R&amp;amp;B (6.22%), Hiphop (4.56%), Jazz (2.49%), Dance (2.49%), World (2.07%), Techno (1.24%), Easy listening (1.24%), J.Minyou (1.24%), Others (5.81%).&lt;br /&gt;
&lt;br /&gt;
== GTZAN ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 84.93&lt;br /&gt;
| 68.23&lt;br /&gt;
| 64.06&lt;br /&gt;
| 84.27&lt;br /&gt;
| 71.07&lt;br /&gt;
| 75.26&lt;br /&gt;
| 78.78&lt;br /&gt;
| 84.27&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 92.53&lt;br /&gt;
| 80.66&lt;br /&gt;
| 79.38&lt;br /&gt;
| 93.55&lt;br /&gt;
| 83.44&lt;br /&gt;
| 88.49&lt;br /&gt;
| 86.86&lt;br /&gt;
| 92.77&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 89.02&lt;br /&gt;
| 80.15&lt;br /&gt;
| 72.27&lt;br /&gt;
| 88.00&lt;br /&gt;
| 76.00&lt;br /&gt;
| 79.64&lt;br /&gt;
| 84.63&lt;br /&gt;
| 90.01&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 81.19&lt;br /&gt;
| 69.64&lt;br /&gt;
| 62.06&lt;br /&gt;
| 79.97&lt;br /&gt;
| 65.02&lt;br /&gt;
| 66.94&lt;br /&gt;
| 83.08&lt;br /&gt;
| 86.69&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
The baseline BeatThis reports different results compared to the paper because it uses a different number of test songs (999 vs. 993).&lt;br /&gt;
&lt;br /&gt;
== SMC ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU*&lt;br /&gt;
| 53.14&lt;br /&gt;
| 40.67&lt;br /&gt;
| 14.75&lt;br /&gt;
| 63.52&lt;br /&gt;
| 27.24&lt;br /&gt;
| 41.16&lt;br /&gt;
| 30.88&lt;br /&gt;
| 47.44&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 74.15&lt;br /&gt;
| 57.07&lt;br /&gt;
| 32.72&lt;br /&gt;
| 84.51&lt;br /&gt;
| 53.73&lt;br /&gt;
| 72.66&lt;br /&gt;
| 55.96&lt;br /&gt;
| 76.22&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis*&lt;br /&gt;
| 71.81&lt;br /&gt;
| 55.64&lt;br /&gt;
| 27.19&lt;br /&gt;
| 82.91&lt;br /&gt;
| 49.78&lt;br /&gt;
| 69.89&lt;br /&gt;
| 51.15&lt;br /&gt;
| 72.30&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 33.66&lt;br /&gt;
| 26.29&lt;br /&gt;
| 6.91&lt;br /&gt;
| 45.10&lt;br /&gt;
| 9.88&lt;br /&gt;
| 13.12&lt;br /&gt;
| 17.99&lt;br /&gt;
| 29.48&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_Balanced ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 92.55&lt;br /&gt;
| 82.28&lt;br /&gt;
| 89.12&lt;br /&gt;
| 93.57&lt;br /&gt;
| 83.62&lt;br /&gt;
| 90.15&lt;br /&gt;
| 85.84&lt;br /&gt;
| 93.11&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 91.97&lt;br /&gt;
| 81.29&lt;br /&gt;
| 88.28&lt;br /&gt;
| 92.54&lt;br /&gt;
| 79.91&lt;br /&gt;
| 88.30&lt;br /&gt;
| 82.26&lt;br /&gt;
| 91.79&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 90.59&lt;br /&gt;
| 79.43&lt;br /&gt;
| 81.59&lt;br /&gt;
| 91.42&lt;br /&gt;
| 64.94&lt;br /&gt;
| 84.52&lt;br /&gt;
| 66.87&lt;br /&gt;
| 87.73&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 76.43&lt;br /&gt;
| 67.85&lt;br /&gt;
| 64.44&lt;br /&gt;
| 74.42&lt;br /&gt;
| 55.13&lt;br /&gt;
| 59.47&lt;br /&gt;
| 71.86&lt;br /&gt;
| 83.63&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_JPop ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 96.58&lt;br /&gt;
| 88.94&lt;br /&gt;
| 95.20&lt;br /&gt;
| 96.73&lt;br /&gt;
| 92.32&lt;br /&gt;
| 94.46&lt;br /&gt;
| 94.65&lt;br /&gt;
| 97.05&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 95.39&lt;br /&gt;
| 86.66&lt;br /&gt;
| 93.20&lt;br /&gt;
| 94.63&lt;br /&gt;
| 82.87&lt;br /&gt;
| 90.54&lt;br /&gt;
| 84.72&lt;br /&gt;
| 93.48&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 94.00&lt;br /&gt;
| 84.08&lt;br /&gt;
| 86.80&lt;br /&gt;
| 93.15&lt;br /&gt;
| 69.66&lt;br /&gt;
| 86.64&lt;br /&gt;
| 71.39&lt;br /&gt;
| 89.57&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 77.38&lt;br /&gt;
| 70.76&lt;br /&gt;
| 64.40&lt;br /&gt;
| 73.55&lt;br /&gt;
| 54.98&lt;br /&gt;
| 58.71&lt;br /&gt;
| 77.28&lt;br /&gt;
| 85.29&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Comparison with Previous MIREXes ==&lt;br /&gt;
&lt;br /&gt;
Since this year we have switched to using mir_eval for evaluation, some results may differ from those in previous MIREX editions due to differences in implementation. We confirm that the following metrics remain comparable with previous MIREX results:&lt;br /&gt;
&lt;br /&gt;
* Comparable: F1, Goto, CMLc, CMLt, AMLc, AMLt.&lt;br /&gt;
* Not comparable: Cemgil, P-score.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14813</id>
		<title>2025:Audio Beat Tracking Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14813"/>
		<updated>2025-09-13T05:56:27Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Submissions */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Submissions ==&lt;br /&gt;
&lt;br /&gt;
This page is still WIP. More submissions might appear later.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Submission&lt;br /&gt;
! Title&lt;br /&gt;
! PDF&lt;br /&gt;
! Authors&lt;br /&gt;
|-&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | Baseline: CD1&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | QM Tempo Tracker&lt;br /&gt;
| [https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Baseline: BeatThis&lt;br /&gt;
| Beat This! Accurate Beat Tracking Without DBN Postprocessing&lt;br /&gt;
| [https://github.com/CPJKU/beat_this Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| KG-ApolloBeats&lt;br /&gt;
| The 2025 KG Music Beats Tracking System&lt;br /&gt;
| TBA&lt;br /&gt;
| DingKun Xiao, Haijun Cai, Chuanyi Chen&lt;br /&gt;
|- &lt;br /&gt;
| BeatU&lt;br /&gt;
| Beat-U: Multi-Task Music Understanding with Hierarchical Timescales&lt;br /&gt;
| TBA&lt;br /&gt;
| YAMAHA Corporation&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Test Sets ==&lt;br /&gt;
&lt;br /&gt;
* '''GTZAN''': 999 songs from the GTZAN dataset (starting from next year, training on GTZAN will be disallowed)&lt;br /&gt;
* '''SMC''': 217 songs from the SMC collection&lt;br /&gt;
* '''Yamaha_JPOP''': A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs.&lt;br /&gt;
* '''Yamaha_Balanced''': A private dataset annotated by Yamaha Corporation. The dataset contains 241 songs. While it is still biased towards JPOP songs, the dataset covers a wider range of genres: J.Pop (10.37%), Rock (10.37%), J.Enka (10.37%), J.Kayoukyoku (10.37%), Soundtrack (10.37%), Western Pop (10.37%), Children's Song (10.37%), R&amp;amp;B (6.22%), Hiphop (4.56%), Jazz (2.49%), Dance (2.49%), World (2.07%), Techno (1.24%), Easy listening (1.24%), J.Minyou (1.24%), Others (5.81%).&lt;br /&gt;
&lt;br /&gt;
== GTZAN ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 84.93&lt;br /&gt;
| 68.23&lt;br /&gt;
| 64.06&lt;br /&gt;
| 84.27&lt;br /&gt;
| 71.07&lt;br /&gt;
| 75.26&lt;br /&gt;
| 78.78&lt;br /&gt;
| 84.27&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 92.53&lt;br /&gt;
| 80.66&lt;br /&gt;
| 79.38&lt;br /&gt;
| 93.55&lt;br /&gt;
| 83.44&lt;br /&gt;
| 88.49&lt;br /&gt;
| 86.86&lt;br /&gt;
| 92.77&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 89.02&lt;br /&gt;
| 80.15&lt;br /&gt;
| 72.27&lt;br /&gt;
| 88.00&lt;br /&gt;
| 76.00&lt;br /&gt;
| 79.64&lt;br /&gt;
| 84.63&lt;br /&gt;
| 90.01&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 81.19&lt;br /&gt;
| 69.64&lt;br /&gt;
| 62.06&lt;br /&gt;
| 79.97&lt;br /&gt;
| 65.02&lt;br /&gt;
| 66.94&lt;br /&gt;
| 83.08&lt;br /&gt;
| 86.69&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
The baseline BeatThis reports different results compared to the paper because it uses a different number of test songs (999 vs. 993).&lt;br /&gt;
&lt;br /&gt;
== SMC ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 53.14&lt;br /&gt;
| 40.67&lt;br /&gt;
| 14.75&lt;br /&gt;
| 63.52&lt;br /&gt;
| 27.24&lt;br /&gt;
| 41.16&lt;br /&gt;
| 30.88&lt;br /&gt;
| 47.44&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 74.15&lt;br /&gt;
| 57.07&lt;br /&gt;
| 32.72&lt;br /&gt;
| 84.51&lt;br /&gt;
| 53.73&lt;br /&gt;
| 72.66&lt;br /&gt;
| 55.96&lt;br /&gt;
| 76.22&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis*&lt;br /&gt;
| 71.81&lt;br /&gt;
| 55.64&lt;br /&gt;
| 27.19&lt;br /&gt;
| 82.91&lt;br /&gt;
| 49.78&lt;br /&gt;
| 69.89&lt;br /&gt;
| 51.15&lt;br /&gt;
| 72.30&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 33.66&lt;br /&gt;
| 26.29&lt;br /&gt;
| 6.91&lt;br /&gt;
| 45.10&lt;br /&gt;
| 9.88&lt;br /&gt;
| 13.12&lt;br /&gt;
| 17.99&lt;br /&gt;
| 29.48&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_Balanced ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 92.55&lt;br /&gt;
| 82.28&lt;br /&gt;
| 89.12&lt;br /&gt;
| 93.57&lt;br /&gt;
| 83.62&lt;br /&gt;
| 90.15&lt;br /&gt;
| 85.84&lt;br /&gt;
| 93.11&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 91.97&lt;br /&gt;
| 81.29&lt;br /&gt;
| 88.28&lt;br /&gt;
| 92.54&lt;br /&gt;
| 79.91&lt;br /&gt;
| 88.30&lt;br /&gt;
| 82.26&lt;br /&gt;
| 91.79&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 90.59&lt;br /&gt;
| 79.43&lt;br /&gt;
| 81.59&lt;br /&gt;
| 91.42&lt;br /&gt;
| 64.94&lt;br /&gt;
| 84.52&lt;br /&gt;
| 66.87&lt;br /&gt;
| 87.73&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 76.43&lt;br /&gt;
| 67.85&lt;br /&gt;
| 64.44&lt;br /&gt;
| 74.42&lt;br /&gt;
| 55.13&lt;br /&gt;
| 59.47&lt;br /&gt;
| 71.86&lt;br /&gt;
| 83.63&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_JPop ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 96.58&lt;br /&gt;
| 88.94&lt;br /&gt;
| 95.20&lt;br /&gt;
| 96.73&lt;br /&gt;
| 92.32&lt;br /&gt;
| 94.46&lt;br /&gt;
| 94.65&lt;br /&gt;
| 97.05&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 95.39&lt;br /&gt;
| 86.66&lt;br /&gt;
| 93.20&lt;br /&gt;
| 94.63&lt;br /&gt;
| 82.87&lt;br /&gt;
| 90.54&lt;br /&gt;
| 84.72&lt;br /&gt;
| 93.48&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 94.00&lt;br /&gt;
| 84.08&lt;br /&gt;
| 86.80&lt;br /&gt;
| 93.15&lt;br /&gt;
| 69.66&lt;br /&gt;
| 86.64&lt;br /&gt;
| 71.39&lt;br /&gt;
| 89.57&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 77.38&lt;br /&gt;
| 70.76&lt;br /&gt;
| 64.40&lt;br /&gt;
| 73.55&lt;br /&gt;
| 54.98&lt;br /&gt;
| 58.71&lt;br /&gt;
| 77.28&lt;br /&gt;
| 85.29&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Comparison with Previous MIREXes ==&lt;br /&gt;
&lt;br /&gt;
Since this year we have switched to using mir_eval for evaluation, some results may differ from those in previous MIREX editions due to differences in implementation. We confirm that the following metrics remain comparable with previous MIREX results:&lt;br /&gt;
&lt;br /&gt;
* Comparable: F1, Goto, CMLc, CMLt, AMLc, AMLt.&lt;br /&gt;
* Not comparable: Cemgil, P-score.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14812</id>
		<title>2025:Audio Beat Tracking Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14812"/>
		<updated>2025-09-13T05:56:00Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* GTZAN */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Submissions ==&lt;br /&gt;
&lt;br /&gt;
This page is still WIP. More submissions might appear later.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Submission&lt;br /&gt;
! Title&lt;br /&gt;
! PDF&lt;br /&gt;
! Authors&lt;br /&gt;
|-&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | Baseline: CD1&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | QM Tempo Tracker&lt;br /&gt;
| [https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Baseline: BeatThis&lt;br /&gt;
| Beat This! Accurate Beat Tracking Without DBN Postprocessing&lt;br /&gt;
| [https://github.com/CPJKU/beat_this Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| KG-ApolloBeats&lt;br /&gt;
| The 2025 KG Music Beats Tracking System&lt;br /&gt;
| TBA&lt;br /&gt;
| DingKun Xiao, Haijun Cai, Chuanyi Chen&lt;br /&gt;
|- &lt;br /&gt;
| Beat-U&lt;br /&gt;
| Beat-U: Multi-Task Music Understanding with Hierarchical Timescales&lt;br /&gt;
| TBA&lt;br /&gt;
| YAMAHA Corporation&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Test Sets ==&lt;br /&gt;
&lt;br /&gt;
* '''GTZAN''': 999 songs from the GTZAN dataset (starting from next year, training on GTZAN will be disallowed)&lt;br /&gt;
* '''SMC''': 217 songs from the SMC collection&lt;br /&gt;
* '''Yamaha_JPOP''': A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs.&lt;br /&gt;
* '''Yamaha_Balanced''': A private dataset annotated by Yamaha Corporation. The dataset contains 241 songs. While it is still biased towards JPOP songs, the dataset covers a wider range of genres: J.Pop (10.37%), Rock (10.37%), J.Enka (10.37%), J.Kayoukyoku (10.37%), Soundtrack (10.37%), Western Pop (10.37%), Children's Song (10.37%), R&amp;amp;B (6.22%), Hiphop (4.56%), Jazz (2.49%), Dance (2.49%), World (2.07%), Techno (1.24%), Easy listening (1.24%), J.Minyou (1.24%), Others (5.81%).&lt;br /&gt;
&lt;br /&gt;
== GTZAN ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 84.93&lt;br /&gt;
| 68.23&lt;br /&gt;
| 64.06&lt;br /&gt;
| 84.27&lt;br /&gt;
| 71.07&lt;br /&gt;
| 75.26&lt;br /&gt;
| 78.78&lt;br /&gt;
| 84.27&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 92.53&lt;br /&gt;
| 80.66&lt;br /&gt;
| 79.38&lt;br /&gt;
| 93.55&lt;br /&gt;
| 83.44&lt;br /&gt;
| 88.49&lt;br /&gt;
| 86.86&lt;br /&gt;
| 92.77&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 89.02&lt;br /&gt;
| 80.15&lt;br /&gt;
| 72.27&lt;br /&gt;
| 88.00&lt;br /&gt;
| 76.00&lt;br /&gt;
| 79.64&lt;br /&gt;
| 84.63&lt;br /&gt;
| 90.01&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 81.19&lt;br /&gt;
| 69.64&lt;br /&gt;
| 62.06&lt;br /&gt;
| 79.97&lt;br /&gt;
| 65.02&lt;br /&gt;
| 66.94&lt;br /&gt;
| 83.08&lt;br /&gt;
| 86.69&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
The baseline BeatThis reports different results compared to the paper because it uses a different number of test songs (999 vs. 993).&lt;br /&gt;
&lt;br /&gt;
== SMC ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 53.14&lt;br /&gt;
| 40.67&lt;br /&gt;
| 14.75&lt;br /&gt;
| 63.52&lt;br /&gt;
| 27.24&lt;br /&gt;
| 41.16&lt;br /&gt;
| 30.88&lt;br /&gt;
| 47.44&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 74.15&lt;br /&gt;
| 57.07&lt;br /&gt;
| 32.72&lt;br /&gt;
| 84.51&lt;br /&gt;
| 53.73&lt;br /&gt;
| 72.66&lt;br /&gt;
| 55.96&lt;br /&gt;
| 76.22&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis*&lt;br /&gt;
| 71.81&lt;br /&gt;
| 55.64&lt;br /&gt;
| 27.19&lt;br /&gt;
| 82.91&lt;br /&gt;
| 49.78&lt;br /&gt;
| 69.89&lt;br /&gt;
| 51.15&lt;br /&gt;
| 72.30&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 33.66&lt;br /&gt;
| 26.29&lt;br /&gt;
| 6.91&lt;br /&gt;
| 45.10&lt;br /&gt;
| 9.88&lt;br /&gt;
| 13.12&lt;br /&gt;
| 17.99&lt;br /&gt;
| 29.48&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_Balanced ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 92.55&lt;br /&gt;
| 82.28&lt;br /&gt;
| 89.12&lt;br /&gt;
| 93.57&lt;br /&gt;
| 83.62&lt;br /&gt;
| 90.15&lt;br /&gt;
| 85.84&lt;br /&gt;
| 93.11&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 91.97&lt;br /&gt;
| 81.29&lt;br /&gt;
| 88.28&lt;br /&gt;
| 92.54&lt;br /&gt;
| 79.91&lt;br /&gt;
| 88.30&lt;br /&gt;
| 82.26&lt;br /&gt;
| 91.79&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 90.59&lt;br /&gt;
| 79.43&lt;br /&gt;
| 81.59&lt;br /&gt;
| 91.42&lt;br /&gt;
| 64.94&lt;br /&gt;
| 84.52&lt;br /&gt;
| 66.87&lt;br /&gt;
| 87.73&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 76.43&lt;br /&gt;
| 67.85&lt;br /&gt;
| 64.44&lt;br /&gt;
| 74.42&lt;br /&gt;
| 55.13&lt;br /&gt;
| 59.47&lt;br /&gt;
| 71.86&lt;br /&gt;
| 83.63&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_JPop ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 96.58&lt;br /&gt;
| 88.94&lt;br /&gt;
| 95.20&lt;br /&gt;
| 96.73&lt;br /&gt;
| 92.32&lt;br /&gt;
| 94.46&lt;br /&gt;
| 94.65&lt;br /&gt;
| 97.05&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 95.39&lt;br /&gt;
| 86.66&lt;br /&gt;
| 93.20&lt;br /&gt;
| 94.63&lt;br /&gt;
| 82.87&lt;br /&gt;
| 90.54&lt;br /&gt;
| 84.72&lt;br /&gt;
| 93.48&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 94.00&lt;br /&gt;
| 84.08&lt;br /&gt;
| 86.80&lt;br /&gt;
| 93.15&lt;br /&gt;
| 69.66&lt;br /&gt;
| 86.64&lt;br /&gt;
| 71.39&lt;br /&gt;
| 89.57&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 77.38&lt;br /&gt;
| 70.76&lt;br /&gt;
| 64.40&lt;br /&gt;
| 73.55&lt;br /&gt;
| 54.98&lt;br /&gt;
| 58.71&lt;br /&gt;
| 77.28&lt;br /&gt;
| 85.29&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Comparison with Previous MIREXes ==&lt;br /&gt;
&lt;br /&gt;
Since this year we have switched to using mir_eval for evaluation, some results may differ from those in previous MIREX editions due to differences in implementation. We confirm that the following metrics remain comparable with previous MIREX results:&lt;br /&gt;
&lt;br /&gt;
* Comparable: F1, Goto, CMLc, CMLt, AMLc, AMLt.&lt;br /&gt;
* Not comparable: Cemgil, P-score.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14811</id>
		<title>2025:Audio Beat Tracking Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14811"/>
		<updated>2025-09-13T05:55:40Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* GTZAN */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Submissions ==&lt;br /&gt;
&lt;br /&gt;
This page is still WIP. More submissions might appear later.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Submission&lt;br /&gt;
! Title&lt;br /&gt;
! PDF&lt;br /&gt;
! Authors&lt;br /&gt;
|-&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | Baseline: CD1&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | QM Tempo Tracker&lt;br /&gt;
| [https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Baseline: BeatThis&lt;br /&gt;
| Beat This! Accurate Beat Tracking Without DBN Postprocessing&lt;br /&gt;
| [https://github.com/CPJKU/beat_this Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| KG-ApolloBeats&lt;br /&gt;
| The 2025 KG Music Beats Tracking System&lt;br /&gt;
| TBA&lt;br /&gt;
| DingKun Xiao, Haijun Cai, Chuanyi Chen&lt;br /&gt;
|- &lt;br /&gt;
| Beat-U&lt;br /&gt;
| Beat-U: Multi-Task Music Understanding with Hierarchical Timescales&lt;br /&gt;
| TBA&lt;br /&gt;
| YAMAHA Corporation&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Test Sets ==&lt;br /&gt;
&lt;br /&gt;
* '''GTZAN''': 999 songs from the GTZAN dataset (starting from next year, training on GTZAN will be disallowed)&lt;br /&gt;
* '''SMC''': 217 songs from the SMC collection&lt;br /&gt;
* '''Yamaha_JPOP''': A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs.&lt;br /&gt;
* '''Yamaha_Balanced''': A private dataset annotated by Yamaha Corporation. The dataset contains 241 songs. While it is still biased towards JPOP songs, the dataset covers a wider range of genres: J.Pop (10.37%), Rock (10.37%), J.Enka (10.37%), J.Kayoukyoku (10.37%), Soundtrack (10.37%), Western Pop (10.37%), Children's Song (10.37%), R&amp;amp;B (6.22%), Hiphop (4.56%), Jazz (2.49%), Dance (2.49%), World (2.07%), Techno (1.24%), Easy listening (1.24%), J.Minyou (1.24%), Others (5.81%).&lt;br /&gt;
&lt;br /&gt;
== GTZAN ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Beat-U&lt;br /&gt;
| 84.93&lt;br /&gt;
| 68.23&lt;br /&gt;
| 64.06&lt;br /&gt;
| 84.27&lt;br /&gt;
| 71.07&lt;br /&gt;
| 75.26&lt;br /&gt;
| 78.78&lt;br /&gt;
| 84.27&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 92.53&lt;br /&gt;
| 80.66&lt;br /&gt;
| 79.38&lt;br /&gt;
| 93.55&lt;br /&gt;
| 83.44&lt;br /&gt;
| 88.49&lt;br /&gt;
| 86.86&lt;br /&gt;
| 92.77&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 89.02&lt;br /&gt;
| 80.15&lt;br /&gt;
| 72.27&lt;br /&gt;
| 88.00&lt;br /&gt;
| 76.00&lt;br /&gt;
| 79.64&lt;br /&gt;
| 84.63&lt;br /&gt;
| 90.01&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 81.19&lt;br /&gt;
| 69.64&lt;br /&gt;
| 62.06&lt;br /&gt;
| 79.97&lt;br /&gt;
| 65.02&lt;br /&gt;
| 66.94&lt;br /&gt;
| 83.08&lt;br /&gt;
| 86.69&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
The baseline BeatThis reports different results compared to the paper because it uses a different number of test songs (999 vs. 993).&lt;br /&gt;
&lt;br /&gt;
== SMC ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 53.14&lt;br /&gt;
| 40.67&lt;br /&gt;
| 14.75&lt;br /&gt;
| 63.52&lt;br /&gt;
| 27.24&lt;br /&gt;
| 41.16&lt;br /&gt;
| 30.88&lt;br /&gt;
| 47.44&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 74.15&lt;br /&gt;
| 57.07&lt;br /&gt;
| 32.72&lt;br /&gt;
| 84.51&lt;br /&gt;
| 53.73&lt;br /&gt;
| 72.66&lt;br /&gt;
| 55.96&lt;br /&gt;
| 76.22&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis*&lt;br /&gt;
| 71.81&lt;br /&gt;
| 55.64&lt;br /&gt;
| 27.19&lt;br /&gt;
| 82.91&lt;br /&gt;
| 49.78&lt;br /&gt;
| 69.89&lt;br /&gt;
| 51.15&lt;br /&gt;
| 72.30&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 33.66&lt;br /&gt;
| 26.29&lt;br /&gt;
| 6.91&lt;br /&gt;
| 45.10&lt;br /&gt;
| 9.88&lt;br /&gt;
| 13.12&lt;br /&gt;
| 17.99&lt;br /&gt;
| 29.48&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_Balanced ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 92.55&lt;br /&gt;
| 82.28&lt;br /&gt;
| 89.12&lt;br /&gt;
| 93.57&lt;br /&gt;
| 83.62&lt;br /&gt;
| 90.15&lt;br /&gt;
| 85.84&lt;br /&gt;
| 93.11&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 91.97&lt;br /&gt;
| 81.29&lt;br /&gt;
| 88.28&lt;br /&gt;
| 92.54&lt;br /&gt;
| 79.91&lt;br /&gt;
| 88.30&lt;br /&gt;
| 82.26&lt;br /&gt;
| 91.79&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 90.59&lt;br /&gt;
| 79.43&lt;br /&gt;
| 81.59&lt;br /&gt;
| 91.42&lt;br /&gt;
| 64.94&lt;br /&gt;
| 84.52&lt;br /&gt;
| 66.87&lt;br /&gt;
| 87.73&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 76.43&lt;br /&gt;
| 67.85&lt;br /&gt;
| 64.44&lt;br /&gt;
| 74.42&lt;br /&gt;
| 55.13&lt;br /&gt;
| 59.47&lt;br /&gt;
| 71.86&lt;br /&gt;
| 83.63&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_JPop ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 96.58&lt;br /&gt;
| 88.94&lt;br /&gt;
| 95.20&lt;br /&gt;
| 96.73&lt;br /&gt;
| 92.32&lt;br /&gt;
| 94.46&lt;br /&gt;
| 94.65&lt;br /&gt;
| 97.05&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 95.39&lt;br /&gt;
| 86.66&lt;br /&gt;
| 93.20&lt;br /&gt;
| 94.63&lt;br /&gt;
| 82.87&lt;br /&gt;
| 90.54&lt;br /&gt;
| 84.72&lt;br /&gt;
| 93.48&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 94.00&lt;br /&gt;
| 84.08&lt;br /&gt;
| 86.80&lt;br /&gt;
| 93.15&lt;br /&gt;
| 69.66&lt;br /&gt;
| 86.64&lt;br /&gt;
| 71.39&lt;br /&gt;
| 89.57&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 77.38&lt;br /&gt;
| 70.76&lt;br /&gt;
| 64.40&lt;br /&gt;
| 73.55&lt;br /&gt;
| 54.98&lt;br /&gt;
| 58.71&lt;br /&gt;
| 77.28&lt;br /&gt;
| 85.29&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Comparison with Previous MIREXes ==&lt;br /&gt;
&lt;br /&gt;
Since this year we have switched to using mir_eval for evaluation, some results may differ from those in previous MIREX editions due to differences in implementation. We confirm that the following metrics remain comparable with previous MIREX results:&lt;br /&gt;
&lt;br /&gt;
* Comparable: F1, Goto, CMLc, CMLt, AMLc, AMLt.&lt;br /&gt;
* Not comparable: Cemgil, P-score.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14810</id>
		<title>2025:Audio Beat Tracking Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14810"/>
		<updated>2025-09-13T05:55:17Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Submissions */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Submissions ==&lt;br /&gt;
&lt;br /&gt;
This page is still WIP. More submissions might appear later.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Submission&lt;br /&gt;
! Title&lt;br /&gt;
! PDF&lt;br /&gt;
! Authors&lt;br /&gt;
|-&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | Baseline: CD1&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | QM Tempo Tracker&lt;br /&gt;
| [https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Baseline: BeatThis&lt;br /&gt;
| Beat This! Accurate Beat Tracking Without DBN Postprocessing&lt;br /&gt;
| [https://github.com/CPJKU/beat_this Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| KG-ApolloBeats&lt;br /&gt;
| The 2025 KG Music Beats Tracking System&lt;br /&gt;
| TBA&lt;br /&gt;
| DingKun Xiao, Haijun Cai, Chuanyi Chen&lt;br /&gt;
|- &lt;br /&gt;
| Beat-U&lt;br /&gt;
| Beat-U: Multi-Task Music Understanding with Hierarchical Timescales&lt;br /&gt;
| TBA&lt;br /&gt;
| YAMAHA Corporation&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Test Sets ==&lt;br /&gt;
&lt;br /&gt;
* '''GTZAN''': 999 songs from the GTZAN dataset (starting from next year, training on GTZAN will be disallowed)&lt;br /&gt;
* '''SMC''': 217 songs from the SMC collection&lt;br /&gt;
* '''Yamaha_JPOP''': A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs.&lt;br /&gt;
* '''Yamaha_Balanced''': A private dataset annotated by Yamaha Corporation. The dataset contains 241 songs. While it is still biased towards JPOP songs, the dataset covers a wider range of genres: J.Pop (10.37%), Rock (10.37%), J.Enka (10.37%), J.Kayoukyoku (10.37%), Soundtrack (10.37%), Western Pop (10.37%), Children's Song (10.37%), R&amp;amp;B (6.22%), Hiphop (4.56%), Jazz (2.49%), Dance (2.49%), World (2.07%), Techno (1.24%), Easy listening (1.24%), J.Minyou (1.24%), Others (5.81%).&lt;br /&gt;
&lt;br /&gt;
== GTZAN ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 84.93&lt;br /&gt;
| 68.23&lt;br /&gt;
| 64.06&lt;br /&gt;
| 84.27&lt;br /&gt;
| 71.07&lt;br /&gt;
| 75.26&lt;br /&gt;
| 78.78&lt;br /&gt;
| 84.27&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 92.53&lt;br /&gt;
| 80.66&lt;br /&gt;
| 79.38&lt;br /&gt;
| 93.55&lt;br /&gt;
| 83.44&lt;br /&gt;
| 88.49&lt;br /&gt;
| 86.86&lt;br /&gt;
| 92.77&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 89.02&lt;br /&gt;
| 80.15&lt;br /&gt;
| 72.27&lt;br /&gt;
| 88.00&lt;br /&gt;
| 76.00&lt;br /&gt;
| 79.64&lt;br /&gt;
| 84.63&lt;br /&gt;
| 90.01&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 81.19&lt;br /&gt;
| 69.64&lt;br /&gt;
| 62.06&lt;br /&gt;
| 79.97&lt;br /&gt;
| 65.02&lt;br /&gt;
| 66.94&lt;br /&gt;
| 83.08&lt;br /&gt;
| 86.69&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
The baseline BeatThis reports different results compared to the paper because it uses a different number of test songs (999 vs. 993).&lt;br /&gt;
&lt;br /&gt;
== SMC ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 53.14&lt;br /&gt;
| 40.67&lt;br /&gt;
| 14.75&lt;br /&gt;
| 63.52&lt;br /&gt;
| 27.24&lt;br /&gt;
| 41.16&lt;br /&gt;
| 30.88&lt;br /&gt;
| 47.44&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 74.15&lt;br /&gt;
| 57.07&lt;br /&gt;
| 32.72&lt;br /&gt;
| 84.51&lt;br /&gt;
| 53.73&lt;br /&gt;
| 72.66&lt;br /&gt;
| 55.96&lt;br /&gt;
| 76.22&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis*&lt;br /&gt;
| 71.81&lt;br /&gt;
| 55.64&lt;br /&gt;
| 27.19&lt;br /&gt;
| 82.91&lt;br /&gt;
| 49.78&lt;br /&gt;
| 69.89&lt;br /&gt;
| 51.15&lt;br /&gt;
| 72.30&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 33.66&lt;br /&gt;
| 26.29&lt;br /&gt;
| 6.91&lt;br /&gt;
| 45.10&lt;br /&gt;
| 9.88&lt;br /&gt;
| 13.12&lt;br /&gt;
| 17.99&lt;br /&gt;
| 29.48&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_Balanced ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 92.55&lt;br /&gt;
| 82.28&lt;br /&gt;
| 89.12&lt;br /&gt;
| 93.57&lt;br /&gt;
| 83.62&lt;br /&gt;
| 90.15&lt;br /&gt;
| 85.84&lt;br /&gt;
| 93.11&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 91.97&lt;br /&gt;
| 81.29&lt;br /&gt;
| 88.28&lt;br /&gt;
| 92.54&lt;br /&gt;
| 79.91&lt;br /&gt;
| 88.30&lt;br /&gt;
| 82.26&lt;br /&gt;
| 91.79&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 90.59&lt;br /&gt;
| 79.43&lt;br /&gt;
| 81.59&lt;br /&gt;
| 91.42&lt;br /&gt;
| 64.94&lt;br /&gt;
| 84.52&lt;br /&gt;
| 66.87&lt;br /&gt;
| 87.73&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 76.43&lt;br /&gt;
| 67.85&lt;br /&gt;
| 64.44&lt;br /&gt;
| 74.42&lt;br /&gt;
| 55.13&lt;br /&gt;
| 59.47&lt;br /&gt;
| 71.86&lt;br /&gt;
| 83.63&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_JPop ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 96.58&lt;br /&gt;
| 88.94&lt;br /&gt;
| 95.20&lt;br /&gt;
| 96.73&lt;br /&gt;
| 92.32&lt;br /&gt;
| 94.46&lt;br /&gt;
| 94.65&lt;br /&gt;
| 97.05&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 95.39&lt;br /&gt;
| 86.66&lt;br /&gt;
| 93.20&lt;br /&gt;
| 94.63&lt;br /&gt;
| 82.87&lt;br /&gt;
| 90.54&lt;br /&gt;
| 84.72&lt;br /&gt;
| 93.48&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 94.00&lt;br /&gt;
| 84.08&lt;br /&gt;
| 86.80&lt;br /&gt;
| 93.15&lt;br /&gt;
| 69.66&lt;br /&gt;
| 86.64&lt;br /&gt;
| 71.39&lt;br /&gt;
| 89.57&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 77.38&lt;br /&gt;
| 70.76&lt;br /&gt;
| 64.40&lt;br /&gt;
| 73.55&lt;br /&gt;
| 54.98&lt;br /&gt;
| 58.71&lt;br /&gt;
| 77.28&lt;br /&gt;
| 85.29&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Comparison with Previous MIREXes ==&lt;br /&gt;
&lt;br /&gt;
Since this year we have switched to using mir_eval for evaluation, some results may differ from those in previous MIREX editions due to differences in implementation. We confirm that the following metrics remain comparable with previous MIREX results:&lt;br /&gt;
&lt;br /&gt;
* Comparable: F1, Goto, CMLc, CMLt, AMLc, AMLt.&lt;br /&gt;
* Not comparable: Cemgil, P-score.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14809</id>
		<title>2025:Audio Beat Tracking Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Audio_Beat_Tracking_Results&amp;diff=14809"/>
		<updated>2025-09-13T05:54:20Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Submissions */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Submissions ==&lt;br /&gt;
&lt;br /&gt;
This page is still WIP. More submissions might appear later.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|-&lt;br /&gt;
! Submission&lt;br /&gt;
! Title&lt;br /&gt;
! PDF&lt;br /&gt;
! Authors&lt;br /&gt;
|-&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | Baseline: CD1&lt;br /&gt;
| style=&amp;quot;vertical-align:bottom; background-color:#F8F9FA; color:#222;&amp;quot; | QM Tempo Tracker&lt;br /&gt;
| [https://vamp-plugins.org/plugin-doc/qm-vamp-plugins.html Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Baseline: BeatThis&lt;br /&gt;
| Beat This! Accurate Beat Tracking Without DBN Postprocessing&lt;br /&gt;
| [https://github.com/CPJKU/beat_this Link]&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| KG-ApolloBeats&lt;br /&gt;
| The 2025 KG Music Beats Tracking System&lt;br /&gt;
| TBA&lt;br /&gt;
| DingKun Xiao, Haijun Cai, Chuanyi Chen&lt;br /&gt;
|- &lt;br /&gt;
| BeatU&lt;br /&gt;
| BeatU&lt;br /&gt;
| TBA&lt;br /&gt;
| YAMAHA Corporation&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Test Sets ==&lt;br /&gt;
&lt;br /&gt;
* '''GTZAN''': 999 songs from the GTZAN dataset (starting from next year, training on GTZAN will be disallowed)&lt;br /&gt;
* '''SMC''': 217 songs from the SMC collection&lt;br /&gt;
* '''Yamaha_JPOP''': A private dataset annotated by Yamaha Corporation. The dataset contains 200 JPOP songs.&lt;br /&gt;
* '''Yamaha_Balanced''': A private dataset annotated by Yamaha Corporation. The dataset contains 241 songs. While it is still biased towards JPOP songs, the dataset covers a wider range of genres: J.Pop (10.37%), Rock (10.37%), J.Enka (10.37%), J.Kayoukyoku (10.37%), Soundtrack (10.37%), Western Pop (10.37%), Children's Song (10.37%), R&amp;amp;B (6.22%), Hiphop (4.56%), Jazz (2.49%), Dance (2.49%), World (2.07%), Techno (1.24%), Easy listening (1.24%), J.Minyou (1.24%), Others (5.81%).&lt;br /&gt;
&lt;br /&gt;
== GTZAN ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 84.93&lt;br /&gt;
| 68.23&lt;br /&gt;
| 64.06&lt;br /&gt;
| 84.27&lt;br /&gt;
| 71.07&lt;br /&gt;
| 75.26&lt;br /&gt;
| 78.78&lt;br /&gt;
| 84.27&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 92.53&lt;br /&gt;
| 80.66&lt;br /&gt;
| 79.38&lt;br /&gt;
| 93.55&lt;br /&gt;
| 83.44&lt;br /&gt;
| 88.49&lt;br /&gt;
| 86.86&lt;br /&gt;
| 92.77&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 89.02&lt;br /&gt;
| 80.15&lt;br /&gt;
| 72.27&lt;br /&gt;
| 88.00&lt;br /&gt;
| 76.00&lt;br /&gt;
| 79.64&lt;br /&gt;
| 84.63&lt;br /&gt;
| 90.01&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 81.19&lt;br /&gt;
| 69.64&lt;br /&gt;
| 62.06&lt;br /&gt;
| 79.97&lt;br /&gt;
| 65.02&lt;br /&gt;
| 66.94&lt;br /&gt;
| 83.08&lt;br /&gt;
| 86.69&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
The baseline BeatThis reports different results compared to the paper because it uses a different number of test songs (999 vs. 993).&lt;br /&gt;
&lt;br /&gt;
== SMC ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 53.14&lt;br /&gt;
| 40.67&lt;br /&gt;
| 14.75&lt;br /&gt;
| 63.52&lt;br /&gt;
| 27.24&lt;br /&gt;
| 41.16&lt;br /&gt;
| 30.88&lt;br /&gt;
| 47.44&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats*&lt;br /&gt;
| 74.15&lt;br /&gt;
| 57.07&lt;br /&gt;
| 32.72&lt;br /&gt;
| 84.51&lt;br /&gt;
| 53.73&lt;br /&gt;
| 72.66&lt;br /&gt;
| 55.96&lt;br /&gt;
| 76.22&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis*&lt;br /&gt;
| 71.81&lt;br /&gt;
| 55.64&lt;br /&gt;
| 27.19&lt;br /&gt;
| 82.91&lt;br /&gt;
| 49.78&lt;br /&gt;
| 69.89&lt;br /&gt;
| 51.15&lt;br /&gt;
| 72.30&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 33.66&lt;br /&gt;
| 26.29&lt;br /&gt;
| 6.91&lt;br /&gt;
| 45.10&lt;br /&gt;
| 9.88&lt;br /&gt;
| 13.12&lt;br /&gt;
| 17.99&lt;br /&gt;
| 29.48&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Entries with [*] are trained on this dataset.&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_Balanced ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 92.55&lt;br /&gt;
| 82.28&lt;br /&gt;
| 89.12&lt;br /&gt;
| 93.57&lt;br /&gt;
| 83.62&lt;br /&gt;
| 90.15&lt;br /&gt;
| 85.84&lt;br /&gt;
| 93.11&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 91.97&lt;br /&gt;
| 81.29&lt;br /&gt;
| 88.28&lt;br /&gt;
| 92.54&lt;br /&gt;
| 79.91&lt;br /&gt;
| 88.30&lt;br /&gt;
| 82.26&lt;br /&gt;
| 91.79&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 90.59&lt;br /&gt;
| 79.43&lt;br /&gt;
| 81.59&lt;br /&gt;
| 91.42&lt;br /&gt;
| 64.94&lt;br /&gt;
| 84.52&lt;br /&gt;
| 66.87&lt;br /&gt;
| 87.73&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 76.43&lt;br /&gt;
| 67.85&lt;br /&gt;
| 64.44&lt;br /&gt;
| 74.42&lt;br /&gt;
| 55.13&lt;br /&gt;
| 59.47&lt;br /&gt;
| 71.86&lt;br /&gt;
| 83.63&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== YAMAHA_JPop ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:right;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; text-align:left;&amp;quot;&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Group&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | F1&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Cemgil&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | Goto&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | P-score&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | CMLt&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLc&lt;br /&gt;
! style=&amp;quot;vertical-align:bottom;&amp;quot; | AMLt&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | BeatU&lt;br /&gt;
| 96.58&lt;br /&gt;
| 88.94&lt;br /&gt;
| 95.20&lt;br /&gt;
| 96.73&lt;br /&gt;
| 92.32&lt;br /&gt;
| 94.46&lt;br /&gt;
| 94.65&lt;br /&gt;
| 97.05&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | KG-ApolloBeats&lt;br /&gt;
| 95.39&lt;br /&gt;
| 86.66&lt;br /&gt;
| 93.20&lt;br /&gt;
| 94.63&lt;br /&gt;
| 82.87&lt;br /&gt;
| 90.54&lt;br /&gt;
| 84.72&lt;br /&gt;
| 93.48&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: BeatThis&lt;br /&gt;
| 94.00&lt;br /&gt;
| 84.08&lt;br /&gt;
| 86.80&lt;br /&gt;
| 93.15&lt;br /&gt;
| 69.66&lt;br /&gt;
| 86.64&lt;br /&gt;
| 71.39&lt;br /&gt;
| 89.57&lt;br /&gt;
|- style=&amp;quot;vertical-align:bottom;&amp;quot;&lt;br /&gt;
| style=&amp;quot;text-align:left;&amp;quot; | Baseline: CD1&lt;br /&gt;
| 77.38&lt;br /&gt;
| 70.76&lt;br /&gt;
| 64.40&lt;br /&gt;
| 73.55&lt;br /&gt;
| 54.98&lt;br /&gt;
| 58.71&lt;br /&gt;
| 77.28&lt;br /&gt;
| 85.29&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Comparison with Previous MIREXes ==&lt;br /&gt;
&lt;br /&gt;
Since this year we have switched to using mir_eval for evaluation, some results may differ from those in previous MIREX editions due to differences in implementation. We confirm that the following metrics remain comparable with previous MIREX results:&lt;br /&gt;
&lt;br /&gt;
* Comparable: F1, Goto, CMLc, CMLt, AMLc, AMLt.&lt;br /&gt;
* Not comparable: Cemgil, P-score.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14808</id>
		<title>2025:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14808"/>
		<updated>2025-09-13T05:46:04Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| RWKV&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| Hierarchical Transformer&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2306.00110]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2306.08620]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Structure ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| 3.57 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.58 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.26 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.50 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| 2.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.37 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.85 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.48 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| 3.11 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.07 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.08 ± 0.09&amp;lt;sup&amp;gt;ab&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.95 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| '''3.70 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.69 ± 0.09&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.30 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.45 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Evaluation Results'''&lt;br /&gt;
&lt;br /&gt;
Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test with Holm-Bonferroni correction.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Baseline Models'''&lt;br /&gt;
&lt;br /&gt;
For MuseCoco, we use the ''xlarge'' model with 1.2 billion learnable parameters. For Anticipatory Music Transformer, we use the ''Large'' model with 780M learnable parameters.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details'''&lt;br /&gt;
&lt;br /&gt;
A double-blind online survey was conducted to test music quality. Each model was anonymised, and for each test prompt, a sample was cherry-picked from 8 generated candidates. A total of 8 prompts of varied styles (pop, classical, and jazzy) were tested, resulting in an 8-page survey. The page order and the sample order within each page were both randomised. &lt;br /&gt;
&lt;br /&gt;
Responses were collected from 20 participants with diverse music backgrounds. 14 participants completed all 8 pages with an average completion time of 32 minutes.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14807</id>
		<title>2025:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14807"/>
		<updated>2025-09-13T05:45:20Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| RWKV&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| Hierarchical Transformer&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2306.00110]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2306.08620]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Structure ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| 3.57 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.58 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.26 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.50 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| 2.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.37 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.85 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.48 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| 3.11 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.07 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.08 ± 0.09&amp;lt;sup&amp;gt;ab&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.95 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| '''3.70 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.69 ± 0.09&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.30 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.45 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Evaluation Results'''&lt;br /&gt;
&lt;br /&gt;
Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test with Holm-Bonferroni correction.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Baseline Models'''&lt;br /&gt;
&lt;br /&gt;
For MuseCoco, we use the ''xlarge'' model variant with 1.2 billion learnable parameters. For Anticipatory Music Transformer, we use the ''Large'' model variant with 780M learnable parameters.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details'''&lt;br /&gt;
&lt;br /&gt;
A double-blind online survey was conducted to test music quality. Each model was anonymised, and for each test prompt, a sample was cherry-picked from 8 generated candidates. A total of 8 prompts of varied styles (pop, classical, and jazzy) were tested, resulting in an 8-page survey. The page order and the sample order within each page were both randomised. &lt;br /&gt;
&lt;br /&gt;
Responses were collected from 20 participants with diverse music backgrounds. 14 participants completed all 8 pages with an average completion time of 32 minutes.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14806</id>
		<title>2025:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14806"/>
		<updated>2025-09-13T05:44:26Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| RWKV&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| Hierarchical Transformer&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2306.00110]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2306.08620]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Structure ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| 3.57 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.58 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.26 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.50 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| 2.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.37 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.85 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.48 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| 3.11 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.07 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.08 ± 0.09&amp;lt;sup&amp;gt;ab&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.95 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| '''3.70 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.69 ± 0.09&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.30 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.45 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Evaluation Results'''&lt;br /&gt;
&lt;br /&gt;
Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test with Holm-Bonferroni correction.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Baseline Models'''&lt;br /&gt;
&lt;br /&gt;
For MuseCoco, we use the 'xlarge' model variant with 1.2 billion learnable parameters. For Anticipatory Music Transformer, we use the 'Large' model variant with 780M learnable parameters.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details'''&lt;br /&gt;
&lt;br /&gt;
A double-blind online survey was conducted to test music quality. Each model was anonymised, and for each test prompt, a sample was cherry-picked from 8 generated candidates. A total of 8 prompts of varied styles (pop, classical, and jazzy) were tested, resulting in an 8-page survey. The page order and the sample order within each page were both randomised. &lt;br /&gt;
&lt;br /&gt;
Responses were collected from 20 participants with diverse music backgrounds. 14 participants completed all 8 pages with an average completion time of 32 minutes.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14805</id>
		<title>2025:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14805"/>
		<updated>2025-09-13T05:43:47Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| RWKV&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| Hierarchical Transformer&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2306.00110]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2306.08620]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Structure ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| 3.57 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.58 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.26 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.50 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| 2.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.37 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.85 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.48 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| 3.11 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.07 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.08 ± 0.09&amp;lt;sup&amp;gt;ab&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.95 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| '''3.70 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.69 ± 0.09&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.30 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.45 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Evaluation Results'''&lt;br /&gt;
&lt;br /&gt;
Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test with Holm-Bonferroni correction.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Baseline Models'''&lt;br /&gt;
&lt;br /&gt;
For MuseCoco, we use the *xlarge* model variant with 1.2 billion learnable parameters. For Anticipatory Music Transformer, we use the *Large* model variant with 780M learnable parameters.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details'''&lt;br /&gt;
&lt;br /&gt;
A double-blind online survey was conducted to test music quality. Each model was anonymised, and for each test prompt, a sample was cherry-picked from 8 generated candidates. A total of 8 prompts of varied styles (pop, classical, and jazzy) were tested, resulting in an 8-page survey. The page order and the sample order within each page were both randomised. &lt;br /&gt;
&lt;br /&gt;
Responses were collected from 20 participants with diverse music backgrounds. 14 participants completed all 8 pages with an average completion time of 32 minutes.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14804</id>
		<title>2025:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14804"/>
		<updated>2025-09-13T05:43:23Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| RWKV&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| Hierarchical Transformer&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2306.00110]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2306.08620]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Structure ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| 3.57 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.58 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.26 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.50 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| 2.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.37 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.85 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.48 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| 3.11 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.07 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.08 ± 0.09&amp;lt;sup&amp;gt;ab&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.95 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| '''3.70 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.69 ± 0.09&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.30 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.45 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Evaluation Results'''&lt;br /&gt;
&lt;br /&gt;
Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test with Holm-Bonferroni correction.&lt;br /&gt;
&lt;br /&gt;
'''Baseline Models'''&lt;br /&gt;
&lt;br /&gt;
For MuseCoco, we use the *xlarge* model variant with 1.2 billion learnable parameters. For Anticipatory Music Transformer, we use the *Large* model variant with 780M learnable parameters.&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details'''&lt;br /&gt;
&lt;br /&gt;
A double-blind online survey was conducted to test music quality. Each model was anonymised, and for each test prompt, a sample was cherry-picked from 8 generated candidates. A total of 8 prompts of varied styles (pop, classical, and jazzy) were tested, resulting in an 8-page survey. The page order and the sample order within each page were both randomised. &lt;br /&gt;
&lt;br /&gt;
Responses were collected from 20 participants with diverse music backgrounds. 14 participants completed all 8 pages with an average completion time of 32 minutes.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14803</id>
		<title>2025:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14803"/>
		<updated>2025-09-13T05:22:05Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| RWKV&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| Hierarchical Transformer&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2306.00110]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2306.08620]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Structure ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| 3.57 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.58 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.26 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.50 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| 2.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.37 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.85 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.48 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| 3.11 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.07 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.08 ± 0.09&amp;lt;sup&amp;gt;ab&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.95 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| '''3.70 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.69 ± 0.09&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.30 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.45 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Notes on Evaluation Results'''&lt;br /&gt;
&lt;br /&gt;
Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test with Holm-Bonferroni correction.&lt;br /&gt;
&lt;br /&gt;
'''Notes on Baseline Models'''&lt;br /&gt;
&lt;br /&gt;
For MuseCoco, we use the *xlarge* model variant with 1.2 billion learnable parameters. For Anticipatory Music Transformer, we use the *Large* model variant with 780M learnable parameters.&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details''': Each test sample was cherry-picked from 8 samples generated from the corresponding prompt. A total of 6 prompts of varied styles (Pop, Classical, and Jazz) were tested, resulting in a 6-page survey. Responses were collected from 20 participants with diverse music backgrounds.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14802</id>
		<title>2025:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14802"/>
		<updated>2025-09-13T05:21:24Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| RWKV&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| Hierarchical Transformer&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2306.00110]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2306.08620]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Structure ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| 3.57 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.58 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.26 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.50 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| 2.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.37 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.85 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.48 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| 3.11 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.07 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.08 ± 0.09&amp;lt;sup&amp;gt;ab&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.95 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| '''3.70 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.69 ± 0.09&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.30 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.45 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Notes on Evaluation Results''': Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test with Holm-Bonferroni correction.&lt;br /&gt;
&lt;br /&gt;
'''Notes on Baseline Models''': For MuseCoco, we use the *xlarge* model variant with 1.2 billion learnable parameters. For Anticipatory Music Transformer, we use the *Large* model variant with 780M learnable parameters.&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details''': Each test sample was cherry-picked from 8 samples generated from the corresponding prompt. A total of 6 prompts of varied styles (Pop, Classical, and Jazz) were tested, resulting in a 6-page survey. Responses were collected from 20 participants with diverse music backgrounds.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14792</id>
		<title>2025:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14792"/>
		<updated>2025-09-12T03:28:34Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| RWKV&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| Hierarchical Transformer&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2306.00110]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2306.08620]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Structure ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| 3.57 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.58 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.26 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.50 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| 2.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.37 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.85 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.48 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| 3.11 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.07 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.08 ± 0.09&amp;lt;sup&amp;gt;ab&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.95 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| '''3.70 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.69 ± 0.09&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.30 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.45 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
'''Note''': Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test with Holm-Bonferroni correction.&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details''': Each test sample was cherry-picked from 8 samples generated from the corresponding prompt. A total of 6 prompts of varied styles (Pop, Classical, and Jazz) were tested, resulting in a 6-page survey. Responses were collected from 20 participants with diverse music backgrounds.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14791</id>
		<title>2025:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2025:Symbolic_Music_Generation_Results&amp;diff=14791"/>
		<updated>2025-09-12T03:21:42Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Results */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| RWKV&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| [https://www.music-ir.org/mirex/wiki/MIREX_HOME]&lt;br /&gt;
| Hierarchical Transformer&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2306.00110]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2306.08620]&lt;br /&gt;
| Transformer&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Structure ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
|-&lt;br /&gt;
| RWKV (Zhou-Zheng et al.)&lt;br /&gt;
| 3.57 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.58 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.26 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.5 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| PixelGen&lt;br /&gt;
| 2.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.37 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.85 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.48 ± 0.09&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MuseCoco (BL-1)&lt;br /&gt;
| 3.11 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.07 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.08 ± 0.09&amp;lt;sup&amp;gt;ab&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.95 ± 0.09&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Anticipatory Music Transformer (BL-2)&lt;br /&gt;
| '''3.70 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.69 ± 0.09&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.30 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.45 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
'''Note''': Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test with Holm-Bonferroni correction.&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details''': One piece cherry-picked from 8 samples of each test piece, resulting in 6 pages of questions. We collect responses from 22 participants (18 complete submissions and 4 partial submissions). For complete submissions, the average completion time is 16min 59s.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation_Results&amp;diff=14018</id>
		<title>2024:Symbolic Music Generation Results</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation_Results&amp;diff=14018"/>
		<updated>2024-11-12T08:28:49Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Submissions */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
= Submissions =&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; &lt;br /&gt;
|- style=&amp;quot;font-weight:bold;&amp;quot;&lt;br /&gt;
! Team&lt;br /&gt;
! Extended Abstract&lt;br /&gt;
! Methods&lt;br /&gt;
! Methodology&lt;br /&gt;
|-&lt;br /&gt;
| Chart-Accompaniment&lt;br /&gt;
| [https://futuremirex.com/portal/wp-content/uploads/2024/11/chart_accomp_2024_ISMIR_LBD.pdf PDF]&lt;br /&gt;
| BART&lt;br /&gt;
| A BART model leveraging pre-trained Transformer encoders for piano accompaniment generation.&lt;br /&gt;
|-&lt;br /&gt;
| AccoMontage (BL-1)&lt;br /&gt;
| [https://arxiv.org/abs/2108.11213 PDF]&lt;br /&gt;
| Style Transfer&lt;br /&gt;
| A hybrid algorithm generating piano accompaniments by rule-based search and music representation learning.&lt;br /&gt;
|-&lt;br /&gt;
| Whole-Song-Gen (BL-2)&lt;br /&gt;
| [https://arxiv.org/abs/2405.09901 PDF]&lt;br /&gt;
| DDPM&lt;br /&gt;
| A denoising diffusion probabilistic model (DDPM) generating piano accompaniments as piano-roll images&lt;br /&gt;
|-&lt;br /&gt;
| Compose-&amp;amp;-Embesslish (BL-3)&lt;br /&gt;
|  [https://arxiv.org/abs/2209.08212 PDF]&lt;br /&gt;
| Transformer&lt;br /&gt;
| A Transformer-based architecture generating piano performances in beat-based event sequences.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Results=&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align:center;&amp;quot;&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
! rowspan=&amp;quot;2&amp;quot; | Team&lt;br /&gt;
! colspan=&amp;quot;4&amp;quot; | Subjective Evaluation&lt;br /&gt;
! Objective Evaluation&lt;br /&gt;
|- style=&amp;quot;font-weight:bold; vertical-align:center;&amp;quot;&lt;br /&gt;
| Coherecy ↑&lt;br /&gt;
| Naturalness ↑&lt;br /&gt;
| Creativity ↑&lt;br /&gt;
| Musicality ↑&lt;br /&gt;
| NLL ↓&lt;br /&gt;
|-&lt;br /&gt;
| Chart-Accompaniment&lt;br /&gt;
| 1.92 ± 0.11&amp;lt;sup&amp;gt;d&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 1.87 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.62 ± 0.13&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.01 ± 0.11&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 4.12 ± 0.12&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| AccoMontage (BL-1)&lt;br /&gt;
| '''3.77 ± 0.11&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.59 ± 0.11&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.65 ± 0.11&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''3.63 ± 0.12&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| '''2.48 ± 0.07&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
|-&lt;br /&gt;
| Whole-Song-Gen (BL-2)&lt;br /&gt;
| 3.59 ± 0.11&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.24 ± 0.11&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| '''3.66 ± 0.10&amp;lt;sup&amp;gt;a&amp;lt;/sup&amp;gt;'''&lt;br /&gt;
| 3.47 ± 0.13&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 2.87 ± 0.08&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Compose-&amp;amp;-Embesslish (BL-3)&lt;br /&gt;
| 3.39 ± 0.10&amp;lt;sup&amp;gt;c&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.38 ± 0.12&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.13 ± 0.10&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 3.36 ± 0.11&amp;lt;sup&amp;gt;b&amp;lt;/sup&amp;gt;&lt;br /&gt;
| 7.41 ± 0.07&amp;lt;sup&amp;gt;d&amp;lt;/sup&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
'''Note''': Results are reported in the form of mean ± sem&amp;lt;sup&amp;gt;s&amp;lt;/sup&amp;gt; (sem refers to standard error of mean), where s is a letter. Different letters within a column indicate significant differences (p-value p &amp;lt; 0.05) based on a Wilcoxon signed rank test.&lt;br /&gt;
&lt;br /&gt;
'''Objective Evaluation Details''': Each model generates 16 samples for each of 6 test pieces. Negative Log Likelihood (NLL) is computed by inputing the molody and accompaniment into the MuseCoco 1B model.&lt;br /&gt;
&lt;br /&gt;
'''Subjective Evaluation Details''': One piece cherry-picked from 16 samples of each test piece, resulting in 6 pages of questions. We collect responses from 22 participants (18 complete submissions and 4 partial submissions). For complete submissions, the average completion time is 16min 59s.&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13907</id>
		<title>2024:Symbolic Music Generation</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13907"/>
		<updated>2024-10-01T02:10:14Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Description=&lt;br /&gt;
Symbolic music generation is a broad topic. It covers a wide range of tasks, including generation, harmonization, arrangement, instrumentation, and more. We have multiple ways to represent music data, and the evaluation metrics also vary. To define a MIREX challenge within this topic, we need to narrow our focus to specific subtasks that are both relevant to the community and feasible to evaluate effectively.&lt;br /&gt;
&lt;br /&gt;
This year, we select the task to be '''piano accompaniment arrangement from a lead sheet'''. The lead sheet provides information about the melody, and chord progression. The goal is to generate a piano accompaniment that complements the lead melody. The music data consists of 8-measure segments in 4/4 meter, quantized to a sixteenth-note resolution. A more detailed description of the data structure is provided in the data format section. The genre of the lead sheets is broadly within western pop music (refer to the music examples for more detail).&lt;br /&gt;
&lt;br /&gt;
=Data Format=&lt;br /&gt;
The input lead sheet consists of 8 bars for the melody and harmony, with an additional mandatory pickup measure (left blank if not used). The data is prepared in JSON format containing two properties: &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;: a list of chords. Each chord contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The output generation should also follow the JSON format containing one property &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt; attributes.''' &lt;br /&gt;
&lt;br /&gt;
# The data is assumed to be in 4/4 meter, quantized to a sixteenth-note resolution. For both melody and chords, onsets and durations are counted in sixteenth notes. &lt;br /&gt;
# Both onsets and durations are integers ranging from 0 to 9 * 16 - 1 = 143. Notes that end later than the ninth measure (i.e., 9 * 16 = 144th time step) will be truncated to the end of the ninth measure. &lt;br /&gt;
# Melody notes are not allowed to overlap with one another. &lt;br /&gt;
# There should be no gaps or overlaps between chords. Chords must follow one another directly. If there is a blank space where no chord is played, it must be filled with the &amp;lt;code&amp;gt;N&amp;lt;/code&amp;gt; chord. &lt;br /&gt;
# The accompaniment of the pick-up measure should be blank. &lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The pitch property of a note should be integers ranging from 0 to 127, corresponding to the MIDI pitch numbers.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the chord &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The symbol property of a chord should be a string based on the syntax of (Harte, 2010). In other words, each chord string should be able to be passed as a parameter to mir_eval.chord.encode() without causing an error.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Data Example=&lt;br /&gt;
Below is an example of the input lead sheet in the format given above. The lead sheet is the melody of the first phrase of ''Hey Jude'' by The Beatles.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;melody&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 12, &amp;quot;pitch&amp;quot;: 72, &amp;quot;duration&amp;quot;: 4},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 69, &amp;quot;duration&amp;quot;: 8},&lt;br /&gt;
    ...&lt;br /&gt;
  ],&lt;br /&gt;
  &amp;quot;chords&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 0, &amp;quot;symbol&amp;quot;: &amp;quot;N&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;symbol&amp;quot;: &amp;quot;F&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is an example of the generated accompaniment. The accompaniment is generated using the baseline method WholeSongGen introduced below. Note that the generation starts from the second measure (time step 16).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;acc&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 41, &amp;quot;duration&amp;quot;: 12},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 65, &amp;quot;duration&amp;quot;: 5},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Full data examples can be accessed in [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main/generation_samples this code repository]. MIDI conversion code and MIDI demos are also provided there.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Evaluation and Competition Format=&lt;br /&gt;
We will evaluate the submitted algorithms through an online subjective double-blind test. The evaluation format differs from conventional tasks in the following aspects:&lt;br /&gt;
* '''We use a &amp;quot;''potluck''&amp;quot; test set. Before submitting the algorithm, each team is required to submit two lead sheets.''' The organizer team will supplement the lead sheet if necessary. &lt;br /&gt;
* There will be '''no live ranking''' because the subjective test will be done after the algorithm submission deadline.&lt;br /&gt;
* To better handle randomness in the generation algorithm, we '''allow cherry-picking from a fixed number of generated samples'''.   &lt;br /&gt;
* We hope to compute some objective measurements as well, but these will only be reported as a reference.&lt;br /&gt;
&lt;br /&gt;
==Subjective Evaluation Format==&lt;br /&gt;
* After each team submits the algorithm, the organizer team will use the algorithm to generate '''16 arrangements''' for each test sample. The generated results will be returned to each team for cherry-picking.&lt;br /&gt;
* Only a subset of the test set will be used for subjective evaluation.&lt;br /&gt;
* In the subjective evaluation, we will first ask the subjects to listen to the lead melody with chords and then listen to the generated samples in random order. The order of the samples will be randomized.&lt;br /&gt;
* The subject will be asked to rate each arrangement based on the following criteria:&lt;br /&gt;
:* Harmony correctness (5-point scale)&lt;br /&gt;
:* Creativity (5-point scale)&lt;br /&gt;
:* Naturalness (5-point scale)&lt;br /&gt;
:* Overall musicality (5-point scale)&lt;br /&gt;
&lt;br /&gt;
==Objective Measurements==&lt;br /&gt;
* We will use objective measurements only as a reference. The correlation between subjective and objective scores will be measured as a reference. &lt;br /&gt;
* The current plan is to compute the Negative Log Likelihood of a large music language model (e.g., Lu et al., 2023).&lt;br /&gt;
* We welcome proposals of the objective measurements.&lt;br /&gt;
&lt;br /&gt;
==Important Dates==&lt;br /&gt;
&lt;br /&gt;
* '''Oct 8, 2024''': Submit two lead sheets as a part of the test set. &lt;br /&gt;
* '''Oct 15, 2024''': Submit the main algorithm.&lt;br /&gt;
* '''Oct 22, 2024''': Return the generated samples. The cherry-picking phase begins.&lt;br /&gt;
* '''Oct 25, 2024''': Submit the cherry-picked sample ids.&lt;br /&gt;
* '''Oct 31 - Nov 3, 2024''': Online subjective evaluation.&lt;br /&gt;
* '''Nov 5, 2024''': Announce the final result.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Submission=&lt;br /&gt;
&lt;br /&gt;
As a generative task with subjective evaluation, the submission process ''differs greatly'' from other MIREX tasks. There are four important stages:&lt;br /&gt;
# Test set submission (Oct 8, 2024)&lt;br /&gt;
# Algorithm submission (Oct 15, 2024)&lt;br /&gt;
# Cherry-picked sample IDs submission (Oct 25, 2024)&lt;br /&gt;
# Evaluation form submission (Nov 3, 2024)&lt;br /&gt;
Please check the Important Dates section for the detailed schedule. '''Failure to participate in any of the stages will result in disqualification.'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Algorithm Submission==&lt;br /&gt;
Participants must include an &amp;lt;code&amp;gt;batch_acc_gen.sh&amp;lt;/code&amp;gt; script in their submission. The task captain will use the script to generate output files according to the following format:&lt;br /&gt;
&lt;br /&gt;
'''Usage'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
acc_gen.sh &amp;quot;/path/to/input.json&amp;quot; &amp;quot;/path/to/output_folder&amp;quot; n_sample&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Input File: Path to the input .json file.&lt;br /&gt;
* Output Folder: Path to the folder where the generated output files will be saved.&lt;br /&gt;
* n_sample: Number of samples to generate.&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
* The script should generate n_sample output files in the specified output folder.&lt;br /&gt;
* Output files should be named sequentially as sample_01.json, sample_02.json, ..., up to sample_n_sample.json.&lt;br /&gt;
&lt;br /&gt;
Participants are free to implement the internal logic of the script, but it must adhere to this format for proper execution during the evaluation process.&lt;br /&gt;
&lt;br /&gt;
'''Packaging Submissions'''&lt;br /&gt;
* Every submission must be packed into a docker image&lt;br /&gt;
* Every submission will be deployed and evaluated automatically with &amp;lt;code&amp;gt;docker run&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Accepted submission form'''&lt;br /&gt;
* Link to public or private Github repository&lt;br /&gt;
* Link to public or private docker hub&lt;br /&gt;
* Shared google drive links&lt;br /&gt;
* If the repository is private, an access token is also required&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Baselines=&lt;br /&gt;
&lt;br /&gt;
To establish a benchmark for this task, we consider the three baseline models in their official implementations:&lt;br /&gt;
&lt;br /&gt;
'''WholeSongGen''' (Wang et al., 2024)&lt;br /&gt;
* A denoising diffusion probabilistic model (DDPM) generating piano accompaniments as piano-roll images.&lt;br /&gt;
&lt;br /&gt;
'''Compose &amp;amp; Embellish''' (Wu and Yang, 2023)&lt;br /&gt;
* A Transformer-based architecture generating piano performances in beat-based event sequences.&lt;br /&gt;
&lt;br /&gt;
'''AccoMontage''' (Zhao and Xia, 2021)&lt;br /&gt;
* A hybrid algorithm generating piano accompaniments by rule-based search and music representation learning.&lt;br /&gt;
&lt;br /&gt;
=Contacts=&lt;br /&gt;
If you any questions or suggestions about the task, please contact:&lt;br /&gt;
* Ziyu Wang: ziyu.wang&amp;lt;at&amp;gt;nyu.edu&lt;br /&gt;
* Jingwei Zhao: jzhao&amp;lt;at&amp;gt;u.nus.edu&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=References=&lt;br /&gt;
* Harte, C. Towards automatic extraction of harmony information from music signals. PhD Diss. 2010.&lt;br /&gt;
* Lu, P., et al. Musecoco: Generating symbolic music from text. arXiv preprint arXiv:2306.00110 (2023).&lt;br /&gt;
* Wang, Z., et al. Whole-song hierarchical generation of symbolic music using cascaded diffusion models, in ICLR 2024.&lt;br /&gt;
* Wu, S.-L., &amp;amp; Yang, Y.-H. Compose &amp;amp; Embellish: Well-structured piano performance generation via a two-stage approach, in ICASSP 2023.&lt;br /&gt;
* Zhao, J., &amp;amp; Xia, G. Accomontage: Accompaniment arrangement via phrase selection and style transfer, in ISMIR 2021.&lt;br /&gt;
&lt;br /&gt;
* Code and data format samples: [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main]&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13906</id>
		<title>2024:Symbolic Music Generation</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13906"/>
		<updated>2024-10-01T02:07:51Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Baselines */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Description=&lt;br /&gt;
Symbolic music generation is a broad topic. It covers a wide range of tasks, including generation, harmonization, arrangement, instrumentation, and more. We have multiple ways to represent music data, and the evaluation metrics also vary. To define a MIREX challenge within this topic, we need to narrow our focus to specific subtasks that are both relevant to the community and feasible to evaluate effectively.&lt;br /&gt;
&lt;br /&gt;
This year, we select the task to be '''piano accompaniment arrangement from a lead sheet'''. The lead sheet provides information about the melody, and chord progression. The goal is to generate a piano accompaniment that complements the lead melody. The music data consists of 8-measure segments in 4/4 meter, quantized to a sixteenth-note resolution. A more detailed description of the data structure is provided in the data format section. The genre of the lead sheets is broadly within western pop music (refer to the music examples for more detail).&lt;br /&gt;
&lt;br /&gt;
=Data Format=&lt;br /&gt;
The input lead sheet consists of 8 bars for the melody and harmony, with an additional mandatory pickup measure (left blank if not used). The data is prepared in JSON format containing two properties: &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;: a list of chords. Each chord contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The output generation should also follow the JSON format containing one property &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt; attributes.''' &lt;br /&gt;
&lt;br /&gt;
# The data is assumed to be in 4/4 meter, quantized to a sixteenth-note resolution. For both melody and chords, onsets and durations are counted in sixteenth notes. &lt;br /&gt;
# Both onsets and durations are integers ranging from 0 to 9 * 16 - 1 = 143. Notes that end later than the ninth measure (i.e., 9 * 16 = 144th time step) will be truncated to the end of the ninth measure. &lt;br /&gt;
# Melody notes are not allowed to overlap with one another. &lt;br /&gt;
# There should be no gaps or overlaps between chords. Chords must follow one another directly. If there is a blank space where no chord is played, it must be filled with the &amp;lt;code&amp;gt;N&amp;lt;/code&amp;gt; chord. &lt;br /&gt;
# The accompaniment of the pick-up measure should be blank. &lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The pitch property of a note should be integers ranging from 0 to 127, corresponding to the MIDI pitch numbers.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the chord &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The symbol property of a chord should be a string based on the syntax of (Harte, 2010). In other words, each chord string should be able to be passed as a parameter to mir_eval.chord.encode() without causing an error.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Data Example=&lt;br /&gt;
Below is an example of the input lead sheet in the format given above. The lead sheet is the melody of the first phrase of ''Hey Jude'' by The Beatles.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;melody&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 12, &amp;quot;pitch&amp;quot;: 72, &amp;quot;duration&amp;quot;: 4},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 69, &amp;quot;duration&amp;quot;: 8},&lt;br /&gt;
    ...&lt;br /&gt;
  ],&lt;br /&gt;
  &amp;quot;chords&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 0, &amp;quot;symbol&amp;quot;: &amp;quot;N&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;symbol&amp;quot;: &amp;quot;F&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is an example of the generated accompaniment. The accompaniment is generated using the baseline method WholeSongGen introduced below. Note that the generation starts from the second measure (time step 16).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;acc&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 41, &amp;quot;duration&amp;quot;: 12},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 65, &amp;quot;duration&amp;quot;: 5},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Full data examples can be accessed in [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main/generation_samples this code repository]. MIDI conversion code and MIDI demos are also provided there.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Evaluation and Competition Format=&lt;br /&gt;
We will evaluate the submitted algorithms through an online subjective double-blind test. The evaluation format differs from conventional tasks in the following aspects:&lt;br /&gt;
* '''We use a &amp;quot;''potluck''&amp;quot; test set. Before submitting the algorithm, each team is required to submit two lead sheets.''' The organizer team will supplement the lead sheet if necessary. &lt;br /&gt;
* There will be '''no live ranking''' because the subjective test will be done after the algorithm submission deadline.&lt;br /&gt;
* To better handle randomness in the generation algorithm, we '''allow cherry-picking from a fixed number of generated samples'''.   &lt;br /&gt;
* We hope to compute some objective measurements as well, but these will only be reported as a reference.&lt;br /&gt;
&lt;br /&gt;
==Subjective Evaluation Format==&lt;br /&gt;
* After each team submits the algorithm, the organizer team will use the algorithm to generate '''16 arrangements''' for each test sample. The generated results will be returned to each team for cherry-picking.&lt;br /&gt;
* Only a subset of the test set will be used for subjective evaluation.&lt;br /&gt;
* In the subjective evaluation, we will first ask the subjects to listen to the lead melody with chords and then listen to the generated samples in random order. The order of the samples will be randomized.&lt;br /&gt;
* The subject will be asked to rate each arrangement based on the following criteria:&lt;br /&gt;
:* Harmony correctness (5-point scale)&lt;br /&gt;
:* Creativity (5-point scale)&lt;br /&gt;
:* Naturalness (5-point scale)&lt;br /&gt;
:* Overall musicality (5-point scale)&lt;br /&gt;
&lt;br /&gt;
==Objective Measurements==&lt;br /&gt;
* We will use objective measurements only as a reference. The correlation between subjective and objective scores will be measured as a reference. &lt;br /&gt;
* The current plan is to compute the Negative Log Likelihood of a large music language model (e.g., Lu et al., 2023).&lt;br /&gt;
* We welcome proposals of the objective measurements.&lt;br /&gt;
&lt;br /&gt;
==Important Dates==&lt;br /&gt;
&lt;br /&gt;
* '''Oct 8, 2024''': Submit two lead sheets as a part of the test set. &lt;br /&gt;
* '''Oct 15, 2024''': Submit the main algorithm.&lt;br /&gt;
* '''Oct 22, 2024''': Return the generated samples. The cherry-picking phase begins.&lt;br /&gt;
* '''Oct 25, 2024''': Submit the cherry-picked sample ids.&lt;br /&gt;
* '''Oct 31 - Nov 3, 2024''': Online subjective evaluation.&lt;br /&gt;
* '''Nov 5, 2024''': Announce the final result.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Submission=&lt;br /&gt;
&lt;br /&gt;
As a generative task with subjective evaluation, the submission process ''differs greatly'' from other MIREX tasks. There are four important stages:&lt;br /&gt;
# Test set submission (Oct 8, 2024)&lt;br /&gt;
# Algorithm submission (Oct 15, 2024)&lt;br /&gt;
# Cherry-picked sample IDs submission (Oct 25, 2024)&lt;br /&gt;
# Evaluation form submission (Nov 3, 2024)&lt;br /&gt;
Please check the Important Dates section for the detailed schedule. '''Failure to participate in any of the stages will result in disqualification.'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Algorithm Submission==&lt;br /&gt;
Participants must include an &amp;lt;code&amp;gt;batch_acc_gen.sh&amp;lt;/code&amp;gt; script in their submission. The task captain will use the script to generate output files according to the following format:&lt;br /&gt;
&lt;br /&gt;
'''Usage'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
acc_gen.sh &amp;quot;/path/to/input.json&amp;quot; &amp;quot;/path/to/output_folder&amp;quot; n_sample&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Input File: Path to the input .json file.&lt;br /&gt;
* Output Folder: Path to the folder where the generated output files will be saved.&lt;br /&gt;
* n_sample: Number of samples to generate.&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
* The script should generate n_sample output files in the specified output folder.&lt;br /&gt;
* Output files should be named sequentially as sample_01.json, sample_02.json, ..., up to sample_n_sample.json.&lt;br /&gt;
&lt;br /&gt;
Participants are free to implement the internal logic of the script, but it must adhere to this format for proper execution during the evaluation process.&lt;br /&gt;
&lt;br /&gt;
'''Packaging Submissions'''&lt;br /&gt;
* Every submission must be packed into a docker image&lt;br /&gt;
* Every submission will be deployed and evaluated automatically with &amp;lt;code&amp;gt;docker run&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Accepted submission form'''&lt;br /&gt;
* Link to public or private Github repository&lt;br /&gt;
* Link to public or private docker hub&lt;br /&gt;
* Shared google drive links&lt;br /&gt;
* If the repository is private, an access token is also required&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Baselines=&lt;br /&gt;
&lt;br /&gt;
To establish a benchmark for this task, we consider the three baseline models in their official implementations:&lt;br /&gt;
&lt;br /&gt;
'''WholeSongGen''' (Wang et al., 2024)&lt;br /&gt;
* A denoising diffusion probabilistic model (DDPM) generating piano accompaniments as piano-roll images.&lt;br /&gt;
&lt;br /&gt;
'''Compose &amp;amp; Embellish''' (Wu and Yang, 2023)&lt;br /&gt;
* A Transformer-based architecture generating piano performances in beat-based event sequences.&lt;br /&gt;
&lt;br /&gt;
'''AccoMontage''' (Zhao and Xia, 2021)&lt;br /&gt;
* A hybrid algorithm generating piano accompaniments by rule-based search and music representation learning.&lt;br /&gt;
&lt;br /&gt;
=Contacts=&lt;br /&gt;
If you any questions or suggestions about the task, please contact:&lt;br /&gt;
* Ziyu Wang: ziyu.wang&amp;lt;at&amp;gt;nyu.edu&lt;br /&gt;
* Jingwei Zhao: jzhao&amp;lt;at&amp;gt;u.nus.edu&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=References=&lt;br /&gt;
* Harte, C. Towards automatic extraction of harmony information from music signals. PhD Diss. 2010.&lt;br /&gt;
* Lu, P., et al. Musecoco: Generating symbolic music from text. arXiv preprint arXiv:2306.00110 (2023).&lt;br /&gt;
* Wang, Z., et al. Whole-song hierarchical generation of symbolic music using cascaded diffusion models, in ICLR 2024.&lt;br /&gt;
* Zhao, J., &amp;amp; Xia, G. Accomontage: Accompaniment arrangement via phrase selection and style transfer, in ISMIR 2021.&lt;br /&gt;
* Thickstun, J., et al. Anticipatory music transformer, TMLR, 2024.&lt;br /&gt;
* Hsiao, W. Y., et al. Compound word transformer: Learning to compose full-song music over dynamic directed hypergraphs, in AAAI 2021.&lt;br /&gt;
* Ren, Y., et al. Popmag: Pop music accompaniment generation, in MM 2020.&lt;br /&gt;
&lt;br /&gt;
* Code and data format samples: [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main]&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13905</id>
		<title>2024:Symbolic Music Generation</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13905"/>
		<updated>2024-10-01T02:06:25Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Baselines */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Description=&lt;br /&gt;
Symbolic music generation is a broad topic. It covers a wide range of tasks, including generation, harmonization, arrangement, instrumentation, and more. We have multiple ways to represent music data, and the evaluation metrics also vary. To define a MIREX challenge within this topic, we need to narrow our focus to specific subtasks that are both relevant to the community and feasible to evaluate effectively.&lt;br /&gt;
&lt;br /&gt;
This year, we select the task to be '''piano accompaniment arrangement from a lead sheet'''. The lead sheet provides information about the melody, and chord progression. The goal is to generate a piano accompaniment that complements the lead melody. The music data consists of 8-measure segments in 4/4 meter, quantized to a sixteenth-note resolution. A more detailed description of the data structure is provided in the data format section. The genre of the lead sheets is broadly within western pop music (refer to the music examples for more detail).&lt;br /&gt;
&lt;br /&gt;
=Data Format=&lt;br /&gt;
The input lead sheet consists of 8 bars for the melody and harmony, with an additional mandatory pickup measure (left blank if not used). The data is prepared in JSON format containing two properties: &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;: a list of chords. Each chord contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The output generation should also follow the JSON format containing one property &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt; attributes.''' &lt;br /&gt;
&lt;br /&gt;
# The data is assumed to be in 4/4 meter, quantized to a sixteenth-note resolution. For both melody and chords, onsets and durations are counted in sixteenth notes. &lt;br /&gt;
# Both onsets and durations are integers ranging from 0 to 9 * 16 - 1 = 143. Notes that end later than the ninth measure (i.e., 9 * 16 = 144th time step) will be truncated to the end of the ninth measure. &lt;br /&gt;
# Melody notes are not allowed to overlap with one another. &lt;br /&gt;
# There should be no gaps or overlaps between chords. Chords must follow one another directly. If there is a blank space where no chord is played, it must be filled with the &amp;lt;code&amp;gt;N&amp;lt;/code&amp;gt; chord. &lt;br /&gt;
# The accompaniment of the pick-up measure should be blank. &lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The pitch property of a note should be integers ranging from 0 to 127, corresponding to the MIDI pitch numbers.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the chord &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The symbol property of a chord should be a string based on the syntax of (Harte, 2010). In other words, each chord string should be able to be passed as a parameter to mir_eval.chord.encode() without causing an error.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Data Example=&lt;br /&gt;
Below is an example of the input lead sheet in the format given above. The lead sheet is the melody of the first phrase of ''Hey Jude'' by The Beatles.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;melody&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 12, &amp;quot;pitch&amp;quot;: 72, &amp;quot;duration&amp;quot;: 4},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 69, &amp;quot;duration&amp;quot;: 8},&lt;br /&gt;
    ...&lt;br /&gt;
  ],&lt;br /&gt;
  &amp;quot;chords&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 0, &amp;quot;symbol&amp;quot;: &amp;quot;N&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;symbol&amp;quot;: &amp;quot;F&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is an example of the generated accompaniment. The accompaniment is generated using the baseline method WholeSongGen introduced below. Note that the generation starts from the second measure (time step 16).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;acc&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 41, &amp;quot;duration&amp;quot;: 12},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 65, &amp;quot;duration&amp;quot;: 5},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Full data examples can be accessed in [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main/generation_samples this code repository]. MIDI conversion code and MIDI demos are also provided there.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Evaluation and Competition Format=&lt;br /&gt;
We will evaluate the submitted algorithms through an online subjective double-blind test. The evaluation format differs from conventional tasks in the following aspects:&lt;br /&gt;
* '''We use a &amp;quot;''potluck''&amp;quot; test set. Before submitting the algorithm, each team is required to submit two lead sheets.''' The organizer team will supplement the lead sheet if necessary. &lt;br /&gt;
* There will be '''no live ranking''' because the subjective test will be done after the algorithm submission deadline.&lt;br /&gt;
* To better handle randomness in the generation algorithm, we '''allow cherry-picking from a fixed number of generated samples'''.   &lt;br /&gt;
* We hope to compute some objective measurements as well, but these will only be reported as a reference.&lt;br /&gt;
&lt;br /&gt;
==Subjective Evaluation Format==&lt;br /&gt;
* After each team submits the algorithm, the organizer team will use the algorithm to generate '''16 arrangements''' for each test sample. The generated results will be returned to each team for cherry-picking.&lt;br /&gt;
* Only a subset of the test set will be used for subjective evaluation.&lt;br /&gt;
* In the subjective evaluation, we will first ask the subjects to listen to the lead melody with chords and then listen to the generated samples in random order. The order of the samples will be randomized.&lt;br /&gt;
* The subject will be asked to rate each arrangement based on the following criteria:&lt;br /&gt;
:* Harmony correctness (5-point scale)&lt;br /&gt;
:* Creativity (5-point scale)&lt;br /&gt;
:* Naturalness (5-point scale)&lt;br /&gt;
:* Overall musicality (5-point scale)&lt;br /&gt;
&lt;br /&gt;
==Objective Measurements==&lt;br /&gt;
* We will use objective measurements only as a reference. The correlation between subjective and objective scores will be measured as a reference. &lt;br /&gt;
* The current plan is to compute the Negative Log Likelihood of a large music language model (e.g., Lu et al., 2023).&lt;br /&gt;
* We welcome proposals of the objective measurements.&lt;br /&gt;
&lt;br /&gt;
==Important Dates==&lt;br /&gt;
&lt;br /&gt;
* '''Oct 8, 2024''': Submit two lead sheets as a part of the test set. &lt;br /&gt;
* '''Oct 15, 2024''': Submit the main algorithm.&lt;br /&gt;
* '''Oct 22, 2024''': Return the generated samples. The cherry-picking phase begins.&lt;br /&gt;
* '''Oct 25, 2024''': Submit the cherry-picked sample ids.&lt;br /&gt;
* '''Oct 31 - Nov 3, 2024''': Online subjective evaluation.&lt;br /&gt;
* '''Nov 5, 2024''': Announce the final result.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Submission=&lt;br /&gt;
&lt;br /&gt;
As a generative task with subjective evaluation, the submission process ''differs greatly'' from other MIREX tasks. There are four important stages:&lt;br /&gt;
# Test set submission (Oct 8, 2024)&lt;br /&gt;
# Algorithm submission (Oct 15, 2024)&lt;br /&gt;
# Cherry-picked sample IDs submission (Oct 25, 2024)&lt;br /&gt;
# Evaluation form submission (Nov 3, 2024)&lt;br /&gt;
Please check the Important Dates section for the detailed schedule. '''Failure to participate in any of the stages will result in disqualification.'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Algorithm Submission==&lt;br /&gt;
Participants must include an &amp;lt;code&amp;gt;batch_acc_gen.sh&amp;lt;/code&amp;gt; script in their submission. The task captain will use the script to generate output files according to the following format:&lt;br /&gt;
&lt;br /&gt;
'''Usage'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
acc_gen.sh &amp;quot;/path/to/input.json&amp;quot; &amp;quot;/path/to/output_folder&amp;quot; n_sample&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Input File: Path to the input .json file.&lt;br /&gt;
* Output Folder: Path to the folder where the generated output files will be saved.&lt;br /&gt;
* n_sample: Number of samples to generate.&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
* The script should generate n_sample output files in the specified output folder.&lt;br /&gt;
* Output files should be named sequentially as sample_01.json, sample_02.json, ..., up to sample_n_sample.json.&lt;br /&gt;
&lt;br /&gt;
Participants are free to implement the internal logic of the script, but it must adhere to this format for proper execution during the evaluation process.&lt;br /&gt;
&lt;br /&gt;
'''Packaging Submissions'''&lt;br /&gt;
* Every submission must be packed into a docker image&lt;br /&gt;
* Every submission will be deployed and evaluated automatically with &amp;lt;code&amp;gt;docker run&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Accepted submission form'''&lt;br /&gt;
* Link to public or private Github repository&lt;br /&gt;
* Link to public or private docker hub&lt;br /&gt;
* Shared google drive links&lt;br /&gt;
* If the repository is private, an access token is also required&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Baselines=&lt;br /&gt;
&lt;br /&gt;
To establish a benchmark for this task, we consider the three baseline models in their official implementations:&lt;br /&gt;
&lt;br /&gt;
'''WholeSongGen''' (Wang et al., 2024), a denoising diffusion probabilistic model (DDPM) generating piano accompaniments as piano-roll images.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Compose &amp;amp; Embellish''' (Wu and Yang, 2023), a Transformer-based architecture generating piano performances in beat-based event sequences.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''AccoMontage''' (Zhao and Xia, 2021), a hybrid algorithm generating piano accompaniments by rule-based search and music representation learning.&lt;br /&gt;
&lt;br /&gt;
=Contacts=&lt;br /&gt;
If you any questions or suggestions about the task, please contact:&lt;br /&gt;
* Ziyu Wang: ziyu.wang&amp;lt;at&amp;gt;nyu.edu&lt;br /&gt;
* Jingwei Zhao: jzhao&amp;lt;at&amp;gt;u.nus.edu&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=References=&lt;br /&gt;
* Harte, C. Towards automatic extraction of harmony information from music signals. PhD Diss. 2010.&lt;br /&gt;
* Lu, P., et al. Musecoco: Generating symbolic music from text. arXiv preprint arXiv:2306.00110 (2023).&lt;br /&gt;
* Wang, Z., et al. Whole-song hierarchical generation of symbolic music using cascaded diffusion models, in ICLR 2024.&lt;br /&gt;
* Zhao, J., &amp;amp; Xia, G. Accomontage: Accompaniment arrangement via phrase selection and style transfer, in ISMIR 2021.&lt;br /&gt;
* Thickstun, J., et al. Anticipatory music transformer, TMLR, 2024.&lt;br /&gt;
* Hsiao, W. Y., et al. Compound word transformer: Learning to compose full-song music over dynamic directed hypergraphs, in AAAI 2021.&lt;br /&gt;
* Ren, Y., et al. Popmag: Pop music accompaniment generation, in MM 2020.&lt;br /&gt;
&lt;br /&gt;
* Code and data format samples: [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main]&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13893</id>
		<title>2024:Symbolic Music Generation</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13893"/>
		<updated>2024-09-16T18:22:43Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Description=&lt;br /&gt;
Symbolic music generation is a broad topic. It covers a wide range of tasks, including generation, harmonization, arrangement, instrumentation, and more. We have multiple ways to represent music data, and the evaluation metrics also vary. To define a MIREX challenge within this topic, we need to narrow our focus to specific subtasks that are both relevant to the community and feasible to evaluate effectively.&lt;br /&gt;
&lt;br /&gt;
This year, we select the task to be '''piano accompaniment arrangement from a lead sheet'''. The lead sheet provides information about the melody, and chord progression. The goal is to generate a piano accompaniment that complements the lead melody. The music data consists of 8-measure segments in 4/4 meter, quantized to a sixteenth-note resolution. A more detailed description of the data structure is provided in the data format section. The genre of the lead sheets is broadly within western pop music (refer to the music examples for more detail).&lt;br /&gt;
&lt;br /&gt;
=Data Format=&lt;br /&gt;
The input lead sheet consists of 8 bars for the melody and harmony, with an additional mandatory pickup measure (left blank if not used). The data is prepared in JSON format containing two properties: &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;: a list of chords. Each chord contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The output generation should also follow the JSON format containing one property &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt; attributes.''' &lt;br /&gt;
&lt;br /&gt;
# The data is assumed to be in 4/4 meter, quantized to a sixteenth-note resolution. For both melody and chords, onsets and durations are counted in sixteenth notes. &lt;br /&gt;
# Both onsets and durations are integers ranging from 0 to 9 * 16 - 1 = 143. Notes that end later than the ninth measure (i.e., 9 * 16 = 144th time step) will be truncated to the end of the ninth measure. &lt;br /&gt;
# Melody notes are not allowed to overlap with one another. &lt;br /&gt;
# There should be no gaps or overlaps between chords. Chords must follow one another directly. If there is a blank space where no chord is played, it must be filled with the &amp;lt;code&amp;gt;N&amp;lt;/code&amp;gt; chord. &lt;br /&gt;
# The accompaniment of the pick-up measure should be blank. &lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The pitch property of a note should be integers ranging from 0 to 127, corresponding to the MIDI pitch numbers.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the chord &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The symbol property of a chord should be a string based on the syntax of (Harte, 2010). In other words, each chord string should be able to be passed as a parameter to mir_eval.chord.encode() without causing an error.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Data Example=&lt;br /&gt;
Below is an example of the input lead sheet in the format given above. The lead sheet is the melody of the first phrase of ''Hey Jude'' by The Beatles.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;melody&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 12, &amp;quot;pitch&amp;quot;: 72, &amp;quot;duration&amp;quot;: 4},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 69, &amp;quot;duration&amp;quot;: 8},&lt;br /&gt;
    ...&lt;br /&gt;
  ],&lt;br /&gt;
  &amp;quot;chords&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 0, &amp;quot;symbol&amp;quot;: &amp;quot;N&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;symbol&amp;quot;: &amp;quot;F&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is an example of the generated accompaniment. The accompaniment is generated using the baseline method WholeSongGen introduced below. Note that the generation starts from the second measure (time step 16).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;acc&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 41, &amp;quot;duration&amp;quot;: 12},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 65, &amp;quot;duration&amp;quot;: 5},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Full data examples can be accessed in [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main/generation_samples this code repository]. MIDI conversion code and MIDI demos are also provided there.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Evaluation and Competition Format=&lt;br /&gt;
We will evaluate the submitted algorithms through an online subjective double-blind test. The evaluation format differs from conventional tasks in the following aspects:&lt;br /&gt;
* '''We use a &amp;quot;''potluck''&amp;quot; test set. Before submitting the algorithm, each team is required to submit two lead sheets.''' The organizer team will supplement the lead sheet if necessary. &lt;br /&gt;
* There will be '''no live ranking''' because the subjective test will be done after the algorithm submission deadline.&lt;br /&gt;
* To better handle randomness in the generation algorithm, we '''allow cherry-picking from a fixed number of generated samples'''.   &lt;br /&gt;
* We hope to compute some objective measurements as well, but these will only be reported as a reference.&lt;br /&gt;
&lt;br /&gt;
==Subjective Evaluation Format==&lt;br /&gt;
* After each team submits the algorithm, the organizer team will use the algorithm to generate '''16 arrangements''' for each test sample. The generated results will be returned to each team for cherry-picking.&lt;br /&gt;
* Only a subset of the test set will be used for subjective evaluation.&lt;br /&gt;
* In the subjective evaluation, we will first ask the subjects to listen to the lead melody with chords and then listen to the generated samples in random order. The order of the samples will be randomized.&lt;br /&gt;
* The subject will be asked to rate each arrangement based on the following criteria:&lt;br /&gt;
:* Harmony correctness (5-point scale)&lt;br /&gt;
:* Creativity (5-point scale)&lt;br /&gt;
:* Naturalness (5-point scale)&lt;br /&gt;
:* Overall musicality (5-point scale)&lt;br /&gt;
&lt;br /&gt;
==Objective Measurements==&lt;br /&gt;
* We will use objective measurements only as a reference. The correlation between subjective and objective scores will be measured as a reference. &lt;br /&gt;
* The current plan is to compute the Negative Log Likelihood of a large music language model (e.g., Lu et al., 2023).&lt;br /&gt;
* We welcome proposals of the objective measurements.&lt;br /&gt;
&lt;br /&gt;
==Important Dates==&lt;br /&gt;
&lt;br /&gt;
* '''Oct 8, 2024''': Submit two lead sheets as a part of the test set. &lt;br /&gt;
* '''Oct 15, 2024''': Submit the main algorithm.&lt;br /&gt;
* '''Oct 22, 2024''': Return the generated samples. The cherry-picking phase begins.&lt;br /&gt;
* '''Oct 25, 2024''': Submit the cherry-picked sample ids.&lt;br /&gt;
* '''Oct 31 - Nov 3, 2024''': Online subjective evaluation.&lt;br /&gt;
* '''Nov 5, 2024''': Announce the final result.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Submission=&lt;br /&gt;
&lt;br /&gt;
As a generative task with subjective evaluation, the submission process ''differs greatly'' from other MIREX tasks. There are four important stages:&lt;br /&gt;
# Test set submission (Oct 8, 2024)&lt;br /&gt;
# Algorithm submission (Oct 15, 2024)&lt;br /&gt;
# Cherry-picked sample IDs submission (Oct 25, 2024)&lt;br /&gt;
# Evaluation form submission (Nov 3, 2024)&lt;br /&gt;
Please check the Important Dates section for the detailed schedule. '''Failure to participate in any of the stages will result in disqualification.'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Algorithm Submission==&lt;br /&gt;
Participants must include an &amp;lt;code&amp;gt;batch_acc_gen.sh&amp;lt;/code&amp;gt; script in their submission. The task captain will use the script to generate output files according to the following format:&lt;br /&gt;
&lt;br /&gt;
'''Usage'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
acc_gen.sh &amp;quot;/path/to/input.json&amp;quot; &amp;quot;/path/to/output_folder&amp;quot; n_sample&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Input File: Path to the input .json file.&lt;br /&gt;
* Output Folder: Path to the folder where the generated output files will be saved.&lt;br /&gt;
* n_sample: Number of samples to generate.&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
* The script should generate n_sample output files in the specified output folder.&lt;br /&gt;
* Output files should be named sequentially as sample_01.json, sample_02.json, ..., up to sample_n_sample.json.&lt;br /&gt;
&lt;br /&gt;
Participants are free to implement the internal logic of the script, but it must adhere to this format for proper execution during the evaluation process.&lt;br /&gt;
&lt;br /&gt;
'''Packaging Submissions'''&lt;br /&gt;
* Every submission must be packed into a docker image&lt;br /&gt;
* Every submission will be deployed and evaluated automatically with &amp;lt;code&amp;gt;docker run&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Accepted submission form'''&lt;br /&gt;
* Link to public or private Github repository&lt;br /&gt;
* Link to public or private docker hub&lt;br /&gt;
* Shared google drive links&lt;br /&gt;
* If the repository is private, an access token is also required&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Baselines=&lt;br /&gt;
&lt;br /&gt;
To establish a benchmark for this task, we have several options for baseline models, subjective to the availability of official implementations:&lt;br /&gt;
&lt;br /&gt;
'''WholeSongGen''' (Wang et al., 2024)&lt;br /&gt;
&lt;br /&gt;
This model generates piano accompaniment using a diffusion model.&lt;br /&gt;
&lt;br /&gt;
'''AccoMontage''' (Zhao et al., 2021)&lt;br /&gt;
&lt;br /&gt;
This algorithm generates piano accompaniment using a combination of rule-based search and deep representation learning.&lt;br /&gt;
&lt;br /&gt;
'''Compound Word Transformer''' (Hsiao et al., 2021)&lt;br /&gt;
&lt;br /&gt;
This model generates piano performance using a Transformer-based architecture. &lt;br /&gt;
&lt;br /&gt;
'''Anticipatory Music Transformer''' (Thickstun et al., 2024)&lt;br /&gt;
&lt;br /&gt;
A Transformer-based model for track infilling and accompaniment generation. While the current implementation does not explicitly consider chord input, it remains a relevant starting point to study with.&lt;br /&gt;
&lt;br /&gt;
'''PopMAG''' (Ren et al., 2020)&lt;br /&gt;
&lt;br /&gt;
A Transformer-based model for multi-track accompaniment generation.&lt;br /&gt;
&lt;br /&gt;
=Contacts=&lt;br /&gt;
If you any questions or suggestions about the task, please contact:&lt;br /&gt;
* Ziyu Wang: ziyu.wang&amp;lt;at&amp;gt;nyu.edu&lt;br /&gt;
* Jingwei Zhao: jzhao&amp;lt;at&amp;gt;u.nus.edu&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=References=&lt;br /&gt;
* Harte, C. Towards automatic extraction of harmony information from music signals. PhD Diss. 2010.&lt;br /&gt;
* Lu, P., et al. Musecoco: Generating symbolic music from text. arXiv preprint arXiv:2306.00110 (2023).&lt;br /&gt;
* Wang, Z., et al. Whole-song hierarchical generation of symbolic music using cascaded diffusion models, in ICLR 2024.&lt;br /&gt;
* Zhao, J., &amp;amp; Xia, G. Accomontage: Accompaniment arrangement via phrase selection and style transfer, in ISMIR 2021.&lt;br /&gt;
* Thickstun, J., et al. Anticipatory music transformer, TMLR, 2024.&lt;br /&gt;
* Hsiao, W. Y., et al. Compound word transformer: Learning to compose full-song music over dynamic directed hypergraphs, in AAAI 2021.&lt;br /&gt;
* Ren, Y., et al. Popmag: Pop music accompaniment generation, in MM 2020.&lt;br /&gt;
&lt;br /&gt;
* Code and data format samples: [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main]&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13892</id>
		<title>2024:Symbolic Music Generation</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13892"/>
		<updated>2024-09-16T18:07:19Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Baselines */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Description=&lt;br /&gt;
Symbolic music generation is a broad topic. It covers a wide range of tasks, including generation, harmonization, arrangement, instrumentation, and more. We have multiple ways to represent music data, and the evaluation metrics also vary. To define a MIREX challenge within this topic, we need to narrow our focus to specific subtasks that are both relevant to the community and feasible to evaluate effectively.&lt;br /&gt;
&lt;br /&gt;
This year, we select the task to be '''piano accompaniment arrangement from a lead sheet'''. The lead sheet provides information about the melody, and chord progression. The goal is to generate a piano accompaniment that complements the lead melody. The music data consists of 8-measure segments in 4/4 meter, quantized to a sixteenth-note resolution. A more detailed description of the data structure is provided in the data format section. The genre of the lead sheets is broadly within western pop music (refer to the music examples for more detail).&lt;br /&gt;
&lt;br /&gt;
=Data Format=&lt;br /&gt;
The input lead sheet consists of 8 bars for the melody and harmony, with an additional mandatory pickup measure (left blank if not used). The data is prepared in JSON format containing two properties: &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;: a list of chords. Each chord contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The output generation should also follow the JSON format containing one property &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt; attributes.''' &lt;br /&gt;
&lt;br /&gt;
# The data is assumed to be in 4/4 meter, quantized to a sixteenth-note resolution. For both melody and chords, onsets and durations are counted in sixteenth notes. &lt;br /&gt;
# Both onsets and durations are integers ranging from 0 to 9 * 16 - 1 = 143. Notes that end later than the ninth measure (i.e., 9 * 16 = 144th time step) will be truncated to the end of the ninth measure. &lt;br /&gt;
# Melody notes are not allowed to overlap with one another. &lt;br /&gt;
# There should be no gaps or overlaps between chords. Chords must follow one another directly. If there is a blank space where no chord is played, it must be filled with the &amp;lt;code&amp;gt;N&amp;lt;/code&amp;gt; chord. &lt;br /&gt;
# The accompaniment of the pick-up measure should be blank. &lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The pitch property of a note should be integers ranging from 0 to 127, corresponding to the MIDI pitch numbers.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the chord &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The symbol property of a chord should be a string based on the syntax of (Harte, 2010). In other words, each chord string should be able to be passed as a parameter to mir_eval.chord.encode() without causing an error.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Data Example=&lt;br /&gt;
Below is an example of the input lead sheet in the format given above. The lead sheet is the melody of the first phrase of ''Hey Jude'' by The Beatles.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;melody&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 12, &amp;quot;pitch&amp;quot;: 72, &amp;quot;duration&amp;quot;: 4},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 69, &amp;quot;duration&amp;quot;: 8},&lt;br /&gt;
    ...&lt;br /&gt;
  ],&lt;br /&gt;
  &amp;quot;chords&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 0, &amp;quot;symbol&amp;quot;: &amp;quot;N&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;symbol&amp;quot;: &amp;quot;F&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is an example of the generated accompaniment. The accompaniment is generated using the baseline method WholeSongGen introduced below. Note that the generation starts from the second measure (time step 16).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;acc&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 41, &amp;quot;duration&amp;quot;: 12},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 65, &amp;quot;duration&amp;quot;: 5},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Full data examples can be accessed in [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main/generation_samples this code repository]. MIDI conversion code and MIDI demos are also provided there.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Evaluation and Competition Format=&lt;br /&gt;
We will evaluate the submitted algorithms through an online subjective double-blind test. The evaluation format differs from conventional tasks in the following aspects:&lt;br /&gt;
* '''We use a &amp;quot;''potluck''&amp;quot; test set. Before submitting the algorithm, each team is required to submit two lead sheets.''' The organizer team will supplement the lead sheet if necessary. &lt;br /&gt;
* There will be '''no live ranking''' because the subjective test will be done after the algorithm submission deadline.&lt;br /&gt;
* To better handle randomness in the generation algorithm, we '''allow cherry-picking from a fixed number of generated samples'''.   &lt;br /&gt;
* We hope to compute some objective measurements as well, but these will only be reported as a reference.&lt;br /&gt;
&lt;br /&gt;
==Subjective Evaluation Format==&lt;br /&gt;
* After each team submits the algorithm, the organizer team will use the algorithm to generate '''16 arrangements''' for each test sample. The generated results will be returned to each team for cherry-picking.&lt;br /&gt;
* Only a subset of the test set will be used for subjective evaluation.&lt;br /&gt;
* In the subjective evaluation, we will first ask the subjects to listen to the lead melody with chords and then listen to the generated samples in random order. The order of the samples will be randomized.&lt;br /&gt;
* The subject will be asked to rate each arrangement based on the following criteria:&lt;br /&gt;
:* Harmony correctness (5-point scale)&lt;br /&gt;
:* Creativity (5-point scale)&lt;br /&gt;
:* Naturalness (5-point scale)&lt;br /&gt;
:* Overall musicality (5-point scale)&lt;br /&gt;
&lt;br /&gt;
==Objective Measurements==&lt;br /&gt;
* We will use objective measurements only as a reference. The correlation between subjective and objective scores will be measured as a reference. &lt;br /&gt;
* The current plan is to compute the Negative Log Likelihood of a large music language model (e.g., Lu et al., 2023).&lt;br /&gt;
* We welcome proposals of the objective measurements.&lt;br /&gt;
&lt;br /&gt;
==Important Dates==&lt;br /&gt;
&lt;br /&gt;
* '''Oct 8, 2024''': Submit two lead sheets as a part of the test set. &lt;br /&gt;
* '''Oct 15, 2024''': Submit the main algorithm.&lt;br /&gt;
* '''Oct 22, 2024''': Return the generated samples. The cherry-picking phase begins.&lt;br /&gt;
* '''Oct 25, 2024''': Submit the cherry-picked sample ids.&lt;br /&gt;
* '''Oct 31 - Nov 3, 2024''': Online subjective evaluation.&lt;br /&gt;
* '''Nov 5, 2024''': Announce the final result.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Submission=&lt;br /&gt;
&lt;br /&gt;
As a generative task with subjective evaluation, the submission process ''differs greatly'' from other MIREX tasks. There are four important stages:&lt;br /&gt;
# Test set submission (Oct 8, 2024)&lt;br /&gt;
# Algorithm submission (Oct 15, 2024)&lt;br /&gt;
# Cherry-picked sample IDs submission (Oct 25, 2024)&lt;br /&gt;
# Evaluation form submission (Nov 3, 2024)&lt;br /&gt;
Please check the Important Dates section for the detailed schedule. '''Failure to participate in any of the stages will result in disqualification.'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Algorithm Submission==&lt;br /&gt;
Participants must include an &amp;lt;code&amp;gt;batch_acc_gen.sh&amp;lt;/code&amp;gt; script in their submission. The task captain will use the script to generate output files according to the following format:&lt;br /&gt;
&lt;br /&gt;
'''Usage'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
acc_gen.sh &amp;quot;/path/to/input.json&amp;quot; &amp;quot;/path/to/output_folder&amp;quot; n_sample&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Input File: Path to the input .json file.&lt;br /&gt;
* Output Folder: Path to the folder where the generated output files will be saved.&lt;br /&gt;
* n_sample: Number of samples to generate.&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
* The script should generate n_sample output files in the specified output folder.&lt;br /&gt;
* Output files should be named sequentially as sample_01.json, sample_02.json, ..., up to sample_n_sample.json.&lt;br /&gt;
&lt;br /&gt;
Participants are free to implement the internal logic of the script, but it must adhere to this format for proper execution during the evaluation process.&lt;br /&gt;
&lt;br /&gt;
'''Packaging Submissions'''&lt;br /&gt;
* Every submission must be packed into a docker image&lt;br /&gt;
* Every submission will be deployed and evaluated automatically with &amp;lt;code&amp;gt;docker run&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Accepted submission form'''&lt;br /&gt;
* Link to public or private Github repository&lt;br /&gt;
* Link to public or private docker hub&lt;br /&gt;
* Shared google drive links&lt;br /&gt;
* If the repository is private, an access token is also required&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Baselines=&lt;br /&gt;
&lt;br /&gt;
To establish a benchmark for this task, we have several options for baseline models, subjective to the availability of official implementations:&lt;br /&gt;
&lt;br /&gt;
'''WholeSongGen''' (Wang et al., 2024)&lt;br /&gt;
&lt;br /&gt;
This model generates piano accompaniment using a diffusion model.&lt;br /&gt;
&lt;br /&gt;
'''AccoMontage''' (Zhao et al., 2021)&lt;br /&gt;
&lt;br /&gt;
This algorithm generates piano accompaniment using a combination of rule-based search and deep representation learning.&lt;br /&gt;
&lt;br /&gt;
'''Compound Word Transformer''' (Hsiao et al., 2021)&lt;br /&gt;
&lt;br /&gt;
This model generates piano performance using a Transformer-based architecture. &lt;br /&gt;
&lt;br /&gt;
'''Anticipatory Music Transformer''' (Thickstun et al., 2024)&lt;br /&gt;
&lt;br /&gt;
A Transformer-based model for track infilling and accompaniment generation. While the current implementation does not explicitly consider chord input, it remains a relevant starting point to study with.&lt;br /&gt;
&lt;br /&gt;
'''PopMAG''' (Ren et al., 2020)&lt;br /&gt;
&lt;br /&gt;
A Transformer-based model for multi-track accompaniment generation.&lt;br /&gt;
&lt;br /&gt;
=Contacts=&lt;br /&gt;
If you any questions or suggestions about the task, please contact:&lt;br /&gt;
* Ziyu Wang: ziyu.wang&amp;lt;at&amp;gt;nyu.edu&lt;br /&gt;
* Jingwei Zhao: jzhao&amp;lt;at&amp;gt;u.nus.edu&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=References=&lt;br /&gt;
* Harte, C. (2010). Towards automatic extraction of harmony information from music signals (Doctoral dissertation).* Lu, P., Xu, X., Kang, C., Yu, B., Xing, C., Tan, X., &amp;amp; Bian, J. (2023). Musecoco: Generating symbolic music from text. arXiv preprint arXiv:2306.00110.&lt;br /&gt;
* Wang, Z., Min, L., &amp;amp; Xia, G. Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models. In The Twelfth International Conference on Learning Representations.&lt;br /&gt;
* Jingwei Zhao, &amp;amp; Gus Xia (2021). AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer. In Proceedings of the 22nd International Society for Music Information Retrieval Conference, ISMIR 2021, Online, November 7-12, 2021 (pp. 833–840).&lt;br /&gt;
* Thickstun, J., Hall, D., Donahue, C., &amp;amp; Liang, P. (2023). Anticipatory music transformer. arXiv preprint arXiv:2306.08620.&lt;br /&gt;
* Code and data format samples: [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main]&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13891</id>
		<title>2024:Symbolic Music Generation</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13891"/>
		<updated>2024-09-16T18:05:47Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Baselines */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Description=&lt;br /&gt;
Symbolic music generation is a broad topic. It covers a wide range of tasks, including generation, harmonization, arrangement, instrumentation, and more. We have multiple ways to represent music data, and the evaluation metrics also vary. To define a MIREX challenge within this topic, we need to narrow our focus to specific subtasks that are both relevant to the community and feasible to evaluate effectively.&lt;br /&gt;
&lt;br /&gt;
This year, we select the task to be '''piano accompaniment arrangement from a lead sheet'''. The lead sheet provides information about the melody, and chord progression. The goal is to generate a piano accompaniment that complements the lead melody. The music data consists of 8-measure segments in 4/4 meter, quantized to a sixteenth-note resolution. A more detailed description of the data structure is provided in the data format section. The genre of the lead sheets is broadly within western pop music (refer to the music examples for more detail).&lt;br /&gt;
&lt;br /&gt;
=Data Format=&lt;br /&gt;
The input lead sheet consists of 8 bars for the melody and harmony, with an additional mandatory pickup measure (left blank if not used). The data is prepared in JSON format containing two properties: &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;: a list of chords. Each chord contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The output generation should also follow the JSON format containing one property &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt; attributes.''' &lt;br /&gt;
&lt;br /&gt;
# The data is assumed to be in 4/4 meter, quantized to a sixteenth-note resolution. For both melody and chords, onsets and durations are counted in sixteenth notes. &lt;br /&gt;
# Both onsets and durations are integers ranging from 0 to 9 * 16 - 1 = 143. Notes that end later than the ninth measure (i.e., 9 * 16 = 144th time step) will be truncated to the end of the ninth measure. &lt;br /&gt;
# Melody notes are not allowed to overlap with one another. &lt;br /&gt;
# There should be no gaps or overlaps between chords. Chords must follow one another directly. If there is a blank space where no chord is played, it must be filled with the &amp;lt;code&amp;gt;N&amp;lt;/code&amp;gt; chord. &lt;br /&gt;
# The accompaniment of the pick-up measure should be blank. &lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The pitch property of a note should be integers ranging from 0 to 127, corresponding to the MIDI pitch numbers.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the chord &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The symbol property of a chord should be a string based on the syntax of (Harte, 2010). In other words, each chord string should be able to be passed as a parameter to mir_eval.chord.encode() without causing an error.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Data Example=&lt;br /&gt;
Below is an example of the input lead sheet in the format given above. The lead sheet is the melody of the first phrase of ''Hey Jude'' by The Beatles.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;melody&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 12, &amp;quot;pitch&amp;quot;: 72, &amp;quot;duration&amp;quot;: 4},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 69, &amp;quot;duration&amp;quot;: 8},&lt;br /&gt;
    ...&lt;br /&gt;
  ],&lt;br /&gt;
  &amp;quot;chords&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 0, &amp;quot;symbol&amp;quot;: &amp;quot;N&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;symbol&amp;quot;: &amp;quot;F&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is an example of the generated accompaniment. The accompaniment is generated using the baseline method WholeSongGen introduced below. Note that the generation starts from the second measure (time step 16).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;acc&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 41, &amp;quot;duration&amp;quot;: 12},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 65, &amp;quot;duration&amp;quot;: 5},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Full data examples can be accessed in [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main/generation_samples this code repository]. MIDI conversion code and MIDI demos are also provided there.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Evaluation and Competition Format=&lt;br /&gt;
We will evaluate the submitted algorithms through an online subjective double-blind test. The evaluation format differs from conventional tasks in the following aspects:&lt;br /&gt;
* '''We use a &amp;quot;''potluck''&amp;quot; test set. Before submitting the algorithm, each team is required to submit two lead sheets.''' The organizer team will supplement the lead sheet if necessary. &lt;br /&gt;
* There will be '''no live ranking''' because the subjective test will be done after the algorithm submission deadline.&lt;br /&gt;
* To better handle randomness in the generation algorithm, we '''allow cherry-picking from a fixed number of generated samples'''.   &lt;br /&gt;
* We hope to compute some objective measurements as well, but these will only be reported as a reference.&lt;br /&gt;
&lt;br /&gt;
==Subjective Evaluation Format==&lt;br /&gt;
* After each team submits the algorithm, the organizer team will use the algorithm to generate '''16 arrangements''' for each test sample. The generated results will be returned to each team for cherry-picking.&lt;br /&gt;
* Only a subset of the test set will be used for subjective evaluation.&lt;br /&gt;
* In the subjective evaluation, we will first ask the subjects to listen to the lead melody with chords and then listen to the generated samples in random order. The order of the samples will be randomized.&lt;br /&gt;
* The subject will be asked to rate each arrangement based on the following criteria:&lt;br /&gt;
:* Harmony correctness (5-point scale)&lt;br /&gt;
:* Creativity (5-point scale)&lt;br /&gt;
:* Naturalness (5-point scale)&lt;br /&gt;
:* Overall musicality (5-point scale)&lt;br /&gt;
&lt;br /&gt;
==Objective Measurements==&lt;br /&gt;
* We will use objective measurements only as a reference. The correlation between subjective and objective scores will be measured as a reference. &lt;br /&gt;
* The current plan is to compute the Negative Log Likelihood of a large music language model (e.g., Lu et al., 2023).&lt;br /&gt;
* We welcome proposals of the objective measurements.&lt;br /&gt;
&lt;br /&gt;
==Important Dates==&lt;br /&gt;
&lt;br /&gt;
* '''Oct 8, 2024''': Submit two lead sheets as a part of the test set. &lt;br /&gt;
* '''Oct 15, 2024''': Submit the main algorithm.&lt;br /&gt;
* '''Oct 22, 2024''': Return the generated samples. The cherry-picking phase begins.&lt;br /&gt;
* '''Oct 25, 2024''': Submit the cherry-picked sample ids.&lt;br /&gt;
* '''Oct 31 - Nov 3, 2024''': Online subjective evaluation.&lt;br /&gt;
* '''Nov 5, 2024''': Announce the final result.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Submission=&lt;br /&gt;
&lt;br /&gt;
As a generative task with subjective evaluation, the submission process ''differs greatly'' from other MIREX tasks. There are four important stages:&lt;br /&gt;
# Test set submission (Oct 8, 2024)&lt;br /&gt;
# Algorithm submission (Oct 15, 2024)&lt;br /&gt;
# Cherry-picked sample IDs submission (Oct 25, 2024)&lt;br /&gt;
# Evaluation form submission (Nov 3, 2024)&lt;br /&gt;
Please check the Important Dates section for the detailed schedule. '''Failure to participate in any of the stages will result in disqualification.'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Algorithm Submission==&lt;br /&gt;
Participants must include an &amp;lt;code&amp;gt;batch_acc_gen.sh&amp;lt;/code&amp;gt; script in their submission. The task captain will use the script to generate output files according to the following format:&lt;br /&gt;
&lt;br /&gt;
'''Usage'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
acc_gen.sh &amp;quot;/path/to/input.json&amp;quot; &amp;quot;/path/to/output_folder&amp;quot; n_sample&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Input File: Path to the input .json file.&lt;br /&gt;
* Output Folder: Path to the folder where the generated output files will be saved.&lt;br /&gt;
* n_sample: Number of samples to generate.&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
* The script should generate n_sample output files in the specified output folder.&lt;br /&gt;
* Output files should be named sequentially as sample_01.json, sample_02.json, ..., up to sample_n_sample.json.&lt;br /&gt;
&lt;br /&gt;
Participants are free to implement the internal logic of the script, but it must adhere to this format for proper execution during the evaluation process.&lt;br /&gt;
&lt;br /&gt;
'''Packaging Submissions'''&lt;br /&gt;
* Every submission must be packed into a docker image&lt;br /&gt;
* Every submission will be deployed and evaluated automatically with &amp;lt;code&amp;gt;docker run&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Accepted submission form'''&lt;br /&gt;
* Link to public or private Github repository&lt;br /&gt;
* Link to public or private docker hub&lt;br /&gt;
* Shared google drive links&lt;br /&gt;
* If the repository is private, an access token is also required&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Baselines=&lt;br /&gt;
&lt;br /&gt;
To establish a benchmark for this task, we have several options for baseline models, subjective to the availability of official implementations:&lt;br /&gt;
&lt;br /&gt;
'''WholeSongGen''' (Wang et al., 2024)&lt;br /&gt;
&lt;br /&gt;
This model generates piano accompaniment using a diffusion model.&lt;br /&gt;
&lt;br /&gt;
'''AccoMontage''' (Zhao et al., 2020)&lt;br /&gt;
&lt;br /&gt;
This algorithm generates piano accompaniment using a combination of rule-based search and deep representation learning.&lt;br /&gt;
&lt;br /&gt;
'''Compound Word Transformer''' (Hsiao et al., 2021)&lt;br /&gt;
&lt;br /&gt;
This model generates piano performance using a Transformer-based architecture. &lt;br /&gt;
&lt;br /&gt;
'''Anticipatory Music Transformer''' (Thickstun et al., 2024)&lt;br /&gt;
&lt;br /&gt;
A Transformer-based model for track infilling and accompaniment generation. While the current implementation does not explicitly consider chord input, it remains a relevant starting point to study with.&lt;br /&gt;
&lt;br /&gt;
'''PopMAG''' (Ren et al., 2020)&lt;br /&gt;
&lt;br /&gt;
A Transformer-based model for multi-track accompaniment generation.&lt;br /&gt;
&lt;br /&gt;
=Contacts=&lt;br /&gt;
If you any questions or suggestions about the task, please contact:&lt;br /&gt;
* Ziyu Wang: ziyu.wang&amp;lt;at&amp;gt;nyu.edu&lt;br /&gt;
* Jingwei Zhao: jzhao&amp;lt;at&amp;gt;u.nus.edu&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=References=&lt;br /&gt;
* Harte, C. (2010). Towards automatic extraction of harmony information from music signals (Doctoral dissertation).* Lu, P., Xu, X., Kang, C., Yu, B., Xing, C., Tan, X., &amp;amp; Bian, J. (2023). Musecoco: Generating symbolic music from text. arXiv preprint arXiv:2306.00110.&lt;br /&gt;
* Wang, Z., Min, L., &amp;amp; Xia, G. Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models. In The Twelfth International Conference on Learning Representations.&lt;br /&gt;
* Jingwei Zhao, &amp;amp; Gus Xia (2021). AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer. In Proceedings of the 22nd International Society for Music Information Retrieval Conference, ISMIR 2021, Online, November 7-12, 2021 (pp. 833–840).&lt;br /&gt;
* Thickstun, J., Hall, D., Donahue, C., &amp;amp; Liang, P. (2023). Anticipatory music transformer. arXiv preprint arXiv:2306.08620.&lt;br /&gt;
* Code and data format samples: [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main]&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
	<entry>
		<id>https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13890</id>
		<title>2024:Symbolic Music Generation</title>
		<link rel="alternate" type="text/html" href="https://music-ir.org/mirex/w/index.php?title=2024:Symbolic_Music_Generation&amp;diff=13890"/>
		<updated>2024-09-16T18:04:39Z</updated>

		<summary type="html">&lt;p&gt;Zhaojw1998: /* Baselines */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Description=&lt;br /&gt;
Symbolic music generation is a broad topic. It covers a wide range of tasks, including generation, harmonization, arrangement, instrumentation, and more. We have multiple ways to represent music data, and the evaluation metrics also vary. To define a MIREX challenge within this topic, we need to narrow our focus to specific subtasks that are both relevant to the community and feasible to evaluate effectively.&lt;br /&gt;
&lt;br /&gt;
This year, we select the task to be '''piano accompaniment arrangement from a lead sheet'''. The lead sheet provides information about the melody, and chord progression. The goal is to generate a piano accompaniment that complements the lead melody. The music data consists of 8-measure segments in 4/4 meter, quantized to a sixteenth-note resolution. A more detailed description of the data structure is provided in the data format section. The genre of the lead sheets is broadly within western pop music (refer to the music examples for more detail).&lt;br /&gt;
&lt;br /&gt;
=Data Format=&lt;br /&gt;
The input lead sheet consists of 8 bars for the melody and harmony, with an additional mandatory pickup measure (left blank if not used). The data is prepared in JSON format containing two properties: &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;melody&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;chords&amp;lt;/code&amp;gt;: a list of chords. Each chord contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The output generation should also follow the JSON format containing one property &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;acc&amp;lt;/code&amp;gt;: a list of notes. Each note contains properties of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of &amp;lt;code&amp;gt;start&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;duration&amp;lt;/code&amp;gt; attributes.''' &lt;br /&gt;
&lt;br /&gt;
# The data is assumed to be in 4/4 meter, quantized to a sixteenth-note resolution. For both melody and chords, onsets and durations are counted in sixteenth notes. &lt;br /&gt;
# Both onsets and durations are integers ranging from 0 to 9 * 16 - 1 = 143. Notes that end later than the ninth measure (i.e., 9 * 16 = 144th time step) will be truncated to the end of the ninth measure. &lt;br /&gt;
# Melody notes are not allowed to overlap with one another. &lt;br /&gt;
# There should be no gaps or overlaps between chords. Chords must follow one another directly. If there is a blank space where no chord is played, it must be filled with the &amp;lt;code&amp;gt;N&amp;lt;/code&amp;gt; chord. &lt;br /&gt;
# The accompaniment of the pick-up measure should be blank. &lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the &amp;lt;code&amp;gt;pitch&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The pitch property of a note should be integers ranging from 0 to 127, corresponding to the MIDI pitch numbers.&lt;br /&gt;
&lt;br /&gt;
'''Detailed explanation of the chord &amp;lt;code&amp;gt;symbol&amp;lt;/code&amp;gt; attribute.''' &lt;br /&gt;
&lt;br /&gt;
# The symbol property of a chord should be a string based on the syntax of (Harte, 2010). In other words, each chord string should be able to be passed as a parameter to mir_eval.chord.encode() without causing an error.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Data Example=&lt;br /&gt;
Below is an example of the input lead sheet in the format given above. The lead sheet is the melody of the first phrase of ''Hey Jude'' by The Beatles.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;melody&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 12, &amp;quot;pitch&amp;quot;: 72, &amp;quot;duration&amp;quot;: 4},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 69, &amp;quot;duration&amp;quot;: 8},&lt;br /&gt;
    ...&lt;br /&gt;
  ],&lt;br /&gt;
  &amp;quot;chords&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 0, &amp;quot;symbol&amp;quot;: &amp;quot;N&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;symbol&amp;quot;: &amp;quot;F&amp;quot;, &amp;quot;duration&amp;quot;: 16},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This is an example of the generated accompaniment. The accompaniment is generated using the baseline method WholeSongGen introduced below. Note that the generation starts from the second measure (time step 16).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
{&lt;br /&gt;
  &amp;quot;acc&amp;quot;: [&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 41, &amp;quot;duration&amp;quot;: 12},&lt;br /&gt;
    {&amp;quot;start&amp;quot;: 16, &amp;quot;pitch&amp;quot;: 65, &amp;quot;duration&amp;quot;: 5},&lt;br /&gt;
    ...&lt;br /&gt;
  ]&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Full data examples can be accessed in [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main/generation_samples this code repository]. MIDI conversion code and MIDI demos are also provided there.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Evaluation and Competition Format=&lt;br /&gt;
We will evaluate the submitted algorithms through an online subjective double-blind test. The evaluation format differs from conventional tasks in the following aspects:&lt;br /&gt;
* '''We use a &amp;quot;''potluck''&amp;quot; test set. Before submitting the algorithm, each team is required to submit two lead sheets.''' The organizer team will supplement the lead sheet if necessary. &lt;br /&gt;
* There will be '''no live ranking''' because the subjective test will be done after the algorithm submission deadline.&lt;br /&gt;
* To better handle randomness in the generation algorithm, we '''allow cherry-picking from a fixed number of generated samples'''.   &lt;br /&gt;
* We hope to compute some objective measurements as well, but these will only be reported as a reference.&lt;br /&gt;
&lt;br /&gt;
==Subjective Evaluation Format==&lt;br /&gt;
* After each team submits the algorithm, the organizer team will use the algorithm to generate '''16 arrangements''' for each test sample. The generated results will be returned to each team for cherry-picking.&lt;br /&gt;
* Only a subset of the test set will be used for subjective evaluation.&lt;br /&gt;
* In the subjective evaluation, we will first ask the subjects to listen to the lead melody with chords and then listen to the generated samples in random order. The order of the samples will be randomized.&lt;br /&gt;
* The subject will be asked to rate each arrangement based on the following criteria:&lt;br /&gt;
:* Harmony correctness (5-point scale)&lt;br /&gt;
:* Creativity (5-point scale)&lt;br /&gt;
:* Naturalness (5-point scale)&lt;br /&gt;
:* Overall musicality (5-point scale)&lt;br /&gt;
&lt;br /&gt;
==Objective Measurements==&lt;br /&gt;
* We will use objective measurements only as a reference. The correlation between subjective and objective scores will be measured as a reference. &lt;br /&gt;
* The current plan is to compute the Negative Log Likelihood of a large music language model (e.g., Lu et al., 2023).&lt;br /&gt;
* We welcome proposals of the objective measurements.&lt;br /&gt;
&lt;br /&gt;
==Important Dates==&lt;br /&gt;
&lt;br /&gt;
* '''Oct 8, 2024''': Submit two lead sheets as a part of the test set. &lt;br /&gt;
* '''Oct 15, 2024''': Submit the main algorithm.&lt;br /&gt;
* '''Oct 22, 2024''': Return the generated samples. The cherry-picking phase begins.&lt;br /&gt;
* '''Oct 25, 2024''': Submit the cherry-picked sample ids.&lt;br /&gt;
* '''Oct 31 - Nov 3, 2024''': Online subjective evaluation.&lt;br /&gt;
* '''Nov 5, 2024''': Announce the final result.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Submission=&lt;br /&gt;
&lt;br /&gt;
As a generative task with subjective evaluation, the submission process ''differs greatly'' from other MIREX tasks. There are four important stages:&lt;br /&gt;
# Test set submission (Oct 8, 2024)&lt;br /&gt;
# Algorithm submission (Oct 15, 2024)&lt;br /&gt;
# Cherry-picked sample IDs submission (Oct 25, 2024)&lt;br /&gt;
# Evaluation form submission (Nov 3, 2024)&lt;br /&gt;
Please check the Important Dates section for the detailed schedule. '''Failure to participate in any of the stages will result in disqualification.'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Algorithm Submission==&lt;br /&gt;
Participants must include an &amp;lt;code&amp;gt;batch_acc_gen.sh&amp;lt;/code&amp;gt; script in their submission. The task captain will use the script to generate output files according to the following format:&lt;br /&gt;
&lt;br /&gt;
'''Usage'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
acc_gen.sh &amp;quot;/path/to/input.json&amp;quot; &amp;quot;/path/to/output_folder&amp;quot; n_sample&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Input File: Path to the input .json file.&lt;br /&gt;
* Output Folder: Path to the folder where the generated output files will be saved.&lt;br /&gt;
* n_sample: Number of samples to generate.&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
* The script should generate n_sample output files in the specified output folder.&lt;br /&gt;
* Output files should be named sequentially as sample_01.json, sample_02.json, ..., up to sample_n_sample.json.&lt;br /&gt;
&lt;br /&gt;
Participants are free to implement the internal logic of the script, but it must adhere to this format for proper execution during the evaluation process.&lt;br /&gt;
&lt;br /&gt;
'''Packaging Submissions'''&lt;br /&gt;
* Every submission must be packed into a docker image&lt;br /&gt;
* Every submission will be deployed and evaluated automatically with &amp;lt;code&amp;gt;docker run&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Accepted submission form'''&lt;br /&gt;
* Link to public or private Github repository&lt;br /&gt;
* Link to public or private docker hub&lt;br /&gt;
* Shared google drive links&lt;br /&gt;
* If the repository is private, an access token is also required&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Baselines=&lt;br /&gt;
&lt;br /&gt;
To establish a benchmark for this task, we have several baseline options, subjective to the availability of official implementations:&lt;br /&gt;
&lt;br /&gt;
'''WholeSongGen''' (Wang et al., 2024)&lt;br /&gt;
&lt;br /&gt;
This model generates piano accompaniment using a diffusion model.&lt;br /&gt;
&lt;br /&gt;
'''AccoMontage''' (Zhao et al., 2020)&lt;br /&gt;
&lt;br /&gt;
This algorithm generates piano accompaniment using a combination of rule-based search and deep representation learning.&lt;br /&gt;
&lt;br /&gt;
'''Compound Word Transformer''' (Hsiao et al., 2021)&lt;br /&gt;
&lt;br /&gt;
This model generates piano performance using a Transformer-based architecture. &lt;br /&gt;
&lt;br /&gt;
'''Anticipatory Music Transformer''' (Thickstun et al., 2024)&lt;br /&gt;
&lt;br /&gt;
A Transformer-based model for track infilling and accompaniment generation. While the current implementation does not explicitly consider chord input, it remains a relevant starting point to study with.&lt;br /&gt;
&lt;br /&gt;
'''PopMAG''' (Ren et al., 2020)&lt;br /&gt;
&lt;br /&gt;
A Transformer-based model for multi-track accompaniment generation.&lt;br /&gt;
&lt;br /&gt;
=Contacts=&lt;br /&gt;
If you any questions or suggestions about the task, please contact:&lt;br /&gt;
* Ziyu Wang: ziyu.wang&amp;lt;at&amp;gt;nyu.edu&lt;br /&gt;
* Jingwei Zhao: jzhao&amp;lt;at&amp;gt;u.nus.edu&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=References=&lt;br /&gt;
* Harte, C. (2010). Towards automatic extraction of harmony information from music signals (Doctoral dissertation).* Lu, P., Xu, X., Kang, C., Yu, B., Xing, C., Tan, X., &amp;amp; Bian, J. (2023). Musecoco: Generating symbolic music from text. arXiv preprint arXiv:2306.00110.&lt;br /&gt;
* Wang, Z., Min, L., &amp;amp; Xia, G. Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models. In The Twelfth International Conference on Learning Representations.&lt;br /&gt;
* Jingwei Zhao, &amp;amp; Gus Xia (2021). AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer. In Proceedings of the 22nd International Society for Music Information Retrieval Conference, ISMIR 2021, Online, November 7-12, 2021 (pp. 833–840).&lt;br /&gt;
* Thickstun, J., Hall, D., Donahue, C., &amp;amp; Liang, P. (2023). Anticipatory music transformer. arXiv preprint arXiv:2306.08620.&lt;br /&gt;
* Code and data format samples: [https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main https://github.com/ZZWaang/acc-gen-8bar-wholesong/tree/main]&lt;/div&gt;</summary>
		<author><name>Zhaojw1998</name></author>
		
	</entry>
</feed>