Repository logo

MULTI-TASK LEARNING FOR BEAT DETECTION ON MIDI PERFORMANCES

dc.contributor.authorTorabi Ghazvini, Hananeh
dc.contributor.copyright-releaseNot Applicable
dc.contributor.degreeMaster of Computer Science
dc.contributor.departmentFaculty of Computer Science
dc.contributor.ethics-approvalNot Applicable
dc.contributor.external-examinerna
dc.contributor.manuscriptsNot Applicable
dc.contributor.thesis-readerDr. Daniel Oore
dc.contributor.thesis-readerDr. Marta Kryven
dc.contributor.thesis-supervisorDr. Sageev Oore
dc.date.accessioned2025-08-21T18:06:54Z
dc.date.available2025-08-21T18:06:54Z
dc.date.defence2025-08-12
dc.date.issued2025-08-19
dc.descriptionThis thesis explores beat detection in symbolic music as a key problem in music information retrieval. It presents a multitask transformer architecture designed to predict beat positions in classical piano performances with expert-provided beat annotations. The model leverages auxiliary sequence-to-sequence tasks to enhance shared representations across encoders and decoders, enhancing performance on beat detection. In addition, experiments demonstrate the effectiveness of incorporating BERT as an encoder, showing that pretrained contextual representations can further improve the performance. The work combines quantitative evaluation with qualitative analysis and highlights the challenges of evaluating beat detection in symbolic music.
dc.description.abstractBeat detection is a fundamental task in music information retrieval as it necessarily requires understanding of musical structure that is essential for other tasks such as music transcription and harmonic analysis. In this thesis, we propose a transformer architecture within a multi-task framework to learn to predict beat positions in musical sequences using symbolic representations. We train our model on a dataset consisting of performances of classical piano music, where beat annotations have been provided by musical professionals. Based on principles of transfer and representation learning, we select a collection of sequence to sequence tasks that benefit from having shared weights in the encoders and decoders of the model. We show the effectiveness of including additional training tasks, and we provide both quantitative evaluation (using F-measures) and qualitative evaluation (analysis of examples). We also discuss the challenges of evaluating the beat detection task.
dc.identifier.urihttps://hdl.handle.net/10222/85360
dc.language.isoen
dc.subjectBeat Detection
dc.subjectMIDI Performances
dc.subjectClassical Piano Music
dc.subjectMulti-task Learning
dc.subjectTransformer
dc.subjectTransfer Learning
dc.titleMULTI-TASK LEARNING FOR BEAT DETECTION ON MIDI PERFORMANCES

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
HananehTorabiGhazvini2025.pdf
Size:
5.01 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.12 KB
Format:
Item-specific license agreed upon to submission
Description: