MULTI-TASK LEARNING FOR BEAT DETECTION ON MIDI PERFORMANCES
| dc.contributor.author | Torabi Ghazvini, Hananeh | |
| dc.contributor.copyright-release | Not Applicable | |
| dc.contributor.degree | Master of Computer Science | |
| dc.contributor.department | Faculty of Computer Science | |
| dc.contributor.ethics-approval | Not Applicable | |
| dc.contributor.external-examiner | na | |
| dc.contributor.manuscripts | Not Applicable | |
| dc.contributor.thesis-reader | Dr. Daniel Oore | |
| dc.contributor.thesis-reader | Dr. Marta Kryven | |
| dc.contributor.thesis-supervisor | Dr. Sageev Oore | |
| dc.date.accessioned | 2025-08-21T18:06:54Z | |
| dc.date.available | 2025-08-21T18:06:54Z | |
| dc.date.defence | 2025-08-12 | |
| dc.date.issued | 2025-08-19 | |
| dc.description | This thesis explores beat detection in symbolic music as a key problem in music information retrieval. It presents a multitask transformer architecture designed to predict beat positions in classical piano performances with expert-provided beat annotations. The model leverages auxiliary sequence-to-sequence tasks to enhance shared representations across encoders and decoders, enhancing performance on beat detection. In addition, experiments demonstrate the effectiveness of incorporating BERT as an encoder, showing that pretrained contextual representations can further improve the performance. The work combines quantitative evaluation with qualitative analysis and highlights the challenges of evaluating beat detection in symbolic music. | |
| dc.description.abstract | Beat detection is a fundamental task in music information retrieval as it necessarily requires understanding of musical structure that is essential for other tasks such as music transcription and harmonic analysis. In this thesis, we propose a transformer architecture within a multi-task framework to learn to predict beat positions in musical sequences using symbolic representations. We train our model on a dataset consisting of performances of classical piano music, where beat annotations have been provided by musical professionals. Based on principles of transfer and representation learning, we select a collection of sequence to sequence tasks that benefit from having shared weights in the encoders and decoders of the model. We show the effectiveness of including additional training tasks, and we provide both quantitative evaluation (using F-measures) and qualitative evaluation (analysis of examples). We also discuss the challenges of evaluating the beat detection task. | |
| dc.identifier.uri | https://hdl.handle.net/10222/85360 | |
| dc.language.iso | en | |
| dc.subject | Beat Detection | |
| dc.subject | MIDI Performances | |
| dc.subject | Classical Piano Music | |
| dc.subject | Multi-task Learning | |
| dc.subject | Transformer | |
| dc.subject | Transfer Learning | |
| dc.title | MULTI-TASK LEARNING FOR BEAT DETECTION ON MIDI PERFORMANCES |
