Show simple item record

dc.contributor.authorMalik, Disha
dc.date.accessioned2022-03-30T14:54:57Z
dc.date.available2022-03-30T14:54:57Z
dc.date.issued2022-03-30T14:54:57Z
dc.identifier.urihttp://hdl.handle.net/10222/81501
dc.description.abstractIn open-ended surveys, participant answers that do not give any legitimate answer or opinion to the question being asked are called no-opinion responses. We consider the problem of detection of no-opinion answers in the CLSA dataset using a Machine Learning approach. The CLSA dataset contains verbatim responses from over 51,000 participants to the question of what promotes healthy aging. Our foremost goal is to clean the CLSA dataset to help foster the healthy aging study and pave a healthier way forward for the future generations. This thesis investigates the performance of existing state-of-the-art approaches, using distance measures coupled with embeddings and Active Learning to cluster and classify no-opinion responses. Among the unsupervised techniques we obtained the best performance using the BERT embeddings with Euclidean Distance. We also show that the Active Learning approach is a viable approach to identify no-opinion responses in a large survey, and in our experiments, the SVM based classifier had the best performance of 0.97 in the AUC score of the PR curve. Using this approach we identified 1157 instances of no-opinion responses in the CLSA dataset.en_US
dc.language.isoenen_US
dc.subjectActive Learningen_US
dc.subjectCLSAen_US
dc.titleDETECTING NO-OPINION RESPONSES IN THE CANADIAN LONGITUDINAL STUDY ON AGING (CLSA) DATASET USING UNSUPERVISED METHODS AND ACTIVE LEARNINGen_US
dc.date.defence2022-03-11
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.degreeMaster of Computer Scienceen_US
dc.contributor.external-examinern/aen_US
dc.contributor.graduate-coordinatorDr. Michael McAllisteren_US
dc.contributor.thesis-readerDr. Evangelos Miliosen_US
dc.contributor.thesis-readerDr. Srinivas Sampallien_US
dc.contributor.thesis-supervisorDr. Vlado Keseljen_US
dc.contributor.thesis-supervisorDr. Dijana Kosmajacen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.copyright-releaseNoen_US
 Find Full text

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record