Show simple item record

dc.contributor.authorGheibi, Mirerfan
dc.date.accessioned2021-12-13T14:40:35Z
dc.date.available2021-12-13T14:40:35Z
dc.date.issued2021-12-13T14:40:35Z
dc.identifier.urihttp://hdl.handle.net/10222/81075
dc.description.abstractWe develop a computer vision system to help biologists detect endangered whales. Given access to a limited dataset of aerial imagery (1544 images of mainly water), we implemented object detection and semantic segmentation models. For segmentation, we leverage the extreme data imbalance by introducing an elliptic annotation mechanism mitigating the need for tight annotations while still constrained by expert annotators' available time. Data scarcity made zero-false-negative rate infeasible, so we minimized false negatives while having few enough false positives that it could still help an expert annotator accelerate the annotation process itself. This would allow a bootstrapping dataset creation approach: collecting increasingly larger datasets in parallel with training increasingly accurate models. We evaluated performance for the downstream bootstrapping task with an AI-in-the-Loop experiment. Motivated by the expert user's workflow, this required developing a feature-based clustering visualization of the images. Our segmentation system admitted few false negatives and was more efficient than manually data collection alone. While the proposed approach cannot entirely solve the challenge of the extremely small dataset, it suggests that a slightly larger dataset (e.g. adding 100 whale images would double the relevant training set) may be sufficient to bootstrap the training and collection with effectively no false negatives.en_US
dc.language.isoenen_US
dc.subjectDeep Learningen_US
dc.subjectComputer Visionen_US
dc.subjectSemantic Segmentationen_US
dc.subjectObject Detectionen_US
dc.subjectAerial Imageryen_US
dc.subjectAI-in-the-Loopen_US
dc.subjectHuman-in-the-Loopen_US
dc.subjectFaster R-CNNen_US
dc.subjectObject Annotationen_US
dc.subjectWhale Detectionen_US
dc.subjectImage Clusteringen_US
dc.subjectMarine Animal Detectionen_US
dc.subjectBootstrapping Dataset Creationen_US
dc.subjectDataset Creationen_US
dc.titleHelping Biologists Find Whales: AI-in-the-Loop Support for Environmental Dataset Creationen_US
dc.date.defence2021-11-30
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.degreeMaster of Computer Scienceen_US
dc.contributor.external-examinern/aen_US
dc.contributor.graduate-coordinatorMichael McAllisteren_US
dc.contributor.thesis-readerStan Matwinen_US
dc.contributor.thesis-readerThomas Trappenbergen_US
dc.contributor.thesis-supervisorSageev Ooreen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.copyright-releaseNot Applicableen_US
 Find Full text

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record