Show simple item record

dc.contributor.authorEtemad, Elham
dc.date.accessioned2018-12-13T11:52:50Z
dc.date.available2018-12-13T11:52:50Z
dc.date.issued2018-12-13T11:52:50Z
dc.identifier.urihttp://hdl.handle.net/10222/75024
dc.description.abstractThe most essential technique in creating agents with the ability to process and understand the content of visual data is object recognition, which includes image content classi cation, and object localization. Deep convolutional neural networks' (CNNs) performance gain in computer vision, there still are application scenarios with limited training data and computing power for which using deep CNN based methods is not feasible. On the other hand, the human engineered image representations require less training data and computing power and can be enhanced by importing domain specif c knowledge. These representations may also bene fit from the human vision characteristics in reducing the gap between computed image representations and human vision perception. In this thesis we have proposed four methods to improve image classi cation and object localization. All these methods utilize the perceptual shape features of image since it is proved that the human vision perception on objects mostly relies on shape features of the objects, while color and texture are utilized as extra sources to complete this perception. In the rst method, we have created a static dictionary of perceptual shape features based on N-gram model and used that in combination with spatial pyramid matching to represent images. In the second method, a dynamic dictionary from image edge segments is formed where these segments are obtained from an octave of image in di erent scales. The third method considers the curve partitioning points as descriptive features of the image and created a dynamic dictionary from descriptors of these points. The proposed object localization method utilizes the perceptual shape features of the image to improve the location of objects determined by object recognition module. The initial location may be obtained by any object recognition method, then the proposed method iteratively merges the edge segments with the detected object using a best rst search strategy. These proposed methods have been evaluated on di erent benchmark image datasets. Judging on the overall performance of the proposed method, it is expected that the proposed methods would bring some useful alternatives to support e cient tool development for applications lacking training data or no training data at all.en_US
dc.language.isoenen_US
dc.subjectImage Representationen_US
dc.subjectPerceptual Featuresen_US
dc.subjectObject Recognitionen_US
dc.titlePERCEPTUAL SHAPE FEATURE BASED IMAGE CODING FOR VISUAL CONTENT CLASSIFICATION AND OBJECT RECOGNITIONen_US
dc.typeThesisen_US
dc.date.defence2018-11-14
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.degreeDoctor of Philosophyen_US
dc.contributor.external-examinerMinglun Gongen_US
dc.contributor.graduate-coordinatorMichael McAllisteren_US
dc.contributor.thesis-readerEvangelos Miliosen_US
dc.contributor.thesis-readerDerek Reillyen_US
dc.contributor.thesis-readerVlado Keseljen_US
dc.contributor.thesis-supervisorQigang Gaoen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.manuscriptsYesen_US
dc.contributor.copyright-releaseYesen_US
 Find Full text

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record