VISUAL ANALYTICS OF RESEARCH COMMUNITY EXPERTISE IN SPACE AND TIME

Munjal, Deepak

dc.contributor.author	Munjal, Deepak
dc.date.accessioned	2019-08-14T18:23:09Z
dc.date.available	2019-08-14T18:23:09Z
dc.date.issued	2019-08-14T18:23:09Z
dc.identifier.uri	http://hdl.handle.net/10222/76251
dc.description.abstract	Association for Computing Machinery (ACM) is an international learned society for computing. ACM operates the Distinguished Speaker Program (ACMDSP). ACMDSP maintains a list of speakers, who can be invited to deliver lectures on Computer Science topics at different locations worldwide. Currently, speakers' lectures are classified into topics manually and ACMDSP committee accesses the speaker and lecture data directly through the database. This thesis is attempting to make it more intuitive to access the database through a visualization system, and in classifying the lectures on offer into topics. It uses Google Map to visualize the speaker, topic and lecture data. It displays the speaker's location and contact details on Google Map. Each lecture delivered by the speakers is assigned to one or more topics from the set of topics defined by the ACMDSP committee. The problem of categorizing lectures into topics is similar to the problem of categorizing research papers into topics. Hence, for each topic, we have manually associated a set of keywords from the NSERC list of research topics. These keywords are used to create training sets for each topic. Title and abstract information of these research papers along with a lecture topic are used to train the machine learning models, which classify each lecture title and abstract into one or more topics of a predefined topic structure. This thesis uses three document representations, based on bag of words, bag of concepts and bag of categories. We have used three consensus methods, which include linear regression, class with maximum probability and voting based. Each of these methods is a consensus method in itself and every individual consensus method forms an agreement to predict a topic. This thesis expanded on the previous classification model based on semantic representations of lecture titles/abstracts that can classify a large set of lectures into topics. Previous work used the topics to construct the training data. However, this thesis used the NSERC keywords to describe the ACMDSP topics and construct the training data. The classifier can predict up to three topics for a single Lecture.	en_US
dc.language.iso	en	en_US
dc.subject	classification	en_US
dc.subject	machine learning	en_US
dc.subject	visual analytics	en_US
dc.subject	text analytics	en_US
dc.title	VISUAL ANALYTICS OF RESEARCH COMMUNITY EXPERTISE IN SPACE AND TIME	en_US
dc.type	Thesis	en_US
dc.date.defence	2019-07-26
dc.contributor.department	Faculty of Computer Science	en_US
dc.contributor.degree	Master of Computer Science	en_US
dc.contributor.external-examiner	n/a	en_US
dc.contributor.graduate-coordinator	Michael Mcallister	en_US
dc.contributor.thesis-reader	Vlado Keselj	en_US
dc.contributor.thesis-reader	Luis Torgo	en_US
dc.contributor.thesis-supervisor	Evangelos Milios	en_US
dc.contributor.thesis-supervisor	Fernando Paulovich	en_US
dc.contributor.ethics-approval	Received	en_US
dc.contributor.manuscripts	Not Applicable	en_US
dc.contributor.copyright-release	Not Applicable	en_US

Find Full text

Files in this item

Name:: Munjal-Deepak-MSc-CS-July-2019.pdf
Size:: 1.849Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Faculty of Graduate Studies Online Theses

Show simple item record