MICROBLOG TEXT PARSING: A COMPARISON OF STATE-OF-THE-ART PARSERS

Abbas, Syed Muhammad Faisal

MICROBLOG TEXT PARSING: A COMPARISON OF STATE-OF-THE-ART PARSERS

dc.contributor.author	Abbas, Syed Muhammad Faisal
dc.contributor.copyright-release	Not Applicable	en_US
dc.contributor.degree	Master of Computer Science	en_US
dc.contributor.department	Faculty of Computer Science	en_US
dc.contributor.ethics-approval	Not Applicable	en_US
dc.contributor.external-examiner	n/a	en_US
dc.contributor.graduate-coordinator	Dr. Evangelos Milios	en_US
dc.contributor.manuscripts	Not Applicable	en_US
dc.contributor.thesis-reader	Dr. Denis Riordan	en_US
dc.contributor.thesis-reader	Dr. Evangelos Milios	en_US
dc.contributor.thesis-supervisor	Dr. Vlado Keselj	en_US
dc.date.accessioned	2015-09-02T13:06:46Z
dc.date.available	2015-09-02T13:06:46Z
dc.date.defence	2015-07-14
dc.date.issued	2015
dc.description.abstract	Parsing is a natural language processing task in which relationships between words are deduced. It is essential for higher levels of semantic analysis, especially when predicates are required to be extracted from the text. Parsing is a widely established task and much effort has been put into devising good methods for it, which has resulted in reasonably accurate processing of this task. However, most of the work has been limited to formally written text such as news articles or discussion groups. Microblog text is a significant body of text that is written by laypeople in quite an informal language which is significantly different from formal written language so as to require special considerations. There are many applications in the area of analysis of microblog text that require high-quality and fast parsing, such as identification of user intentions. Dealing with large amount microblog text, we need to consider the running-time performance of the methods for many reasons: the amount of microblog text is huge and the pace new text is being generated is insurmountable, as well as the life span of its significance is very short. In this thesis we evaluated various parsers and their parsing performance as it relates to microblog text: we evaluated eight (8) state of the art parsers, five (5) of these parsers are inherently constituency (Phrase-Structure) parsers, while three (3) of them are dependency parsers. We compared all of the parsers after converting the output of constituency parsers to dependency trees and evaluated the performances using Unlabelled Attachment Score (UAS). In addition we compared the constituency parsers using PARSEVAL and FREVAL measures. Finally, we evaluated the selected parsers for their running-time performance as well.	en_US
dc.identifier.uri	http://hdl.handle.net/10222/61678
dc.language.iso	en	en_US
dc.subject	Microblog	en_US
dc.subject	Parsing	en_US
dc.subject	Parser	en_US
dc.subject	Twitter	en_US
dc.title	MICROBLOG TEXT PARSING: A COMPARISON OF STATE-OF-THE-ART PARSERS	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Abbas-SyedMuhammadFaisal-MCSc-July-2015.pdf
Size:: 351.14 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Faculty of Graduate Studies Online Theses