Show simple item record

dc.contributor.author: Traynor, Michael
dc.date.accessioned: 2018-12-14T16:19:32Z
dc.date.available: 2018-12-14T16:19:32Z
dc.date.issued: 2018-12-14T16:19:32Z
dc.identifier.uri: http://hdl.handle.net/10222/75034
dc.description.abstract: Paraphrase generation is a challenging problem that requires a semantic representation of language. Language models implemented with deep neural networks (DNN) can transform text into a real-valued vector space that captures useful semantic information. In light of this, this work employs hierarchical language modeling to produce semantic representations of sentences. An encoder-decoder model is used that consists of four components: a word encoder, a sentence encoder, a sentence decoder, and a word decoder. These components hierarchically convert a sentence from characters through word representations to a fixed-size sentence representation, then back down through words to characters. Many types of neural network are suitable for each component, and several are compared in this work, including a novel architecture, the Self Attentive Recurrent Array (SARAh). The SARAh is shown to perform at least as well as Gated Recurrent Units (GRU) and Transformers on language modeling tasks while requiring fewer parameters. These language models are trained on a large and diverse dataset, but this work also shows that such models can be fine-tuned to a particular domain, such as the works of a single author. The fine-tuned models leverage information learned from the larger dataset to perform better on the target domain. Finally, a language model is trained to produce semantic representations of sentences that are subsequently used to generate paraphrases in a completely unsupervised setting. The language model, originally trained to predict the sentence most likely to follow the input sentence, is fine-tuned to instead autoencode the input sentence. Because the sentence encoder produces a semantic representation, several techniques can be used to encourage the decoder to generate a paraphrase rather than reconstruct the exact input sentence. These techniques include adding noise to the sentence representation and sampling characters from the model's output layer. [en_US]
dc.language.iso: en [en_US]
dc.subject: Deep Learning [en_US]
dc.subject: Natural Language Processing [en_US]
dc.subject: Semantic Representation [en_US]
dc.subject: Language Modeling [en_US]
dc.subject: Paraphrase [en_US]
dc.title: UNSUPERVISED PARAPHRASE GENERATION FROM HIERARCHICAL LANGUAGE MODELS [en_US]
dc.date.defence: 2018-12-13
dc.contributor.department: Faculty of Computer Science [en_US]
dc.contributor.degree: Master of Computer Science [en_US]
dc.contributor.external-examiner: n/a [en_US]
dc.contributor.graduate-coordinator: Dr Michael McAllister [en_US]
dc.contributor.thesis-reader: Dr Sageev Oore [en_US]
dc.contributor.thesis-reader: Dr Robert Beiko [en_US]
dc.contributor.thesis-supervisor: Dr Thomas Trappenberg [en_US]
dc.contributor.ethics-approval: Not Applicable [en_US]
dc.contributor.manuscripts: No [en_US]
dc.contributor.copyright-release: Not Applicable [en_US]
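
To make the pipeline described in the abstract more concrete, what follows is a minimal sketch of a hierarchical autoencoder of the kind outlined there: characters are encoded into word vectors, word vectors into a fixed-size sentence vector, and the sentence vector is decoded back down through words to characters, with noise added to the sentence representation and characters sampled from the output distribution to encourage paraphrases rather than exact reconstruction. This is not the thesis implementation: the use of PyTorch and GRUs, the module names, and all dimensions are illustrative assumptions (the thesis compares GRUs, Transformers, and the proposed SARAh for each component).

    # Minimal sketch only; all names and sizes are assumptions, not the thesis code.
    import torch
    import torch.nn as nn

    class HierarchicalAutoencoder(nn.Module):
        def __init__(self, n_chars=256, char_dim=64, word_dim=256, sent_dim=512):
            super().__init__()
            self.char_emb = nn.Embedding(n_chars, char_dim)
            # Word encoder: reads the characters of one word into a word vector.
            self.word_enc = nn.GRU(char_dim, word_dim, batch_first=True)
            # Sentence encoder: reads the word vectors into a fixed-size sentence vector.
            self.sent_enc = nn.GRU(word_dim, sent_dim, batch_first=True)
            # Sentence decoder: unrolls the sentence vector back into word vectors.
            self.sent_dec = nn.GRU(word_dim, sent_dim, batch_first=True)
            self.to_word = nn.Linear(sent_dim, word_dim)
            # Word decoder: unrolls each word vector back into character logits.
            self.word_dec = nn.GRU(char_dim, word_dim, batch_first=True)
            self.to_char = nn.Linear(word_dim, n_chars)

        def encode(self, char_ids):
            # char_ids: (batch, words, chars_per_word) integer character indices.
            b, w, c = char_ids.shape
            chars = self.char_emb(char_ids.reshape(b * w, c))       # (b*w, c, char_dim)
            _, word_vecs = self.word_enc(chars)                     # (1, b*w, word_dim)
            word_vecs = word_vecs.squeeze(0).reshape(b, w, -1)      # (b, w, word_dim)
            _, sent_vec = self.sent_enc(word_vecs)                  # (1, b, sent_dim)
            return sent_vec.squeeze(0)                              # fixed-size sentence representation

        def decode(self, sent_vec, n_words, n_chars_per_word):
            b, word_dim = sent_vec.size(0), self.to_word.out_features
            char_dim = self.char_emb.embedding_dim
            # Unroll the sentence vector into word vectors (zero inputs for brevity;
            # a real model would feed back its own predictions or use teacher forcing).
            steps = torch.zeros(b, n_words, word_dim)
            out, _ = self.sent_dec(steps, sent_vec.unsqueeze(0))    # (b, n_words, sent_dim)
            word_vecs = self.to_word(out)                           # (b, n_words, word_dim)
            # Unroll each word vector into character logits.
            steps = torch.zeros(b * n_words, n_chars_per_word, char_dim)
            out, _ = self.word_dec(steps, word_vecs.reshape(1, b * n_words, word_dim))
            return self.to_char(out).reshape(b, n_words, n_chars_per_word, -1)

        def paraphrase_logits(self, char_ids, noise_std=0.1):
            # Paraphrasing trick from the abstract: perturb the sentence representation
            # before decoding; characters are then sampled (not argmaxed) downstream.
            sent_vec = self.encode(char_ids)
            sent_vec = sent_vec + noise_std * torch.randn_like(sent_vec)
            b, w, c = char_ids.shape
            return self.decode(sent_vec, w, c)

    # Usage on a toy batch: 2 sentences, 5 words each, 8 characters per word.
    model = HierarchicalAutoencoder()
    batch = torch.randint(0, 256, (2, 5, 8))
    logits = model.paraphrase_logits(batch)                         # (2, 5, 8, 256)
    sampled = torch.distributions.Categorical(logits=logits).sample()  # sampled paraphrase characters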