Non-uniform Language Detection in Technical Writing

Wang, Weibo Jr

View/Open

Wang-Weibo-MSc-CSCI-April-2016.pdf (290.1Kb)

Date

2016-04-21

Author

Wang, Weibo Jr

Metadata

Show full item record

Abstract

Technical writing in professional environments, such as user manual authoring, requires uniform language. Non-uniform language detection is a novel task, which aims to guarantee the consistency for technical writing by detecting sentences in a document that are intended to have the same meaning within a similar context but use different words/writing style. This thesis proposes an approach that utilizes text similarity algorithms at lexical, syntactic, semantic and pragmatic levels. Different metrics are integrated by applying a machine learning classification method. We tested our method using smart phone user manuals, and compared the performance against the state-of-the-art methods in related area. The experiments demonstrate our approach is the most efficient solution to date.

URI

http://hdl.handle.net/10222/71479

Subject

Collections

Faculty of Graduate Studies Online Theses

Find Full text