dc.contributor.author | Narravula, Goutham | |
dc.date.accessioned | 2021-12-17T19:30:36Z | |
dc.date.available | 2021-12-17T19:30:36Z | |
dc.date.issued | 2021-12-17T19:30:36Z | |
dc.identifier.uri | http://hdl.handle.net/10222/81119 | |
dc.description.abstract | In oil industry, drilling reports play a vital role in documenting critical events on a drilling rig. Information in these reports will help foresee drilling risks and mitigate unwanted surprises beforehand, significantly reducing development costs and saving time for future projects. Manually going through thousands of reports can be time-consuming and laborious. This thesis proposes an approach for extracting human-interpretable topics that can best summarize clusters of reports using state-of-the-art text embedding techniques. Generated topics are used to optimize the existing information retrieval system. Due to various complexities of text, conventional data preprocessing and traditional topic models could not produce desired results. Hence, we propose an approach that uses distributed representations to capture semantic and syntactic context from a small, domain-specific dataset. Industry experts reviewed generated topics to examine topic diversity and assign appropriate labels. Detailed analysis shows that our results are more coherent and diverse than traditional methods. | en_US |
dc.language.iso | en | en_US |
dc.subject | Topic Model | en_US |
dc.subject | Text Embedding | en_US |
dc.subject | Oil and Gas | en_US |
dc.title | Text Embedding Based Topic Modeling on Noisy Historical Drilling Data | en_US |
dc.date.defence | 2021-12-14 | |
dc.contributor.department | Faculty of Computer Science | en_US |
dc.contributor.degree | Master of Computer Science | en_US |
dc.contributor.external-examiner | N/A | en_US |
dc.contributor.graduate-coordinator | Dr. Michael McAllister | en_US |
dc.contributor.thesis-reader | Dr. Evangelos Milios | en_US |
dc.contributor.thesis-reader | Dr. Nur Zincir-Heywood | en_US |
dc.contributor.thesis-supervisor | Dr. Vlado Keselj | en_US |
dc.contributor.thesis-supervisor | Dr. Dijana Kosmajac | en_US |
dc.contributor.ethics-approval | Not Applicable | en_US |
dc.contributor.manuscripts | Not Applicable | en_US |
dc.contributor.copyright-release | Not Applicable | en_US |