Show simple item record

dc.contributor.authorNarravula, Goutham
dc.date.accessioned2021-12-17T19:30:36Z
dc.date.available2021-12-17T19:30:36Z
dc.date.issued2021-12-17T19:30:36Z
dc.identifier.urihttp://hdl.handle.net/10222/81119
dc.description.abstractIn oil industry, drilling reports play a vital role in documenting critical events on a drilling rig. Information in these reports will help foresee drilling risks and mitigate unwanted surprises beforehand, significantly reducing development costs and saving time for future projects. Manually going through thousands of reports can be time-consuming and laborious. This thesis proposes an approach for extracting human-interpretable topics that can best summarize clusters of reports using state-of-the-art text embedding techniques. Generated topics are used to optimize the existing information retrieval system. Due to various complexities of text, conventional data preprocessing and traditional topic models could not produce desired results. Hence, we propose an approach that uses distributed representations to capture semantic and syntactic context from a small, domain-specific dataset. Industry experts reviewed generated topics to examine topic diversity and assign appropriate labels. Detailed analysis shows that our results are more coherent and diverse than traditional methods.en_US
dc.language.isoenen_US
dc.subjectTopic Modelen_US
dc.subjectText Embeddingen_US
dc.subjectOil and Gasen_US
dc.titleText Embedding Based Topic Modeling on Noisy Historical Drilling Dataen_US
dc.date.defence2021-12-14
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.degreeMaster of Computer Scienceen_US
dc.contributor.external-examinerN/Aen_US
dc.contributor.graduate-coordinatorDr. Michael McAllisteren_US
dc.contributor.thesis-readerDr. Evangelos Miliosen_US
dc.contributor.thesis-readerDr. Nur Zincir-Heywooden_US
dc.contributor.thesis-supervisorDr. Vlado Keseljen_US
dc.contributor.thesis-supervisorDr. Dijana Kosmajacen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.copyright-releaseNot Applicableen_US
 Find Full text

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record