Practical Application of Large Language Models in the Nuclear Power Generation Industry

de Costa, Mishca

Practical Application of Large Language Models in the Nuclear Power Generation Industry

Files

MishcadeCosta2025.pdf (4.48 MB)

Date

2025-12-04

Authors

de Costa, Mishca

Abstract

Nuclear power organizations steward large, fragmented, and partially digitized textual corpora that impede knowledge access and complicate decision support. Effective application of large language models (LLMs) is further hindered by dense, site-specific Canada Deuterium Uranium (CANDU) jargon and multi-definition acronyms underrepresented in public training data. Existing applied nuclear LLM efforts emphasize isolated retrieval or proprietary model announcements, leaving gaps in integrated, auditable workflows for safety classification, terminology normalization, and structured outputs. This thesis contributes a deployment-first, prompt-centric framework comprising: (i) a GPT based Station Condition Record (SCR) safety event scoring approach emphasizing balanced recall/precision over an imbalanced corpus; (ii) an ensemble jargon detection and expansion pipeline combining LLM heuristics with deterministic and probabilistic methods; and (iii) structured output / function-calling patterns that enhance traceability and governance readiness when mining data from structured nuclear databases using hybrid NL-to-SQL techniques. Collectively, the results provide an evidence-based blueprint indicating when prompt engineering plus glossary normalization can defer costly domain pretraining, complementing parallel work on retrieval augmentation and secure local experimentation.

Keywords

nuclear, llm

URI

https://hdl.handle.net/10222/85546

Collections

Faculty of Graduate Studies Online Theses

Full item page

Practical Application of Large Language Models in the Nuclear Power Generation Industry

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections