Practical Application of Large Language Models in the Nuclear Power Generation Industry
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Nuclear power organizations steward large, fragmented, and partially digitized textual
corpora that impede knowledge access and complicate decision support. Effective
application of large language models (LLMs) is further hindered by dense, site-specific
Canada Deuterium Uranium (CANDU) jargon and multi-definition acronyms
underrepresented in public training data. Existing applied nuclear LLM efforts emphasize
isolated retrieval or proprietary model announcements, leaving gaps in integrated,
auditable workflows for safety classification, terminology normalization, and structured
outputs. This thesis contributes a deployment-first, prompt-centric framework comprising:
(i) a GPT based Station Condition Record (SCR) safety event scoring approach
emphasizing balanced recall/precision over an imbalanced corpus; (ii) an ensemble jargon
detection and expansion pipeline combining LLM heuristics with deterministic and
probabilistic methods; and (iii) structured output / function-calling patterns that enhance
traceability and governance readiness when mining data from structured nuclear databases
using hybrid NL-to-SQL techniques. Collectively, the results provide an evidence-based
blueprint indicating when prompt engineering plus glossary normalization can defer costly
domain pretraining, complementing parallel work on retrieval augmentation and secure
local experimentation.
Description
Keywords
nuclear, llm
