Applied Functional Data Classification

dc.contributor.author: Babyn, Jonathan
dc.contributor.copyright-release: Not Applicable (en_US)
dc.contributor.degree: Master of Science (en_US)
dc.contributor.department: Department of Mathematics & Statistics - Statistics Division (en_US)
dc.contributor.ethics-approval: Received (en_US)
dc.contributor.external-examiner: n/a (en_US)
dc.contributor.graduate-coordinator: Joanna Mills Flemming (en_US)
dc.contributor.manuscripts: Not Applicable (en_US)
dc.contributor.thesis-reader: Bruce Smith (en_US)
dc.contributor.thesis-reader: Stephen Graham (en_US)
dc.contributor.thesis-supervisor: Joanna Mills Flemming (en_US)
dc.date.accessioned: 2018-03-29T18:42:43Z
dc.date.available: 2018-03-29T18:42:43Z
dc.date.defence: 2018-03-22
dc.date.issued: 2018-03-29T18:42:43Z
dc.description.abstract: Picomole is a New Brunswick based company developing a lung cancer diagnosis system based on a person’s breath sample. Picomole conducted two different studies in an effort to ascertain whether their breath analysis system utilizing cavity ringdown laser spectroscopy is capable of determining whether or not a subject has lung cancer. One of the resulting datasets had a very large percentage of non-random missing data which is explored in detail. Most breath analysis systems operate by trying to determine the makeup of volatile organic compounds and see if any are known signs of cancer. By contrast, the work done here is based entirely on statistical learning methods. Spectroscopy data is naturally a curve, as the concentrations of compounds are measured over a series of infrared wavelengths. This kind of data is referred to as functional data for which there exist unique techniques for dealing with problems specific to it. This motivated the consideration of techniques including Functional Principal Component Analysis, Functional Linear Discriminant Analysis and DD^G plots. Classification trees and random forests which have previously shown success on spectroscopy data were also explored. Classification trees, DD^G plots and Functional Linear Discriminant Analysis were found to be able to correctly classify subjects with accuracy greater than random guessing. (en_US)
dc.identifier.uri: http://hdl.handle.net/10222/73798
dc.language.iso: en (en_US)
dc.subject: functional data analysis (en_US)
dc.subject: classification (en_US)
dc.title: Applied Functional Data Classification (en_US)

Files

Original bundle

Name: Babyn-Jonathan-MSc-STAT-March-2018.pdf
Size: 898.25 KB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission