Repository logo
 

NOVEL APPLICATIONS OF RANDOM FOREST FOR EXPLORING POPULATION STRUCTURE OF ATLANTIC SALMON (SALMO SALAR) IN LABRADOR, CANADA

dc.contributor.authorSylvester, Emma
dc.contributor.copyright-releaseNot Applicableen_US
dc.contributor.degreeMaster of Scienceen_US
dc.contributor.departmentDepartment of Computational Biology and Bioinformaticsen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.external-examinern/aen_US
dc.contributor.graduate-coordinatorRobert G Beikoen_US
dc.contributor.manuscriptsYesen_US
dc.contributor.thesis-readerIan R Bradburyen_US
dc.contributor.thesis-readerChristophe Herbingeren_US
dc.contributor.thesis-supervisorRobert G Beikoen_US
dc.contributor.thesis-supervisorPaul Bentzenen_US
dc.date.accessioned2017-08-28T16:25:02Z
dc.date.available2017-08-28T16:25:02Z
dc.date.defence2017-08-10
dc.date.issued2017-08-28T16:25:02Z
dc.description.abstractThe detection of population-genetic structure is useful for understanding patterns of gene flow, population distribution, and wildlife management and conservation. In this work, we examine approaches for inferring the modern genetic structure of Atlantic salmon (Salmo salar). We explore the utility of machine-learning algorithms (random forest, regularized random forest, and guided regularized random forest) compared with FST-ranking for selection of single nucleotide polymorphisms (SNP) for fine-scale population assignment within a marine embayment, Lake Melville, Labrador. Using an unpublished SNP dataset for Atlantic salmon and validating our approaches with a published SNP data set for Alaskan Chinook salmon (Oncorhynchus tshawytscha), we demonstrate improved self-assignment accuracy and provide evidence of population structure consistent with F-statistics. We compare the level of population structure in greater Labrador that is resolved using a preliminary panel of SNPs selected with guided regularized random forest with an established panel of 101 microsatellites. We ask if salmon originating from rivers draining into Lake Melville show evidence of discrete genetic population structure relative to those outside of the embayment. Finally, we investigate environmental parameters associated with the observed genetic structure and seek to explain the mechanisms driving genetic differentiation in the area. We highlight the potential for applications of machine-learning approaches in population genetics and uncover fine-scale structure with potential impact on fisheries management techniques.en_US
dc.identifier.urihttp://hdl.handle.net/10222/73182
dc.language.isoenen_US
dc.subjectFisheries managementen_US
dc.subjectconservation geneticsen_US
dc.subjectrandom foresten_US
dc.subjectSNP selectionen_US
dc.subjectindividual assignmenten_US
dc.subjectpopulation geneticsen_US
dc.subjectlandscape geneticsen_US
dc.titleNOVEL APPLICATIONS OF RANDOM FOREST FOR EXPLORING POPULATION STRUCTURE OF ATLANTIC SALMON (SALMO SALAR) IN LABRADOR, CANADAen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Sylvester-Emma_MSc-CBBI-August-2017.pdf
Size:
2.65 MB
Format:
Adobe Portable Document Format
Description:
Main thesis

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: