Repository logo
 

Phylogenetic analysis of multiple genes based on spectral methods

dc.contributor.authorAbeysundera, Melanie
dc.contributor.copyright-releaseNot Applicableen_US
dc.contributor.degreeDoctor of Philosophyen_US
dc.contributor.departmentDepartment of Mathematics & Statistics - Statistics Divisionen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.external-examinerDr. Matthew Spenceren_US
dc.contributor.graduate-coordinatorDr. David Hamiltonen_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.thesis-readerDr. Robert Beikoen_US
dc.contributor.thesis-readerDr. Bruce Smithen_US
dc.contributor.thesis-supervisorDr. Hong Gu and Dr. Chris Fielden_US
dc.date.accessioned2011-11-14T14:44:19Z
dc.date.available2011-11-14T14:44:19Z
dc.date.defence2011-10-28
dc.date.issued2011-11-14
dc.description.abstractMultiple gene phylogenetic analysis is of interest since single gene analysis often results in poorly resolved trees. Here the use of spectral techniques for analyzing multi-gene data sets is explored. The protein sequences are treated as categorical time series and a measure of similarity between a pair of sequences, the spectral covariance, is used to build trees. Unlike other methods, the spectral covariance method focuses on the relationship between the sites of genetic sequences. We consider two methods with which to combine the dissimilarity or distance matrices of multiple genes. The first method involves properly scaling the dissimilarity measures derived from different genes between a pair of species and using the mean of these scaled dissimilarity measures as a summary statistic to measure the taxonomic distances across multiple genes. We introduced two criteria for computing scale coefficients which can then be used to combine information across genes, namely the minimum variance (MinVar) criterion and the minimum coefficient of variation squared (MinCV) criterion. The scale coefficients obtained with the MinVar and MinCV criteria can then be used to derive a combined-gene tree from the weighted average of the distance or dissimilarity matrices of multiple genes. The second method is based on the singular value decomposition of a matrix made up of the p-vectors of pairwise distances for k genes. By decomposing such a matrix, we extract the common signal present in multiple genes to obtain a single tree representation of the relationship between a given set of taxa. Influence functions for the components of the singular value decomposition are derived to determine which genes are most influential in determining the combined-gene tree.en_US
dc.identifier.urihttp://hdl.handle.net/10222/14310
dc.language.isoenen_US
dc.subjectPhylogenetics, spectral analysis, spectral covariance, singular value decompositionen_US
dc.titlePhylogenetic analysis of multiple genes based on spectral methodsen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Abeysundera_Melanie_PhD_STAT_Oct_2011.pdf
Size:
18.53 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: