Show simple item record

dc.contributor.authorZou, Liwen
dc.date.accessioned2011-09-07T17:34:02Z
dc.date.available2011-09-07T17:34:02Z
dc.date.issued2011-09-07
dc.identifier.urihttp://hdl.handle.net/10222/14248
dc.description.abstractThis thesis considers two distinct topics in phylogenetic analysis. The first is construction of empirical rate matrices for amino acid models. The second topic, which constitutes the majority of the thesis, involves analysis of and extensions to the BH model of Barry and Hartigan (1987). There are a number of rate matrices used for phylogenetic analysis including the PAM (Dayhoff et al. 1979), JTT (Jones et al. 1992) and WAG (Whelan and Goldman 2001). The construction of each of these has difficulties. To avoid adjusting for multiple substitutions, the PAM and JTT matrices were constructed using only a subset of the data consisting of closely related species. The WAG model used an incomplete maximum likelihood estimation to reduce computational cost. We develop a modification of the pairwise methods first described in Arvestad and Bruno that better adjusts for some of the sparseness difficulties that arise with amino acid data. The BH model is very flexible, allowing separate discrete-time Markov processes to occur along different edges. We show, however, that an identifiability problem arises for the BH model making it difficult to estimate character state frequencies at internal nodes. To obtain such frequencies and edge-lengths for BH model fits, we define a nonstationary GTR (NSGTR) model along an edge, and find the NSGTR model that best approximates the fitted BH model. The NSGTR model is slightly more restrictive but allows for estimation of internal node frequencies and interpretable edge lengths. While adjusting for rates-across-sites variation is now common practice in phylogenetic analyses, it is widely recognized that in reality evolutionary processes can change over both sites and lineages. As an adjustment for this, we introduce a BH mixture model that not only allows completely different models along edges of a topology, but also allows for different site classes whose evolutionary dynamics can take any form.en_US
dc.language.isoenen_US
dc.subjectBH mixture modelen_US
dc.subjectNonstationaryen_US
dc.subjectRate matrixen_US
dc.subjectIdentifiabilityen_US
dc.subjectNSGTR modelen_US
dc.subjectEdge lengthen_US
dc.subjectBH modelen_US
dc.titleConstruction of amino acid rate matrices and extensions of the Barry and Hartigan model for phylogenetic inferenceen_US
dc.date.defence2011-08-09
dc.contributor.departmentDepartment of Mathematics & Statistics - Statistics Divisionen_US
dc.contributor.degreeDoctor of Philosophyen_US
dc.contributor.external-examinerDr. Elisabeth Tillieren_US
dc.contributor.graduate-coordinatorDr. Edward Suskoen_US
dc.contributor.thesis-readerDr. Robert Beikoen_US
dc.contributor.thesis-readerDr. Christian Blouinen_US
dc.contributor.thesis-supervisorDr. Chris Field, Dr. Andrew J. Roger,Dr. Edward Suskoen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.copyright-releaseNot Applicableen_US
 Find Full text

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record