Show simple item record

dc.contributor.authorLiu, Chaoyue
dc.date.accessioned2022-11-18T15:06:02Z
dc.date.available2022-11-18T15:06:02Z
dc.date.issued2022-11-17
dc.identifier.urihttp://hdl.handle.net/10222/82080
dc.description.abstractPhylogenetic profiles, which summarize the presence and absence patterns of genes in a set of genomes, can be used to identify genes that have correlated evolutionary histories. However, comparative analysis of phylogenetic profiles should take into account the phylogenetic effect under consideration. In this study, we developed phylogenetic comparative methods to infer the gene coevolution. We first proposed an approach that uses Pagel's correlation model to infer the evolutionary similarities between genes and a hierarchical-clustering approach to define sets of genes with correlated distributions across the organisms. The results support the assumption of our work that the genes with correlated evolutionary histories tend to be functionally linked. However, Pagel's method is computationally expensive and tends to overestimate the signal of coevolution. We developed a new coevolutionary model - the Community Coevolution Model (CCM), which has the additional advantage of being able to examine multiple genes as a community to reveal a more complete picture of the dependency relationships. We also developed a simulation procedure to generate phylogenetic profiles of gene sets with correlated evolutionary trajectories and adjustable strength of interactions. The results show that CCM is more accurate than Pagel's method and other heuristic tree-aware methods and provides more biological insights such as the evolutionary rates, significance levels and directions (positive/negative) of interactions. We also developed a matrix decomposition-based method (Chisq-PLR), especially for large-scale analysis. Our method not only has computational speed that is competitive with other heuristic methods but also gives support to better biological explanations. This fast method can be used to preprocess large data sets to reduce the number of computations that need to be carried out by CCM or other mechanistic model based methods.en_US
dc.language.isoenen_US
dc.subjectPhylogenetic Profilesen_US
dc.subjectEvolutionary Modelen_US
dc.subjectStatisticsen_US
dc.subjectComparative Methodsen_US
dc.titleINFERENCE OF GENE COEVOLUTION BASED ON PHYLOGENETIC PROFILESen_US
dc.date.defence2022-09-20
dc.contributor.departmentDepartment of Mathematics & Statistics - Statistics Divisionen_US
dc.contributor.degreeDoctor of Philosophyen_US
dc.contributor.external-examinerDr. Andrew C. Doxeyen_US
dc.contributor.graduate-coordinatorDr. Joanna Mills Flemmingen_US
dc.contributor.thesis-readerDr. Lam Hoen_US
dc.contributor.thesis-readerDr. Tobias Kenneyen_US
dc.contributor.thesis-supervisorDr. Hong Guen_US
dc.contributor.thesis-supervisorDr. Robert Beikoen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.manuscriptsYesen_US
dc.contributor.copyright-releaseNot Applicableen_US
 Find Full text

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record