Repository logo
 

Testing adequacy of codon substitution models

Date

2024-12-15

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

In phylogenetic inference, codon substitution models are mainly used to detect positive selection, which is a sign of adaptive molecular evolution at the protein level. Positive selection is identified when non-synonymous substitutions are more frequent than synonymous ones. To model the evolution of amino acid and codon sequences, Markov chains can be used. It's important to test these models for adequacy before drawing phylogenetic conclusions, as inadequate models can lead to unreliable results and incorrect biological interpretations. This thesis introduces several methods to evaluate the adequacy of codon substitution models, such as Pearson's Chisq test with two alternative strategies for binning site patterns; influence matrix based binning and random binning; and the Anderson-Darling test. These methods help determine whether the proposed model effectively fits the data, thereby assessing the reliability of conclusions derived from it.

Description

Keywords

Model adequacy test, Codon substitution model, Pearson’s goodness-of-fit test

Citation