Embeddability and rate identifiability of Kimura 2-parameter matrices
Rights accessOpen Access
Deciding whether a substitution matrix is embeddable (i.e. the corresponding Markov process has a continuous-time realization) is an open problem even for 4×4 matrices. We study the embedding problem and rate identifiability for the K80 model of nucleotide substitution. For these 4×4 matrices, we fully characterize the set of embeddable K80 Markov matrices and the set of embeddable matrices for which rates are identifiable. In particular, we describe an open subset of embeddable matrices with non-identifiable rates. This set contains matrices with positive eigenvalues and also diagonal largest in column matrices, which might lead to consequences in parameter estimation in phylogenetics. Finally, we compute the relative volumes of embeddable K80 matrices and of embeddable matrices with identifiable rates. This study concludes the embedding problem for the more general model K81 and its submodels, which had been initiated by the last two authors in a separate work
The final publication is available at Springer via http://dx.doi.org/10.1007/s00285-019-01446-0
CitationCasanellas, M.; Fernández-Sánchez, J.; Roca, J. Embeddability and rate identifiability of Kimura 2-parameter matrices. "Journal of mathematical biology", 1 Març 2020, vol. 80, núm. 4, p. 995-1019.
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder