Phylogenetic trees are an important tool for biologists to understand how biological entities evolve through time, and they are informed by data from molecular sequences of DNA or RNA. Tree space is complicated and high-dimensional, which makes it extremely hard to traverse. Moreover, it admits many representations, each with its strengths and weaknesses. These characteristics complicate the use of Markov Chain Monte Carlo (MCMC) methods for computing expectations with respect to probability measures defined on phylogenetic space. One possible strategy for performing and studying MCMC in phylogenetic space is to find lower-dimensional projections that preserve the Markov property, either exactly or approximately. In this talk, we will discuss how to perform such a dimension reduction in the space of rooted trees using lumpability or quasi-lumpability with respect to subtrees, also known as clades. We study Metropolis-Hastings and lazy random walks on the rooted subtree prune-and-regraft (rSPR) graph and give bounds on the lumping error and total variation with respect to clades.
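As a rough illustration of the lumpability idea mentioned above, the following minimal sketch checks whether a small finite Markov chain is strongly lumpable with respect to a partition of its states, in the classical sense that every state in a block must send the same total probability into each block. The 3-state matrix, the function names `lumping_error` and `lump`, and the use of the maximum spread as a deviation measure are all assumptions made for illustration; they are not the definitions or bounds used in the talk, where states would be rooted trees and blocks would group trees by the clades they contain.

```python
from itertools import product


def lumping_error(P, blocks):
    """Largest disagreement, over all ordered pairs of blocks, in the
    probability that states of one block assign to the other block.
    A value of 0 means the chain is strongly lumpable (illustrative
    measure only, not the talk's definition of lumping error)."""
    worst = 0.0
    for src, dst in product(blocks, repeat=2):
        masses = [sum(P[x][y] for y in dst) for x in src]
        worst = max(worst, max(masses) - min(masses))
    return worst


def lump(P, blocks, tol=1e-12):
    """Return the transition matrix of the lumped chain, or None if P
    is not strongly lumpable with respect to `blocks`."""
    if lumping_error(P, blocks) > tol:
        return None
    # Any representative of a block gives the same block-to-block mass.
    return [[sum(P[src[0]][y] for y in dst) for dst in blocks]
            for src in blocks]


# Hypothetical 3-state chain (rows sum to 1).
P = [
    [0.50, 0.50, 0.00],
    [0.25, 0.50, 0.25],
    [0.25, 0.25, 0.50],
]

good = [[0], [1, 2]]   # states 1 and 2 agree on block-level masses
bad = [[0, 1], [2]]    # states 0 and 1 disagree

print(lump(P, good))           # lumped 2x2 matrix
print(lump(P, bad))            # None
print(lumping_error(P, bad))   # size of the disagreement
```

When the error is small but nonzero, the projected process is only approximately Markov, which is the situation the term quasi-lumpability refers to.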
Rodrigo Barreto Alves
Rodrigo completed his PhD in Statistics at UFRJ (2022), advised by Glauco Valle and Giulio Iacobelli, after earning a bachelor's degree in Actuarial Science from UFRJ (2013) and a master's degree in Mathematics from PUC-RJ (2017). He has worked as a substitute professor at UERJ and UFRRJ. His research interests are in Probability, Stochastic Processes and their applications, especially Markov chain Monte Carlo and statistical analysis for phylogenetic trees. He is currently a postdoctoral researcher at FGV-EMAp.
