Florencia G. Leonardi, Instituto de Matemática e Estatística – Universidade de São Paulo

Parsimonious stochastic chains with applications to classification and phylogeny of protein sequences

Friday, November 30, 2007, 9:30 - 10:30

Meeting room, Espace Turing


In this talk I will present some results on modeling symbolic sequences with parsimonious stochastic chains. Parsimonious stochastic chains, which include variable memory stochastic chains, generalize fixed-order Markov chains. We introduce a new algorithm, called SPST, to select the parsimonious stochastic chain model that best fits a sample of sequences. We then use the SPST algorithm to study two important problems in genomics: the classification of proteins into families and the evolution of biological sequences.