View Related Documents

Abstract

We introduce and analyse a simple model of genome evolution. It is based on two fundamental evolutionary events: gene loss and gene duplication. We are mainly interested in asymptotic distributions of gene families in a genome. This is motovated by previous work which consisted in fitting the available genomic data into, what is called paralog distributions. Two approaches are presented in this paper: continuous and discrete time models. A comparison of them is presented too – the asymptotic distribution for the continuous time model can be seen as a limit of the discrete time distributions, when probabilities of gene loss and gene duplication tend to zero. We view this paper as an intermediate step towards mathematically settling the problem of characterizing the shape of paralog distribution in bacterial genomes.
This research was partially supported by the State Committee for Scientific Research (Poland) Grants No. 2  P03A  031  25, and 7 T11F 016 21 and by the EC programme Centres of Excellence for States in phase of pre-accession, No. ICA1-CT-2000-70024.

Fulltext Preview

Image of the first page of the fulltext document