Authors:
RC Griffiths, S Tavaré
Journal name: 
Theor Popul Biol
Abstract: 
We consider inference about the history of a sample of DNA sequences, conditional upon the haplotype counts and the number of segregating sites observed at the present time. After deriving some theoretical results in the coalescent setting, we implement rejection sampling and importance sampling schemes to perform the inference. The importance sampling scheme addresses an extension of the Ewens Sampling Formula for a configuration of haplotypes and the number of segregating sites in the sample. The implementations include both constant and variable population size models. The methods are illustrated by two human Y chromosome datasets.
DOI: 
http://doi.org/10.1016/j.tpb.2018.04.006
Research group: 
Tavaré Group
E-pub date: 
25 Apr 2018