1. Home
  2. Publications
  3. A Dirichlet-multinomial mixed model for determining...

A Dirichlet-multinomial mixed model for determining differential abundance of mutational signatures

Abstract:
Mutational processes of diverse origin leave their imprints in the genome during tumour evolution. These imprints are called mutational signatures and they have been characterised for point mutations, structural variants and copy number changes. Each signature has an exposure , or abundance, per sample, which indicates how much a process has contributed to the overall genomic change. Mutational processes are not static, and a better understanding of their dynamics is key to characterise tumour evolution and identify cancer weaknesses that can be exploited during treatment. However, the structure of the data typically collected in this context makes it difficult to test whether signature exposures differ between samples or time-points. In general, the data consist of (1) patient-dependent vectors of counts for each sample and clonality group (2) generated from a covariate-dependent and compositional vector of probabilities with (3) a possibly group-dependent over-dispersion level. To model these data, we build on the Dirichlet-multinomial model to be able to model multivariate overdispersed vectors of counts as well as within-sample dependence and positive correlations between signatures. To estimate the model parameters, we implement a maximum likelihood estimator with a Laplace approximation of the random effect high-dimensional integrals and assess its bias and coverage by means of Monte Carlo simulations. We apply our approach to characterise differences of mutational processes between clonal and subclonal mutations across 23 cancer types of the PCAWG cohort. We find ubiquitous differential abundance of clonal and subclonal signatures across cancer types, and higher dispersion of signatures in the subclonal group, indicating higher variability between patients at subclonal level, possibly due to the presence of different clones with distinct active mutational processes. Mutational signature analysis is an expanding field and we envision our framework to be used widely to detect global changes in mutational process activity. Author Summary The genome is permanently subject to alterations due to errors in replication, faulty replication machinery, and external mutational processes such as tobacco smoke or UV light. Cancer is a disease of the genome, characterised by an abnormal growth of cells that harbour the same set of “clonal” mutations. In turn, these mutations might transform how cells accrue new “subclonal” mutations or the extent to which they tolerate them. The mutational signature framework lets us extract the information of which mutational processes have been active, and in which intensity, in creating a set of mutations. We extend this framework to statistically test the change in the relative intensity of mutational processes between conditions. In samples of 23 cancer types of the PCAWG project, we test the difference between mutational processes that contribute to mutations prior to cancer onset (clonal group), and upon cancer onset (subclonal group), whilst keeping into consideration patient-to-patient differences. We find differences in the majority of cancer types, and identify mutational processes which contribute preferentially to either group.
Authors:
L Morrill Gavarró, D-L Couturier, F Markowetz
Publication date:
1st Aug 2024
Full text
DOI