X Wang, F Markowetz, F De Sousa E Melo, JP Medema, L Vermeulen
Int J Biochem Cell Biol
Gene-expression-based classification studies have changed the way cancer is traditionally perceived. It is becoming increasingly clear that many cancer types are in fact not single diseases but rather consist of multiple molecular distinct subtypes. In this review, we discuss unsupervised classification studies of common malignancies during the recent years. We found that the bioinformatic workflow of many of these studies follows a common main stream, although different statistical tools may be preferred from case to case. Here we summarize the employed methods, with a special focus on consensus clustering and classification. For each critical step of the bioinformatic analysis, we explain the biological relevance and implications of the technical principles. We think that a better understanding of these ever more frequently used methods to study cancer heterogeneity by the biomedical community is relevant as these type of studies will have an important impact on patient stratification and cancer subtype-specific drug development in the future.