
Page manager: Web editorial team
Page updated: 2012-09-11 15:12



Variability in Metagenomic Count Data and Its Influence on the Identification of Differentially Abundant Genes

Journal article
Authors Viktor Jonsson
Tobias Österlund
Olle Nerman
Erik Kristiansson
Published in Journal of computational biology : a journal of computational molecular cell biology
Volume 24
Issue 4
Pages 311-326
ISSN 1557-8666
Publication year 2017
Published at Department of Mathematical Sciences
Language en
Links dx.doi.org/10.1089/cmb.2016.0180
Subject categories Bioinformatics and systems biology, Microbiology, Mathematical statistics


Metagenomics is the study of microorganisms in environmental and clinical samples using high-throughput sequencing of random fragments of their DNA. Since metagenomics does not require any prior culturing of isolates, entire microbial communities can be studied directly in their natural state. In metagenomics, the abundance of genes is quantified by sorting and counting the DNA fragments. The resulting count data are high-dimensional and affected by high levels of technical and biological noise that make the statistical analysis challenging. In this article, we introduce a hierarchical overdispersed Poisson model to explore the variability in metagenomic data. By analyzing three comprehensive data sets, we show that the gene-specific variability varies substantially between genes and is dependent on biological function. We also assess the power of identifying differentially abundant genes and show that incorrect assumptions about the gene-specific variability can lead to unacceptably high rates of false positives. Finally, we evaluate shrinkage approaches to improve the variance estimation and show that the choice of prior significantly affects the statistical power. The results presented in this study further elucidate the complex variance structure of metagenomic data and provide suggestions for accurate and reliable identification of differentially abundant genes.
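
To make the term "overdispersed Poisson" concrete, the following minimal sketch simulates counts for a single gene from a gamma-Poisson mixture, one common way to build an overdispersed Poisson model. It is not the authors' model: the abstract does not specify the hierarchy, and the function names, the `dispersion` parameter, and all numeric values are assumptions chosen for illustration. The point is only that letting the Poisson rate vary between samples pushes the variance of the counts well above their mean.

```python
import math
import random
import statistics


def poisson_draw(rate, rng):
    """Draw one Poisson count with Knuth's multiplication method.

    Adequate for the moderate rates used in this illustration.
    """
    threshold = math.exp(-rate)
    count = 0
    product = rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def simulate_overdispersed_counts(mean, dispersion, n_samples, rng):
    """Simulate counts for one hypothetical gene from a gamma-Poisson mixture.

    Each sample gets its own gamma-distributed rate with expectation `mean`
    and variance `dispersion * mean**2`; the count is Poisson given that rate.
    The marginal variance is therefore mean + dispersion * mean**2.
    """
    shape = 1.0 / dispersion
    scale = mean * dispersion
    counts = []
    for _ in range(n_samples):
        rate = rng.gammavariate(shape, scale)
        counts.append(poisson_draw(rate, rng))
    return counts


rng = random.Random(0)
# Assumed illustrative values, not taken from the article.
counts = simulate_overdispersed_counts(mean=20.0, dispersion=0.5,
                                       n_samples=20000, rng=rng)
sample_mean = statistics.mean(counts)
sample_var = statistics.variance(counts)

# A plain Poisson model would give variance close to the mean (about 20);
# here the theoretical variance is 20 + 0.5 * 20**2 = 220.
print(f"mean = {sample_mean:.1f}, variance = {sample_var:.1f}")
```

In this sketch a test that assumes plain Poisson variance would treat ordinary sample-to-sample fluctuation as signal, which is the kind of mismatch the abstract links to inflated false positive rates.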

