Till sidans topp

Sidansvarig: Webbredaktion
Sidan uppdaterades: 2012-09-11 15:12

Tipsa en vän

VarGenius executes cohort… - Göteborgs universitet Till startsida
Till innehåll Läs mer om hur kakor används på gu.se

VarGenius executes cohort-level DNA-seq variant calling and annotation and allows to manage the resulting data through a PostgreSQL database

Artikel i vetenskaplig tidskrift
Författare F. Musacchia
A. Ciolfi
M. Mutarelli
A. Bruselles
R. Castello
M. Pinelli
Swaraj Basu
S. Banfi
G. Casari
M. Tartaglia
V. Nigro
Publicerad i Bmc Bioinformatics
Volym 19
ISSN 1471-2105
Publiceringsår 2018
Publicerad vid Institutionen för biomedicin, avdelningen för medicinsk kemi och cellbiologi
Språk en
Länkar dx.doi.org/10.1186/s12859-018-2532-...
Ämnesord genetic-variation, pathogenicity, capture
Ämneskategorier Cell- och molekylärbiologi, Bioinformatik och systembiologi


BackgroundTargeted resequencing has become the most used and cost-effective approach for identifying causative mutations of Mendelian diseases both for diagnostics and research purposes. Due to very rapid technological progress, NGS laboratories are expanding their capabilities to address the increasing number of analyses. Several open source tools are available to build a generic variant calling pipeline, but a tool able to simultaneously execute multiple analyses, organize, and categorize the samples is still missing.ResultsHere we describe VarGenius, a Linux based command line software able to execute customizable pipelines for the analysis of multiple targeted resequencing data using parallel computing. VarGenius provides a database to store the output of the analysis (calling quality statistics, variant annotations, internal allelic variant frequencies) and sample information (personal data, genotypes, phenotypes). VarGenius can also perform the joint analysis of hundreds of samples with a single command, drastically reducing the time for the configuration and execution of the analysis.VarGenius executes the standard pipeline of the Genome Analysis Tool-Kit (GATK) best practices (GBP) for germinal variant calling, annotates the variants using Annovar, and generates a user-friendly output displaying the results through a web page.VarGenius has been tested on a parallel computing cluster with 52 machines with 120GB of RAM each. Under this configuration, a 50M whole exome sequencing (WES) analysis for a family was executed in about 7h (trio or quartet); a joint analysis of 30 WES in about 24h and the parallel analysis of 34 single samples from a 1M panel in about 2h.ConclusionsWe developed VarGenius, a master tool that faces the increasing demand of heterogeneous NGS analyses and allows maximum flexibility for downstream analyses. It paves the way to a different kind of analysis, centered on cohorts rather than on singleton. Patient and variant information are stored into the database and any output file can be accessed programmatically. VarGenius can be used for routine analyses by biomedical researchers with basic Linux skills providing additional flexibility for computational biologists to develop their own algorithms for the comparison and analysis of data.The software is freely available at: https://github.com/frankMusacchia/VarGenius

Sidansvarig: Webbredaktion|Sidan uppdaterades: 2012-09-11

På Göteborgs universitet använder vi kakor (cookies) för att webbplatsen ska fungera på ett bra sätt för dig. Genom att surfa vidare godkänner du att vi använder kakor.  Vad är kakor?