Publication date:
2017
Citation:
Evaluation of quality assessment protocols for high throughput genome resequencing data / M. Chiara, G. Pavesi. - In: FRONTIERS IN GENETICS. - ISSN 1664-8021. - 8:(2017 Jul), pp. 94.1-94.12. [10.3389/fgene.2017.00094]
Abstract:
Large-scale initiatives aiming to recover the complete sequence of thousands of human genomes are currently being undertaken worldwide, contributing to the generation of a comprehensive catalog of human genetic variation. The ultimate and most ambitious goal of human population-scale genomics is the characterization of the so-called human "variome," through the identification of causal mutations or haplotypes. Several research institutions worldwide currently use genotyping assays based on Next-Generation Sequencing (NGS) for diagnostics and clinical screenings, and the widespread application of such technologies promises major revolutions in medical science. Bioinformatic analysis of human resequencing data is one of the main factors limiting the effectiveness and general applicability of NGS for clinical studies. The requirement for multiple tools, to be combined in dedicated protocols in order to accommodate different types of data (gene panels, exomes, or whole genomes), and the high variability of the data make it difficult to establish a definitive strategy of general use. While several studies already compare the sensitivity and accuracy of bioinformatic pipelines for the identification of single nucleotide variants from resequencing data, little is known about the impact of quality assessment and read pre-processing strategies. In this work we discuss the major strengths and limitations of the various genome resequencing protocols that are currently used in molecular diagnostics and for the discovery of novel disease-causing mutations. By taking advantage of publicly available data, we devise and suggest a series of best practices for the pre-processing of the data that consistently improve the outcome of genotyping with minimal impact on computational costs.
IRIS type:
01 - Journal article
Keywords:
Genome resequencing; Molecular diagnostics; Next-generation sequencing read quality; Precision medicine; Whole exome sequencing; Molecular Medicine; Genetics; Genetics (clinical)
Authors:
M. Chiara, G. Pavesi