Journal article, Bioinformatics, Year: 2014

MindTheGap: integrated detection and assembly of short and long insertions

Abstract

Motivation: Insertions play an important role in genome evolution. However, such variants are difficult to detect from short-read sequencing data, especially when they exceed the paired-end insert size. Many approaches have been proposed to call short insertion variants based on paired-end mapping. However, there remains a lack of practical methods to detect and assemble long variants.

Results: We propose here an original method, called MindTheGap, for the integrated detection and assembly of insertion variants from re-sequencing data. Importantly, it is designed to call insertions of any size, whether they are novel or duplicated, homozygous or heterozygous in the donor genome. MindTheGap uses an efficient k-mer-based method to detect insertion sites in a reference genome, and subsequently assembles them from the donor reads. MindTheGap showed high recall and precision on simulated datasets of various genome complexities. When applied to real C. elegans and human NA12878 datasets, MindTheGap detected and correctly assembled insertions longer than 1 kb, using at most 14 GB of memory.

Availability: http://mindthegap.genouest.org
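
The k-mer-based detection of insertion sites mentioned in the abstract can be illustrated with a toy sketch. It is not taken from the MindTheGap code and makes several simplifying assumptions that the abstract does not state: the insertion is homozygous, reads are error-free, only the forward strand is considered, and the sequence around the breakpoint has no repeats. Under those assumptions, the reference k-mers that span a breakpoint are missing from the donor reads, and there are exactly k-1 of them, so a run of k-1 consecutive missing reference k-mers marks a candidate site. The function names `donor_kmer_set` and `candidate_insertion_sites` are hypothetical.

```python
def donor_kmer_set(reads, k):
    """Collect every k-mer that occurs in at least one donor read."""
    kmers = set()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmers.add(read[i:i + k])
    return kmers


def candidate_insertion_sites(reference, donor_kmers, k):
    """Return reference positions before which an insertion may sit.

    A site is reported when exactly k-1 consecutive reference k-mers are
    absent from the donor k-mer set and the run is flanked on both sides
    by k-mers that are present (toy criterion, see assumptions above).
    """
    sites = []
    run_start = None  # index of the first k-mer of the current missing run
    for i in range(len(reference) - k + 1):
        present = reference[i:i + k] in donor_kmers
        if not present and run_start is None:
            run_start = i
        elif present and run_start is not None:
            if i - run_start == k - 1 and run_start > 0:
                # The k-1 missing k-mers start at run_start, so the
                # breakpoint lies just before position run_start + k - 1.
                sites.append(run_start + k - 1)
            run_start = None
    return sites


# Toy data: a 10-bp insertion placed before position 8 of the reference.
reference = "ACGTTGCAAGGCTTAC"
insertion = "GGGTCCCATC"
donor = reference[:8] + insertion + reference[8:]
reads = [donor[i:i + 10] for i in range(len(donor) - 9)]  # error-free 10-bp reads
k = 4

print(candidate_insertion_sites(reference, donor_kmer_set(reads, k), k))  # [8]
```

The sketch only locates a candidate breakpoint. Recovering the inserted sequence, which the abstract describes as assembly from the donor reads, is a separate step that is not shown here.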
Main file
mindTheGap_preprint.pdf (300.67 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01081089, version 1 (06-11-2014)

License

Attribution - NonCommercial

Identifiers

HAL Id: hal-01081089
DOI: 10.1093/bioinformatics/btu545

Cite

Guillaume Rizk, Anaïs Gouin, Rayan Chikhi, Claire Lemaitre. MindTheGap: integrated detection and assembly of short and long insertions. Bioinformatics, 2014, 30 (24), pp. 3451-3457. ⟨10.1093/bioinformatics/btu545⟩. ⟨hal-01081089⟩