Poster (Scientific congresses and symposiums)
ToRQuEMaDA: Tool for retrieving queried eubacteria, metadata and dereplicating assemblies
Léonard, Raphaël; Sirjacobs, Damien; Sauvage, Eric et al.
2017, PhD Student Day 2017
 

Files


Full Text
raphael_leonard_studentday.pdf
Publisher postprint (897.3 kB)

Details



Keywords :
dereplication; genomic signature
Abstract :
The fast-growing number of available prokaryotic genomes, along with their uneven taxonomic distribution, makes it hard to assemble broadly sampled genome sets for phylogenomics and comparative genomics. Indeed, most new genomes belong to the same subset of hyper-sampled phyla, such as Proteobacteria and Firmicutes, or even to single species, such as Escherichia coli (>3000 genomes as of March 2017), while the continuous flow of newly discovered phyla calls for regular updates of in-house databases. This situation makes it difficult to maintain sets of representative genomes that combine lesser-known phyla, for which only a few species are available, with sound subsets of highly abundant phyla. An automated method is required, but none is publicly available. In this work, the k-mer composition of DNA sequences, in conjunction with quality metrics for publicly available assemblies, was used to develop an automated approach that selects a high-quality, non-redundant subset of representative genomes using our hybrid divide-and-conquer / greedy clustering method.
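The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it covers only a single greedy pass (the divide-and-conquer part is not shown), and the function names, the cosine distance, the value of `k`, the `threshold`, and the single `quality` score per assembly are all assumptions made for illustration.

```python
import math
from collections import Counter


def kmer_profile(seq, k=4):
    """Return the relative k-mer frequencies of a DNA sequence as a dict."""
    seq = seq.upper()
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= set("ACGT")  # skip k-mers with ambiguous bases
    )
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()} if total else {}


def cosine_distance(p, q):
    """Cosine distance between two sparse k-mer profiles (assumed metric)."""
    dot = sum(v * q.get(kmer, 0.0) for kmer, v in p.items())
    norm = math.sqrt(sum(v * v for v in p.values())) * math.sqrt(
        sum(v * v for v in q.values())
    )
    return 1.0 - dot / norm if norm else 1.0


def greedy_dereplicate(genomes, threshold=0.05, k=4):
    """Pick representatives from (name, sequence, quality) tuples.

    Genomes are visited from highest to lowest quality; a genome becomes a
    representative only if it is farther than `threshold` from every
    representative chosen so far. Both parameters are hypothetical.
    """
    ranked = sorted(genomes, key=lambda g: g[2], reverse=True)
    representatives = []  # list of (name, profile) pairs
    for name, seq, _quality in ranked:
        profile = kmer_profile(seq, k)
        if all(cosine_distance(profile, rep) > threshold
               for _, rep in representatives):
            representatives.append((name, profile))
    return [name for name, _ in representatives]


if __name__ == "__main__":
    # Toy sequences, not real assemblies.
    toy = [
        ("genome_a", "ACGTTGCA" * 20, 0.9),
        ("genome_b", "ACGTTGCA" * 20, 0.5),  # redundant with genome_a
        ("genome_c", "GGGCCCGGGCCC" * 20, 0.7),
    ]
    print(greedy_dereplicate(toy))
```

Visiting genomes in decreasing quality order means that, within a group of near-identical genomes, the best-scoring assembly is the one retained. One plausible way to scale such a pass, again an assumption, is to run it on batches of genomes and then once more on the pooled representatives.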
Disciplines :
Biochemistry, biophysics & molecular biology
Author, co-author :
Léonard, Raphaël  ;  Université de Liège > Département des sciences de la vie > Cristallographie des macromolécules biologiques
Sirjacobs, Damien ;  Université de Liège > Département des sciences de la vie > Phylogénomique des eucaryotes
Sauvage, Eric ;  Université de Liège > Département des sciences de la vie > Centre d'ingénierie des protéines
Kerff, Frédéric  ;  Université de Liège > Département des sciences de la vie > Centre d'ingénierie des protéines
Baurain, Denis  ;  Université de Liège > Département des sciences de la vie > Phylogénomique des eucaryotes
Language :
English
Title :
ToRQuEMaDA: Tool for retrieving queried eubacteria, metadata and dereplicating assemblies
Publication date :
11 May 2017
Number of pages :
A0
Event name :
PhD Student Day 2017
Event organizer :
ULg - Université de Liège
Event place :
Liège, Belgium
Event date :
11 May 2017
Funders :
FRIA - Fonds pour la Formation à la Recherche dans l'Industrie et dans l'Agriculture [BE]
Available on ORBi :
since 12 June 2017
