Paper published in a book (Scientific congresses and symposiums)
Segment and combine approach for Biological Sequence Classification
Geurts, Pierre; Blanco Cuesta, Antia; Wehenkel, Louis
2005, in Proc. IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB 2005)
Peer reviewed
 

Files


Full Text
geurts-cibcb2005.pdf
Publisher postprint (104.92 kB)

Details



Keywords :
bioinformatics; machine learning
Abstract :
This paper presents a new algorithm, based on the segment and combine paradigm, for the automatic classification of biological sequences. The algorithm classifies a sequence by aggregating the predictions made for its subsequences by a classifier that is learned from a random sample of training subsequences. This generic approach is combined with decision-tree-based ensemble methods, which scale well with both sample size and vocabulary size. The method is applied to three families of problems: DNA sequence recognition, splice junction detection, and gene regulon prediction. Compared with standard approaches based on n-grams, it appears competitive in terms of accuracy, flexibility, and scalability. The paper also highlights the possibility of exploiting the resulting models to identify interpretable patterns specific to a given class of biological sequences.
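The segment and combine procedure summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: a per-subsequence frequency table stands in for the decision tree ensembles, and the function names, toy sequences, subsequence length, and sample count are all assumptions made for illustration.

```python
import random
from collections import Counter, defaultdict


def sample_subsequences(sequences, labels, length, n_samples, rng):
    """Segment step: draw random fixed-length subsequences.

    Assumption: each subsequence is labeled with the class of the
    sequence it was drawn from.
    """
    samples = []
    for _ in range(n_samples):
        i = rng.randrange(len(sequences))
        seq = sequences[i]
        start = rng.randrange(len(seq) - length + 1)
        samples.append((seq[start:start + length], labels[i]))
    return samples


def train_subsequence_model(samples):
    """Stand-in learner: class counts per subsequence.

    This lookup table replaces the decision tree ensembles named in
    the abstract; it is only meant to keep the sketch dependency-free.
    """
    counts = defaultdict(Counter)
    for sub, label in samples:
        counts[sub][label] += 1
    return counts


def classify(sequence, model, classes, length):
    """Combine step: sum class-probability estimates over all
    subsequences of the sequence and return the best-scoring class."""
    totals = {c: 0.0 for c in classes}
    for start in range(len(sequence) - length + 1):
        seen = model.get(sequence[start:start + length])
        n = sum(seen.values()) if seen else 0
        for c in classes:
            # Subsequences never seen in training vote uniformly.
            totals[c] += seen[c] / n if n else 1.0 / len(classes)
    return max(classes, key=lambda c: totals[c])


# Hypothetical toy data: class "a" is A/T-containing, class "b" is G/C-only.
sequences = ["ACGTACGTACGT", "ACGTTTACGTAC", "GGGCCCGGGCCC", "CCCGGGCCGGGC"]
labels = ["a", "a", "b", "b"]
classes = ["a", "b"]

rng = random.Random(0)
model = train_subsequence_model(
    sample_subsequences(sequences, labels, length=3, n_samples=200, rng=rng)
)
print(classify("ACGTACGTAC", model, classes, length=3))
print(classify("GGCCCGGGCC", model, classes, length=3))
```

Swapping the frequency table for any classifier that outputs class-probability estimates for a subsequence would leave the segment and combine steps unchanged.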
Disciplines :
Computer science
Author, co-author :
Geurts, Pierre; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst. Montefiore) > Systèmes et modélisation
Blanco Cuesta, Antia; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst. Montefiore) > Systèmes et modélisation
Wehenkel, Louis; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst. Montefiore) > Systèmes et modélisation
Language :
English
Title :
Segment and combine approach for Biological Sequence Classification
Publication date :
2005
Event name :
IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB 2005)
Event place :
San Diego, United States
Event date :
14-15 Nov. 2005
Audience :
International
Main work title :
Proc. IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB 2005)
Pages :
194-201
Peer reviewed :
Peer reviewed
Available on ORBi :
since 16 October 2009

Statistics


Number of views
86 (3 by ULiège)
Number of downloads
151 (3 by ULiège)

Scopus citations®
5
Scopus citations® without self-citations
3
OpenCitations
5
