Iterative multi-task sequence labeling for predicting structural properties of proteins

Maes, Francis; Becker, Julien; Wehenkel, Louis

Paper published in a book (Scientific congresses and symposiums)

2011 • In ESANN 2011

Peer reviewed

Permalink
https://hdl.handle.net/2268/129254

Files

Full Text

es2011-52.pdf

Publisher postprint (571.28 kB)

All documents in ORBi are protected by a user license.

Details

Keywords :

Bioinformatics; Multitask learning; Structured prediction

Abstract :

[en] Developing computational tools that predict structural information about proteins from their amino acid sequences is of primary importance in protein science. Problems such as the prediction of secondary structure, solvent accessibility, or disordered regions can be expressed as sequence labeling problems and could be solved independently by existing machine-learning-based sequence labeling approaches. Since these problems are closely related, however, we propose to address them jointly in a multi-task approach. To this end, we introduce a new generic framework for iterative multi-task sequence labeling. We apply this conceptually simple but quite effective strategy to jointly solve a set of five protein annotation tasks. Our empirical results on two protein datasets show that the proposed strategy significantly outperforms the single-task approaches.
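
The abstract does not spell out the algorithm, so the following is only a minimal sketch of one plausible reading of "iterative multi-task sequence labeling": each task is re-labeled in turn, and every labeler may look at the current predictions of all tasks. Everything in it is an assumption made for illustration: the function name `iterative_multitask_labeling`, the task names `ss` (secondary structure) and `sa` (solvent accessibility), the label alphabets, the number of passes, and above all the hand-written toy rules, which stand in for the learned sequence labelers the paper would use.

```python
from typing import Callable, Dict, List

Sequence = List[str]            # one amino-acid symbol per position
Labels = Dict[str, List[str]]   # task name -> one label per position
# A labeler sees the input sequence, the current labels of ALL tasks,
# and the position to label. In practice this would be a trained model.
Labeler = Callable[[Sequence, Labels, int], str]


def iterative_multitask_labeling(
    seq: Sequence,
    labelers: Dict[str, Labeler],
    default: Dict[str, str],
    passes: int = 3,
) -> Labels:
    """Repeatedly re-label each task using the latest labels of every task."""
    # Start with a default label everywhere, for every task.
    current: Labels = {task: [default[task]] * len(seq) for task in labelers}
    for _ in range(passes):
        for task, labeler in labelers.items():
            # Re-label one whole task; the other tasks' labels act as features.
            current[task] = [labeler(seq, current, i) for i in range(len(seq))]
    return current


# Hypothetical toy rules, NOT learned models and NOT biologically meaningful.
def toy_ss(seq: Sequence, cur: Labels, i: int) -> str:
    return "H" if seq[i] in "AEL" else "C"   # helix vs. coil


def toy_sa(seq: Sequence, cur: Labels, i: int) -> str:
    # Uses the other task's current prediction as a feature.
    return "B" if cur["ss"][i] == "H" else "E"   # buried vs. exposed


result = iterative_multitask_labeling(
    list("AKLE"),
    labelers={"ss": toy_ss, "sa": toy_sa},
    default={"ss": "C", "sa": "E"},
)
print(result)
```

In this sketch the coupling between tasks comes only from passing the shared `current` dictionary to every labeler; a single-task baseline would correspond to labelers that ignore it.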

Research center :

GIGA-Bioinformatics

Disciplines :

Computer science

Author, co-author :

Maes, Francis ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation

Becker, Julien ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation

Wehenkel, Louis ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation

Language :

English

Title :

Iterative multi-task sequence labeling for predicting structural properties of proteins

Publication date :

2011

Event name :

19th European Symposium on Artificial Neural Networks

Event organizer :

Université Catholique de Louvain

Event place :

Bruges, Belgium

Event date :

from 27-04-2011 to 29-04-2011

Audience :

International

Main work title :

ESANN 2011

ISBN/EAN :

978-2-87419-044-5

Peer reviewed :

Peer reviewed

Funders :

FRIA - Fonds pour la Formation à la Recherche dans l'Industrie et dans l'Agriculture [BE]

Available on ORBi :

since 18 August 2012
