References of "Geurts, Pierre"
     in
Bookmark and Share    
Full Text
Peer Reviewed
See detailRandom forests with random projections of the output space for high dimensional multi-label classification
Joly, Arnaud ULg; Geurts, Pierre ULg; Wehenkel, Louis ULg

in Machine Learning and Knowledge Discovery in Databases (2014, September 15)

We adapt the idea of random projections applied to the out- put space, so as to enhance tree-based ensemble methods in the context of multi-label classification. We show how learning time complexity can ... [more ▼]

We adapt the idea of random projections applied to the out- put space, so as to enhance tree-based ensemble methods in the context of multi-label classification. We show how learning time complexity can be reduced without affecting computational complexity and accuracy of predictions. We also show that random output space projections may be used in order to reach different bias-variance tradeoffs, over a broad panel of benchmark problems, and that this may lead to improved accuracy while reducing significantly the computational burden of the learning stage. [less ▲]

Detailed reference viewed: 47 (21 ULg)
Full Text
Peer Reviewed
See detailSimple connectome inference from partial correlation statistics in calcium imaging
Sutera, Antonio ULg; Joly, Arnaud ULg; François-Lavet, Vincent ULg et al

in Proceedings of Connectomics 2014 (ECML 2014) (2014, June)

In this work, we propose a simple yet effective solution to the problem of connectome inference in calcium imaging data. The proposed algorithm consists of two steps. First, processing the raw signals to ... [more ▼]

In this work, we propose a simple yet effective solution to the problem of connectome inference in calcium imaging data. The proposed algorithm consists of two steps. First, processing the raw signals to detect neural peak activities. Second, inferring the degree of association between neurons from partial correlation statistics. This paper summarises the methodology that led us to win the Connectomics Challenge, proposes a simplified version of our method, and finally compares our results with respect to other inference methods. [less ▲]

Detailed reference viewed: 350 (108 ULg)
Full Text
See detailClassifying pairs with trees for supervised biological network inference
Schrynemackers, Marie ULg; Wehenkel, Louis ULg; Madan Babu, Mohan et al

E-print/Working paper (2014)

Detailed reference viewed: 20 (3 ULg)
Full Text
Peer Reviewed
See detailExploiting SNP Correlations within Random Forest for Genome-Wide Association Studies
Botta, Vincent ULg; Louppe, Gilles ULg; Geurts, Pierre ULg et al

in PLoS ONE (2014)

The primary goal of genome-wide association studies (GWAS) is to discover variants that could lead, in isolation or in combination, to a particular trait or disease. Standard approaches to GWAS, however ... [more ▼]

The primary goal of genome-wide association studies (GWAS) is to discover variants that could lead, in isolation or in combination, to a particular trait or disease. Standard approaches to GWAS, however, are usually based on univariate hypothesis tests and therefore can account neither for correlations due to linkage disequilibrium nor for combinations of several markers. To discover and leverage such potential multivariate interactions, we propose in this work an extension of the Random Forest algorithm tailored for structured GWAS data. In terms of risk prediction, we show empirically on several GWAS datasets that the proposed T-Trees method significantly outperforms both the original Random Forest algorithm and standard linear models, thereby suggesting the actual existence of multivariate non-linear effects due to the combinations of several SNPs. We also demonstrate that variable importances as derived from our method can help identify relevant loci. Finally, we highlight the strong impact that quality control procedures may have, both in terms of predictive power and loci identification. [less ▲]

Detailed reference viewed: 40 (8 ULg)
Full Text
Peer Reviewed
See detailData normalization and supervised learning to assess the condition of patients with multiple sclerosis based on gait analysis
Azrour, Samir ULg; Pierard, Sébastien ULg; Geurts, Pierre ULg et al

in European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN) (2014, April)

Gait impairment is considered as an important feature of disability in multiple sclerosis but its evaluation in the clinical routine remains limited. In this paper, we assess, by means of supervised ... [more ▼]

Gait impairment is considered as an important feature of disability in multiple sclerosis but its evaluation in the clinical routine remains limited. In this paper, we assess, by means of supervised learning, the condition of patients with multiple sclerosis based on their gait descriptors obtained with a gait analysis system. As the morphological characteristics of individuals influence their gait while being in first approximation independent of the disease level, an original strategy of data normalization with respect to these characteristics is described and applied beforehand in order to obtain more reliable predictions. In addition, we explain how we address the problem of missing data which is a common issue in the field of clinical evaluation. Results show that, based on machine learning combined to the proposed data handling techniques, we can predict a score highly correlated with the condition of patients. [less ▲]

Detailed reference viewed: 103 (40 ULg)
Full Text
Peer Reviewed
See detailNIMEFI: Gene Regulatory Network Inference using Multiple Ensemble Feature Importance Algorithms
Ruyssinck, Joeri; Huynh-Thu, Vân Anh ULg; Geurts, Pierre ULg et al

in PLoS ONE (2014)

One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts ... [more ▼]

One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available. [less ▲]

Detailed reference viewed: 13 (3 ULg)
Full Text
Peer Reviewed
See detailRating Network Paths for Locality-Aware Overlay Construction and Routing
Du, Wei; Liao, Yongjun ULg; Tao, Narisu et al

in IEEE/ACM Transactions on Networking (2014)

This paper investigates the rating of network paths, i.e. acquiring quantized measures of path properties such as round-trip time and available bandwidth. Comparing to finegrained measurements, coarse ... [more ▼]

This paper investigates the rating of network paths, i.e. acquiring quantized measures of path properties such as round-trip time and available bandwidth. Comparing to finegrained measurements, coarse-grained ratings are appealing in that they are not only informative but also cheap to obtain. Motivated by this insight, we firstly address the scalable acquisition of path ratings by statistical inference. By observing similarities to recommender systems, we examine the applicability of solutions to recommender system and show that our inference problem can be solved by a class of matrix factorization techniques. A technical contribution is an active and progressive inference framework that not only improves the accuracy by selectively measuring more informative paths but also speeds up the convergence for available bandwidth by incorporating its measurement methodology. Then, we investigate the usability of rating-based network measurement and inference in applications. A case study is performed on whether locality awareness can be achieved for overlay networks of Pastry and BitTorrent using inferred ratings. We show that such coarse-grained knowledge can improve the performance of peer selection and that finer granularities do not always lead to larger improvements. [less ▲]

Detailed reference viewed: 34 (7 ULg)
Full Text
Peer Reviewed
See detailBridging physiological and evolutionary time-scales in a gene regulatory network.
Marchand, Gwenaelle; Huynh-Thu, Vân Anh ULg; Kane, Nolan C. et al

in The New phytologist (2014)

Gene regulatory networks (GRNs) govern phenotypic adaptations and reflect the trade-offs between physiological responses and evolutionary adaptation that act at different time-scales. To identify patterns ... [more ▼]

Gene regulatory networks (GRNs) govern phenotypic adaptations and reflect the trade-offs between physiological responses and evolutionary adaptation that act at different time-scales. To identify patterns of molecular function and genetic diversity in GRNs, we studied the drought response of the common sunflower, Helianthus annuus, and how the underlying GRN is related to its evolution. We examined the responses of 32 423 expressed sequences to drought and to abscisic acid (ABA) and selected 145 co-expressed transcripts. We characterized their regulatory relationships in nine kinetic studies based on different hormones. From this, we inferred a GRN by meta-analyses of a Gaussian graphical model and a random forest algorithm and studied the genetic differentiation among populations (FST ) at nodes. We identified two main hubs in the network that transport nitrate in guard cells. This suggests that nitrate transport is a critical aspect of the sunflower physiological response to drought. We observed that differentiation of the network genes in elite sunflower cultivars is correlated with their position and connectivity. This systems biology approach combined molecular data at different time-scales and identified important physiological processes. At the evolutionary level, we propose that network topology could influence responses to human selection and possibly adaptation to dry environments. [less ▲]

Detailed reference viewed: 10 (2 ULg)
Full Text
Peer Reviewed
See detailIdentification of a microRNA landscape targeting the PI3K/Akt signaling pathway in inflammation-induced colorectal carcinogenesis
JOSSE, Claire ULg; Bouznad, Nassim ULg; Geurts, Pierre ULg et al

in American Journal of Physiology - Gastrointestinal and Liver Physiology (2014), 306

Inflammation can contribute to tumor formation; however, markers that predict progression are still lacking. In the present study, the well-established azoxymethane (AOM)/dextran sulfate sodium (DSS ... [more ▼]

Inflammation can contribute to tumor formation; however, markers that predict progression are still lacking. In the present study, the well-established azoxymethane (AOM)/dextran sulfate sodium (DSS)-induced mouse model of colitis-associated cancer was used to analyze microRNA (miRNA) modulation accompanying inflammation-induced tumor development and to determine whether inflammation-triggered miRNA alterations affect the expression of genes or pathways involved in cancer. A miRNA microarray experiment was performed to establish miRNA expression profiles in mouse colon at early and late time points during inflammation and/or tumor growth. Chronic inflammation and carcinogenesis were associated with distinct changes in miRNA expression. Nevertheless, prediction algorithms of miRNA-mRNA interactions and computational analyses based on ranked miRNA lists consistently identified putative target genes that play essential roles in tumor growth or that belong to key carcinogenesis-related signaling pathways. We identified PI3K/Akt and the insulin growth factor-1 (IGF-1) as major pathways being affected in the AOM/DSS model. DSS-induced chronic inflammation downregulates miR-133a and miR-143/145, which is reportedly associated with human colorectal cancer and PI3K/Akt activation. Accordingly, conditioned medium from inflammatory cells decreases the expression of these miRNA in colorectal adenocarcinoma Caco-2 cells. Overexpression of miR-223, one of the main miRNA showing strong upregulation during AOM/DSS tumor growth, inhibited Akt phosphorylation and IGF-1R expression in these cells. Cell sorting from mouse colons delineated distinct miRNA expression patterns in epithelial and myeloid cells during the periods preceding and spanning tumor growth. Hence, cell-type-specific miRNA dysregulation and subsequent PI3K/Akt activation may be involved in the transition from intestinal inflammation to cancer. [less ▲]

Detailed reference viewed: 32 (6 ULg)
Full Text
Peer Reviewed
See detailOn protocols and measures for the validation of supervised methods for the inference of biological networks
Schrynemackers, Marie ULg; Kuffner, Robert; Geurts, Pierre ULg

in Frontiers in genetics (2013), 4(262),

Networks provide a natural representation of molecular biology knowledge, in particular to model relationships between biological entities such as genes, proteins, drugs, or diseases. Because of the ... [more ▼]

Networks provide a natural representation of molecular biology knowledge, in particular to model relationships between biological entities such as genes, proteins, drugs, or diseases. Because of the effort, the cost, or the lack of the experiments necessary for the elucidation of these networks, computational approaches for network inference have been frequently investigated in the literature. In this paper, we examine the assessment of supervised network inference. Supervised inference is based on machine learning techniques that infer the network from a training sample of known interacting and possibly non-interacting entities and additional measurement data. While these methods are very effective, their reliable validation in silico poses a challenge, since both prediction and validation need to be performed on the basis of the same partially known network. Cross-validation techniques need to be specifically adapted to classification problems on pairs of objects. We perform a critical review and assessment of protocols and measures proposed in the literature and derive specific guidelines how to best exploit and evaluate machine learning techniques for network inference. Through theoretical considerations and in silico experiments, we analyze in depth how important factors influence the outcome of performance estimation. These factors include the amount of information available for the interacting entities, the sparsity and topology of biological networks, and the lack of experimentally verified non-interacting pairs. [less ▲]

Detailed reference viewed: 41 (14 ULg)
Full Text
Peer Reviewed
See detailUnderstanding variable importances in forests of randomized trees
Louppe, Gilles ULg; Wehenkel, Louis ULg; Sutera, Antonio ULg et al

in Advances in Neural Information Processing Systems 26 (2013, December)

Despite growing interest and practical use in various scientific areas, variable importances derived from tree-based ensemble methods are not well understood from a theoretical point of view. In this work ... [more ▼]

Despite growing interest and practical use in various scientific areas, variable importances derived from tree-based ensemble methods are not well understood from a theoretical point of view. In this work we characterize the Mean Decrease Impurity (MDI) variable importances as measured by an ensemble of totally randomized trees in asymptotic sample and ensemble size conditions. We derive a three-level decomposition of the information jointly provided by all input variables about the output in terms of i) the MDI importance of each input variable, ii) the degree of interaction of a given input variable with the other input variables, iii) the different interaction terms of a given degree. We then show that this MDI importance of a variable is equal to zero if and only if the variable is irrelevant and that the MDI importance of a relevant variable is invariant with respect to the removal or the addition of irrelevant variables. We illustrate these properties on a simple example and discuss how they may change in the case of non-totally randomized trees such as Random Forests and Extra-Trees. [less ▲]

Detailed reference viewed: 1077 (136 ULg)
Full Text
Peer Reviewed
See detailDMFSGD: A Decentralized Matrix Factorization Algorithm for Network Distance Prediction
Liao, Yongjun ULg; Du, Wei; Geurts, Pierre ULg et al

in IEEE/ACM Transactions on Networking (2013), 21(5), 1511-1524

The knowledge of end-to-end network distances is essential to many Internet applications. As active probing of all pairwise distances is infeasible in large-scale networks, a natural idea is to measure a ... [more ▼]

The knowledge of end-to-end network distances is essential to many Internet applications. As active probing of all pairwise distances is infeasible in large-scale networks, a natural idea is to measure a few pairs and to predict the other ones without actually measuring them. This paper formulates the prediction problem as matrix completion where the unknown entries in a pairwise distance matrix constructed from a network are to be predicted. By assuming that the distance matrix has a low-rank characteristics, the problem is solvable by lowrank approximation based on matrix factorization. The new formulation circumvents the well-known drawbacks of existing approaches based on Euclidean embedding. A new algorithm, so-called Decentralized Matrix Factorization by Stochastic Gradient Descent (DMFSGD), is proposed. By letting network nodes exchange messages with each other, the algorithm is fully decentralized and only requires each node to collect and to process local measurements, with neither explicit matrix constructions nor special nodes such as landmarks and central servers. In addition, we compared comprehensively matrix factorization and Euclidean embedding to demonstrate the suitability of the former on network distance prediction. We further studied the incorporation of a robust loss function and of non-negativity constraints. Extensive experiments on various publicly-available datasets of network delays show not only the scalability and the accuracy of our approach, but also its usability in real Internet applications. [less ▲]

Detailed reference viewed: 147 (28 ULg)
Full Text
Peer Reviewed
See detailAutomated Processing of Zebrafish Imaging Data: A Survey
Mikut, Ralf; Dickmeis, Thomas; Driever, Wolfgang et al

in Zebrafish (2013), 10(3), 401-421

Due to the relative transparency of its embryos and larvae, the zebrafish is an ideal model organism for bioimaging approaches in vertebrates. Novel microscope technologies allow the imaging of ... [more ▼]

Due to the relative transparency of its embryos and larvae, the zebrafish is an ideal model organism for bioimaging approaches in vertebrates. Novel microscope technologies allow the imaging of developmental processes in unprecedented detail, and they enable the use of complex image-based read-outs for high-throughput/high-content screening. Such applications can easily generate Terabytes of image data, the handling and analysis of which becomes a major bottleneck in extracting the targeted information. Here, we describe the current state of the art in computational image analysis in the zebrafish system. We discuss the challenges encountered when handling high-content image data, especially with regard to data quality, annotation, and storage. We survey methods for preprocessing image data for further analysis, and describe selected examples of automated image analysis, including the tracking of cells during embryogenesis, heartbeat detection, identification of dead embryos, recognition of tissues and anatomical landmarks, and quantification of behavioral patterns of adult fish. We review recent examples for applications using such methods, such as the comprehensive analysis of cell lineages during early development, the generation of a three-dimensional brain atlas of zebrafish larvae, and high-throughput drug screens based on movement patterns. Finally, we identify future challenges for the zebrafish image analysis community, notably those concerning the compatibility of algorithms and data formats for the assembly of modular analysis pipelines. [less ▲]

Detailed reference viewed: 66 (9 ULg)
Full Text
Peer Reviewed
See detailExtremely Randomized Trees and Random Subwindows for Image Classification, Annotation, and Retrieval
Marée, Raphaël ULg; Wehenkel, Louis ULg; Geurts, Pierre ULg

in Criminisi, A; Shotton, J (Eds.) Decision Forests in Computer Vision and Medical Image Analysis, Advances in Computer Vision and Pattern Recognition (2013)

We present a unified framework involving the extraction of random subwindows within images and the induction of ensembles of extremely randomized trees. We discuss the specialization of this framework for ... [more ▼]

We present a unified framework involving the extraction of random subwindows within images and the induction of ensembles of extremely randomized trees. We discuss the specialization of this framework for solving several general problems in computer vision, ranging from image classification and segmentation to content-based image retrieval and interest point detection. The methods are illustrated on various applications and datasets from the biomedical domain [less ▲]

Detailed reference viewed: 240 (46 ULg)
Full Text
Peer Reviewed
See detailLong-incubation time-interferon-gamma release assays in response to PPD-, ESAT-6- and/or CFP-10 for the diagnosis of Mycobacterium tuberculosis infection in children
Schepers, Kinda; Mouchet, Françoise; Dirix, Violette et al

in Clinical and Vaccine Immunology (2013)

Background: Diagnosis of childhood active tuberculosis (aTB) or latent Mycobacterium tuberculosis (Mtb) infection (LTBI) remains a challenge, and replacement of tuberculin skin tests (TST) by ... [more ▼]

Background: Diagnosis of childhood active tuberculosis (aTB) or latent Mycobacterium tuberculosis (Mtb) infection (LTBI) remains a challenge, and replacement of tuberculin skin tests (TST) by commercialized interferon-gamma release assays (IGRA) is not currently recommended. Methods: 266 children between 1 month and 15 years of age, 214 being at risk of recent Mtb infection and 51 being included as controls, were prospectively enrolled. According results of clinical evaluation, TST, chest X-Ray and microbiology, children were classified as non-infected, LTBI or aTB. Long-incubation time PPD-, ESAT-6-, and CFP-10-IGRA were performed and evaluated for their accuracy to correctly classify the children. Results: Whereas both TST and PPD-IGRA were suboptimal to detect aTB, combining CFP-10-IGRA with TST or with PPD-IGRA allowed us to detect all the children with aTB, with 96% specificity for children who were positive for CFP-10-IGRA. Moreover, combination of CFP-10- and PPD-IGRA also detected 96% of children classified as LTBI, but a strong IFN-γ response to CFP-10 (>500 pg/ml) was highly suggestive of aTB at least among children less than 3 years old. Conclusions: Long-incubation time CFP-10- and PPD-IGRA should help the clinicians to identify quickly aTB or LTBI in young children. [less ▲]

Detailed reference viewed: 3 (0 ULg)
Full Text
Peer Reviewed
See detailGene regulatory network inference from systems genetics data using tree-based methods
Huynh-Thu, Vân Anh ULg; Wehenkel, Louis ULg; Geurts, Pierre ULg

in de la Fuente, Alberto (Ed.) Gene Network Inference - Verification of Methods for Systems Genetics Data (2013)

One of the pressing open problems of computational systems biology is the elucidation of the topology of gene regulatory networks (GRNs). In an attempt to solve this problem, the idea of systems genetics ... [more ▼]

One of the pressing open problems of computational systems biology is the elucidation of the topology of gene regulatory networks (GRNs). In an attempt to solve this problem, the idea of systems genetics is to exploit the natural variations that exist between the DNA sequences of related individuals and that can represent the randomized and multifactorial perturbations necessary to recover GRNs. In this chapter, we present new methods, called GENIE3-SG-joint and GENIE3- SG-sep, for the inference of GRNs from systems genetics data. Experiments on the artificial data of the StatSeq benchmark and of the DREAM5 Systems Genetics challenge show that exploiting jointly expression and genetic data is very helpful for recovering GRNs, and one of our methods outperforms by a large extent the official best performing method of the DREAM5 challenge. [less ▲]

Detailed reference viewed: 86 (19 ULg)
Full Text
See detailOrdinal Rating of Network Performance and Inference by Matrix Completion
Du, Wei; Liao, Yongjun ULg; Geurts, Pierre ULg et al

Report (2012)

This paper addresses the large-scale acquisition of end-to-end network performance. We made two distinct contributions: ordinal rating of network performance and inference by matrix completion. The former ... [more ▼]

This paper addresses the large-scale acquisition of end-to-end network performance. We made two distinct contributions: ordinal rating of network performance and inference by matrix completion. The former reduces measurement costs and unifies various metrics which eases their processing in applications. The latter enables scalable and accurate inference with no requirement of structural information of the network nor geometric constraints. By combining both, the acquisition problem bears strong similarities to recommender systems. This paper investigates the applicability of various matrix factorization models used in recommender systems. We found that the simple regularized matrix factorization is not only practical but also produces accurate results that are beneficial for peer selection. [less ▲]

Detailed reference viewed: 24 (4 ULg)
Full Text
Peer Reviewed
See detailEmbedding Monte Carlo search of features in tree-based ensemble methods
Maes, Francis ULg; Geurts, Pierre ULg; Wehenkel, Louis ULg

in Flach, Peter; De Bie, Tijl; Cristianini, Nello (Eds.) Machine Learning and Knowledge Discovery in Data Bases (2012, September)

Feature generation is the problem of automatically constructing good features for a given target learning problem. While most feature generation algorithms belong either to the filter or to the wrapper ... [more ▼]

Feature generation is the problem of automatically constructing good features for a given target learning problem. While most feature generation algorithms belong either to the filter or to the wrapper approach, this paper focuses on embedded feature generation. We propose a general scheme to embed feature generation in a wide range of tree-based learning algorithms, including single decision trees, random forests and tree boosting. It is based on the formalization of feature construction as a sequential decision making problem addressed by a tractable Monte Carlo search algorithm coupled with node splitting. This leads to fast algorithms that are applicable to large-scale problems. We empirically analyze the performances of these tree-based learners combined or not with the feature generation capability on several standard datasets. [less ▲]

Detailed reference viewed: 46 (7 ULg)
Full Text
Peer Reviewed
See detailMixtures of Bagged Markov Tree Ensembles
Schnitzler, François ULg; Geurts, Pierre ULg; Wehenkel, Louis ULg

in Cano Utrera, Andrès; Gómez-Olmedo, Manuel; Nielsen, Thomas (Eds.) Proceedings of the 6th European Workshop on Probabilistic Graphical Models (2012, September)

Markov trees, a probabilistic graphical model for density estimation, can be expanded in the form of a weighted average of Markov Trees. Learning these mixtures or ensembles from observations can be ... [more ▼]

Markov trees, a probabilistic graphical model for density estimation, can be expanded in the form of a weighted average of Markov Trees. Learning these mixtures or ensembles from observations can be performed to reduce the bias or the variance of the estimated model. We propose a new combination of both, where the upper level seeks to reduce bias while the lower level seeks to reduce variance. This algorithm is evaluated empirically on datasets generated from a mixture of Markov trees and from other synthetic densities. [less ▲]

Detailed reference viewed: 70 (5 ULg)
Full Text
Peer Reviewed
See detailComparator selection for RPC with many labels
Hiard, Samuel ULg; Geurts, Pierre ULg; Wehenkel, Louis ULg

in ECAI 2012 : 20th European Conference on Artificial Intelligence : 27-31 August 2012, Montpellier, France (2012, August)

The Ranking by Pairwise Comparison algorithm (RPC) is a well established label ranking method. However, its complexity is of O(N²) in the number N of labels. We present algorithms for selection, before ... [more ▼]

The Ranking by Pairwise Comparison algorithm (RPC) is a well established label ranking method. However, its complexity is of O(N²) in the number N of labels. We present algorithms for selection, before model construction, a subset of comparators of size O(N), to reduce the computational complexity without loss in accuracy. [less ▲]

Detailed reference viewed: 70 (20 ULg)