References of "Mahachie John, Jestinah"
     in
Bookmark and Share    
Full Text
Peer Reviewed
See detailTherapeutic Strategy and Patient Outcome during the First 2 Years of Pediatric Crohn’s Disease
Veereman, G; Mahachie John, Jestinah ULg; De Greef, E et al

in Journal of Pediatric Gastroenterology and Nutrition (2013, May)

Detailed reference viewed: 9 (0 ULg)
Full Text
Peer Reviewed
See detailAn efficient algorithm to perform multiple testing in epistasis screening
Van Lishout, François ULg; Mahachie John, Jestinah ULg; Gusareva, Elena ULg et al

in BMC Bioinformatics (2013), 14

Background: Research in epistasis or gene-gene interaction detection for human complex traits has grown over the last few years. It has been marked by promising methodological developments, improved ... [more ▼]

Background: Research in epistasis or gene-gene interaction detection for human complex traits has grown over the last few years. It has been marked by promising methodological developments, improved translation efforts of statistical epistasis to biological epistasis and attempts to integrate different omics information sources into the epistasis screening to enhance power. The quest for gene-gene interactions poses severe multiple-testing problems. In this context, the maxT algorithm is one technique to control the false-positive rate. However, the memory needed by this algorithm rises linearly with the amount of hypothesis tests. Gene-gene interaction studies will require a memory proportional to the squared number of SNPs. A genome-wide epistasis search would therefore require terabytes of memory. Hence, cache problems are likely to occur, increasing the computation time. In this work we present a new version of maxT, requiring an amount of memory independent from the number of genetic effects to be investigated. This algorithm was implemented in C++ in our epistasis screening software MBMDR-3.0.3. We evaluate the new implementation in terms of memory efficiency and speed using simulated data. The software is illustrated on real-life data for Crohn's disease. Results: In the case of a binary (affected/unaffected) trait, the parallel workflow of MBMDR-3.0.3 analyzes all gene-gene interactions with a dataset of 100,000 SNPs typed on 1000 individuals within 4 days and 9 hours, using 999 permutations of the trait to assess statistical significance, on a cluster composed of 10 blades, containing each four Quad-Core AMD Opteron Processor 2352 2.1 GHz. In the case of a continuous trait, a similar run takes 9 days. Our program found 14 SNP-SNP interactions with a multiple-testing corrected p-value of less than 0.05 on real-life Crohn's disease data. Conclusions: Our software is the first implementation of the MB-MDR methodology able to solve large-scale SNP-SNP interactions problems within a few days, without using much memory, while adequately controlling the type I error rates. A new implementation to reach genome-wide epistasis screening is under construction. In the context of Crohn's disease, MBMDR-3.0.3 could identify epistasis involving regions that are well known in the field and could be explained from a biological point of view. This demonstrates the power of our software to find relevant phenotype-genotype higher-order associations. [less ▲]

Detailed reference viewed: 10 (3 ULg)
Full Text
Peer Reviewed
See detailTherapeutic Strategy and Patient Outcome during the First 2 Years of Pediatric Crohn’s Disease
Veereman, G; Mahachie John, Jestinah ULg; De Greef, E et al

in Acta Gastroenterologica Belgica (2013, February)

Detailed reference viewed: 5 (0 ULg)
Full Text
See detailGenomic Association Screening Methodology for High-Dimensional and Complex Data Structures: Detecting n-Order Interactions
Mahachie John, Jestinah ULg

Doctoral thesis (2012)

We developed a data-mining method, Model-Based Multifactor Dimensionality Reduction (MB-MDR) to detect epistatic interactions for different types of traits. MB-MDR enables the fast identification of gene ... [more ▼]

We developed a data-mining method, Model-Based Multifactor Dimensionality Reduction (MB-MDR) to detect epistatic interactions for different types of traits. MB-MDR enables the fast identification of gene-gene interactions among 1000nds of SNPs, without the need to make restrictive assumptions about the genetic modes of inheritance. This thesis primarily focused on applying Model-Based Multifactor Dimensionality Reduction for quantitative traits, its performance and application to a variety of data problems. We carried out several simulation studies to evaluate quantitative MB-MDR in terms of power and type I error, when data are noisy, non-normal or skewed and when important main effects are present. Firstly, we assessed the performance of MB-MDR in the presence of noisy data. The error sources considered were missing genotypes, genotyping error, phenotypic mixtures and genetic heterogeneity. Results from this study showed that MB-MDR is least affected by presence of small percentages of missing data and genotyping errors but much affected in the presence of phenotypic mixtures and genetic heterogeneity. This is in line with a similar study performed for binary traits. Although both Multifactor Dimensionality Reduction (MDR) and MB-MDR are data reduction techniques with a common basis, their ways of deriving significant interactions are substantially different. Nevertheless, effects on power of introducing error sources were quite similar. Irrespective of the trait under consideration, epistasis screening methodologies such as MB-MDR and MDR mainly suffer from the presence of phenotypic mixtures and genetic heterogeneity. Secondly, we extensively addressed the issue of adjusting for lower-order genetic effects during epistasis screening, using different adjustment strategies for SNPs in the functional SNP-SNP interaction pair, and/or for additional important SNPs. Since, in this thesis, we restrict attention to 2-locus interactions only, adjustment for lower-order effects always (and only) implies adjustment for main genetic effects. Unfortunately most data dimensionality reduction techniques based on MDR do not explicitly require that lower-order effects are included in the ‘model’ when investigating higher-order effects (a prerequisite for most traditional, especially regression-based, methods). However, epistasis results may be hampered by the presence of significant lower-order effects. Results from this study showed hugely increased type I errors when main effects were not taken into account or were not properly accounted for. We observed that additive coding (the most commonly used coding in practice) in main effects adjustment does not remove all of the potential main effects that deviate from additive genetic variance. In addition, also adjusting for main effects prior to MB-MDR (via a regression framework), whatever coding is adopted, does not control type I error in all scenarios. From this study, we concluded that correction for lower-order effects should preferentially be done via codominant coding, to reduce the chance of false positive epistasis findings. The recommended way of performing an MB-MDR epistasis screening is to always adjust the analysis for lower-order effects of the SNPs under investigation, “on-the-fly”. This correction avoids overcorrection for other SNPs, which are not part of the interacting SNP pair under study. Thirdly, we assessed the cumulative effect of trait deviations from normality and homoscedasticity on the overall performance of quantitative MB-MDR to detect 2-locus epistasis signals in the absence of main effects. Although MB-MDR itself is a non-parametric method, in the sense that no assumptions are made regarding genetic modes of inheritance, the data reduction part in MB-MDR relies on association tests. In particular, for quantitative traits, the default MB-MDR way is to use the Student’s t-test (steps 1 and 2 of MB-MDR). Also when correcting for lower-order effects during quantitative MB-MDR analysis, we intrinsically maneuver within a regression framework. Since the Student’s t-statistic is the square root of the ANOVA F-statistic. Hence, along these lines, for MB-MDR to give valid results, ANOVA assumptions have to be met. Therefore, we simulated data from normal and non-normal distributions, with constant and non-constant variances, and performed association tests via the student’s t-test as well as the unequal variance t-test, commonly known as the Welch’s t-test. At first somewhat surprising, the results of this study showed that MB-MDR maintains adequate type I errors, irrespective of data distribution or association test used. On the other hand, MB-MDR give rise to lower power results for non-normal data compared to normal data. With respect to the association tests used within MB-MDR, in most cases, Welch’s t-test led to lower power compared to student’s t-test. To maintain the balance between power and type I error, we concluded that when performing MB-MDR analysis with quantitative traits, one ideally first rank-transforms traits to normality and then applies MB-MDR modeling with Student’s t-test as choice of association test. Clearly, before embarking on using a method in practice, there is a need to extensively check the applicability of the method to the data at hand. This is a common practice in biostatistics, but often a forgotten standard operating procedure in genetic epidemiology, in particular in GWAI studies. In addition to the presentation of extensive simulation studies, we also presented some MB-MDR applications to real-life data problems. These analyses involved MB-MDR analyses on quantitative as well as binary complex disease traits, primarily in the context of asthma/allergy and Crohn’s disease. In two of the presented analyses, MB-MDR confirmed logistic regression and transmission disequilibrium test (TDT) results. Part of the aforementioned methodological developments was initiated on the basis of observations of MB-MDR behavior on real-life data. Both the practical and theoretical components of this thesis confirm our belief in the potential of MB-MDR as a promising and versatile tool for the identification of epistatic effects, irrespective of the design (family-based or unrelated individuals) and irrespective of the targeted disease trait (binary, continuous, censored, categorical, multivariate). A thorough characterization of the different faces of MB-MDR this versatility gives rise to is work in progress. [less ▲]

Detailed reference viewed: 76 (6 ULg)
Peer Reviewed
See detailApplication of mixed polygenic model to control for cryptic/genuine relatedness and population stratification.
Gusareva, Elena ULg; Mahachie John, Jestinah ULg; Isaacs, Aaron et al

Poster (2012, March 12)

In genome-wide association studies (GWAs), population stratification may cause inflated type I errors and overly-optimistic test results, when not properly corrected for. During the past decade, several ... [more ▼]

In genome-wide association studies (GWAs), population stratification may cause inflated type I errors and overly-optimistic test results, when not properly corrected for. During the past decade, several methods have been proposed for association testing in the presence of population stratification. Among these, principal components-based approaches are the most popular. Principal component analysis (PCA) allows data transformation to a new coordinate system such that the projection of the data along the first new coordinate (called the PC1) has the largest variance; the second PC has the second largest variance, and so on. In practice, two components are usually enough to adjust or to control for population stratification. They can easily be included in parametric association models as covariates. Despite the success of this strategy, there are still some caveats which need further attention. Among these are that principal component-based methods generally do not account for cryptic relatedness (kinship) between supposedly unrelated individuals, are not straightforwardly adapted to accommodate family-based designs or mixtures of families and unrelated individuals, and do not always take proper account of the trait under investigation. In this work, we present an easy-to-use alternative that addresses the aforementioned issues. For quantitative traits, we propose to first use the mixed polygenic model (possibly taking into account important non-genetic confounders as covariates), second to derive “polygenic” residuals from this model – hereby removing genomic kinship relationships, and third to consider these residuals as new traits in a classical genome-wide QTL analysis for “unrelated individuals”. The polygenic component of the aforementioned mixed polygenic model describes the contribution from multiple independently segregating genes, all having a small additive effect on the trait under investigation. Via an extensive simulation study, with various settings of population stratification and admixture, we show that this approach not only removes most of the “relatedness” between individuals (cryptic relatedness or known relatedness), but also removes most of the remaining substructures caused by population stratification or admixture. As a proof of concept, we demonstrate the efficiency of this robust method to control for population stratification on real-life genome-scale data from the SNP Health Association Resource (SHARe) Asthma Resource project (SHARP) (dbGaP accession number phs000166.v2.p1). We also provide leads to extend this method to dichotomous traits. [less ▲]

Detailed reference viewed: 32 (14 ULg)
Full Text
Peer Reviewed
See detailSafety And Cost Of Infliximab For The Treatment Of Belgian Pediatric Patients With Crohn’s Disease
De Greef, E; Hoffman, I; D'haens, G et al

in Acta Gastro-Enterologica Belgica (2012, February)

Detailed reference viewed: 10 (3 ULg)
Full Text
Peer Reviewed
See detailLower-Order Effects Adjustment in Quantitative Traits Model-Based Multifactor Dimensionality Reduction
Mahachie John, Jestinah ULg; Cattaert, Tom ULg; Van Lishout, François ULg et al

in PLoS ONE (2012)

Identifying gene-gene interactions or gene-environment interactions in studies of human complex diseases remains a big challenge in genetic epidemiology. An additional challenge, often forgotten, is to ... [more ▼]

Identifying gene-gene interactions or gene-environment interactions in studies of human complex diseases remains a big challenge in genetic epidemiology. An additional challenge, often forgotten, is to account for important lower-order genetic effects. These may hamper the identification of genuine epistasis. If lower-order genetic effects contribute to the genetic variance of a trait, identified statistical interactions may simply be due to a signal boost of these effects. In this study, we restrict attention to quantitative traits and bi-allelic SNPs as genetic markers. Moreover, our interaction study focuses on 2- way SNP-SNP interactions. Via simulations, we assess the performance of different corrective measures for lower-order genetic effects in Model-Based Multifactor Dimensionality Reduction epistasis detection, using additive and co-dominant coding schemes. Performance is evaluated in terms of power and familywise error rate. Our simulations indicate that empirical power estimates are reduced with correction of lower-order effects, likewise familywise error rates. Easy-to-use automatic SNP selection procedures, SNP selection based on ‘‘top’’ findings, or SNP selection based on p-value criterion for interesting main effects result in reduced power but also almost zero false positive rates. Always accounting for main effects in the SNP-SNP pair under investigation during Model-Based Multifactor Dimensionality Reduction analysis adequately controls false positive epistasis findings. This is particularly true when adopting a co-dominant corrective coding scheme. In conclusion, automatic search procedures to identify lower-order effects to correct for during epistasis screening should be avoided. The same is true for procedures that adjust for lower-order effects prior to Model-Based Multifactor Dimensionality Reduction and involve using residuals as the new trait. We advocate using ‘‘on-the-fly’’ lower-order effects adjusting when screening for SNP-SNP interactions using Model-Based Multifactor Dimensionality Reduction analysis. [less ▲]

Detailed reference viewed: 43 (19 ULg)
Full Text
Peer Reviewed
See detailSafety and cost of infliximab for the treatment of Belgian pediatric patients with Crohn's disease.
De Greef, E.; Hoffman, I.; D'Haens, G. et al

in Acta Gastro-Enterologica Belgica (2012), 75(4), 425-31

Biologicals have become an important component in the treatment of Crohn's disease in children. Their increased and long term use raises safety concerns. We describe safety and cost of infliximab in ... [more ▼]

Biologicals have become an important component in the treatment of Crohn's disease in children. Their increased and long term use raises safety concerns. We describe safety and cost of infliximab in Belgian pediatric Crohn's disease patients. All patients on infliximab as part of the present or past treatment for Crohn's Disease until January 1st 2011 were selected from an existing database. Information on disease phenotype, medication and adverse events were extracted. Adverse events occurred in 25.9% of patients exposed to infliximab of which 29.6% were severe. In total 31.7% of patients stopped infliximab therapy. The main reasons for discontinuation were adverse events in 45.4% and loss of response in 30.3%. No malignancies or lethal complications occurred over this 241 patient year observation period. Immunomodulators were concomitant medication in 75% of patients and were discontinued subsequently in 38.4% of them. The cost of infliximab infusions per treated patient per year in the Belgian health care setting is approximately 9 474 euro, including only medication and hospital related costs. Even though infliximab is relatively safe in pediatric CD on the short term, close follow-up and an increased awareness of the possible adverse reactions is highly recommended. Adverse reactions appeared in 25.9% of all patients and were the main reason for discontinuation. Treatment cost has to be balanced against efficacy and modifications in disease course. In the Belgian health care system, the medication is available to all patients with moderate to severe CD. [less ▲]

Detailed reference viewed: 1 (0 ULg)
Full Text
Peer Reviewed
See detailSafety And Cost Of Infliximab For The Treatment Of Belgian Pediatric Patients With Crohn’s Disease
De Greef, E; Hoffman, I; D'haens, G et al

in Journal of Crohn’s and Colitis (2012)

Detailed reference viewed: 20 (1 ULg)
Peer Reviewed
See detailInference and comparison of different genetic stratification techniques
Maus, Bärbel ULg; Génin, Emmanuelle; Mahachie John, Jestinah ULg et al

Conference (2012)

Detailed reference viewed: 2 (0 ULg)
Full Text
Peer Reviewed
See detailClustering of Crohn’s disease patients: Identification of sub-phenotypes and population stratification
Maus, Bärbel ULg; Génin, Emmanuelle; Mahachie John, Jestinah ULg et al

in Genetic Epidemiology (2012), 36(7), 729

Detailed reference viewed: 12 (2 ULg)
Full Text
Peer Reviewed
See detailPets in Infancy - Asthma or Allergy at School Age? Pooled Analysis of Individual Participant Data from 11 European Birth Cohorts
Lødrup Carlsen, Karin; Roll, Stephanie; Carlsen, Kai-Håkon et al

in PLoS ONE (2012)

Detailed reference viewed: 12 (3 ULg)
Full Text
See detailAn Efficient Algorithm to Perform Multiple Testing in Epistasis Screening
Van Lishout, François ULg; Cattaert, Tom ULg; Mahachie John, Jestinah ULg et al

Conference (2011, December 13)

Background: Research in epistasis or gene-gene interaction detection for human complex traits has grown exponentially over the last few years. It has been marked by promising methodological developments ... [more ▼]

Background: Research in epistasis or gene-gene interaction detection for human complex traits has grown exponentially over the last few years. It has been marked by promising methodological developments, improved translation efforts of statistical epistasis to biological epistasis and attempts to integrate different omics information sources into the epistasis screening to enhance power. The quest for gene-gene interactions poses severe multiple-testing problems. In this context, the maxT algorithm is one technique to control the false-positive rate. However, the memory needed by this algorithm rises linearly with the amount of hypothesis tests. In main-effects detection, this is not a problem since the memory required is thus proportional to the number of SNPs. In contrast, gene-gene interaction studies will require a memory proportional to the squared amount of SNPs. A genome wide epistasis would therefore require terabytes of memory. Hence, cache problems are likely to occur, increasing the computation time. Methods: In this work we present a new version of maxT, requiring an amount of memory independent from the number of genetic effects to be investigated. This algorithm was implemented in C++ in our epistasis screening software MB-MDR-2.6.2 and compared to MB-MDR's first implementation as an R-package (Calle et al., Bioinformatics 2010). We evaluate the new implementation in terms of memory efficiency and speed using simulated data. The software is illustrated on real-life data for Crohn's disease. Results: The sequential version of MBMDR-2.6.2 is approximately 5,500 times faster than its R counterparts. The parallel version (tested on a cluster composed of 14 blades, containing each 4 quad-cores Intel Xeon CPU E5520@2.27 GHz) is approximately 900,000 times faster than the latter, for results of the same quality on the simulated data. It analyses all gene-gene interactions of a dataset of 100,000 SNPs typed on 1000 individuals within 4 days. Our program found 14 SNP-SNP interactions with a p-value less than 0.05 on the real-life Crohn’s disease data. Conclusions: Our software is able to solve large-scale SNP-SNP interactions problems within a few days, without using much memory. A new implementation to reach genome wide epistasis screening is under construction. In the context of Crohn's disease, MBMDR-2.6.2 found signal in regions well known in the field and our results could be explained from a biological point of view. This demonstrates the power of our software to find relevant phenotype-genotype associations. [less ▲]

Detailed reference viewed: 49 (22 ULg)
Full Text
Peer Reviewed
See detailComparison of genetic association strategies in the presence of rare alleles
Mahachie John, Jestinah ULg; Cattaert, Tom ULg; De Lobel, Lizzy et al

in BMC Proceedings (2011)

Detailed reference viewed: 14 (5 ULg)
Full Text
Peer Reviewed
See detailRate of Malignancies and Infections in a Large Single Center Cohort of IBD Patients Treated With Thiopurines and Anti-TNF-Antibodies.
Ochsenkühn, T; Steinborn, A; Beigel, F et al

in Gut (2011, October)

Detailed reference viewed: 12 (0 ULg)
Peer Reviewed
See detailA robustness study to investigate the performance of parametric and non-parametric tests used in Model-Based Multifactor Dimensionality Reduction Epistasis Detection.
Mahachie John, Jestinah ULg; Gusareva, Elena ULg; Van Lishout, François ULg et al

Poster (2011, September 19)

Model-Based Multifactor Dimensionality Reduction (MB-MDR) is data mining technique to identify gene-gene interactions among 1000nds of SNPs in a fast way, without making assumptions about the mode of ... [more ▼]

Model-Based Multifactor Dimensionality Reduction (MB-MDR) is data mining technique to identify gene-gene interactions among 1000nds of SNPs in a fast way, without making assumptions about the mode of genetic interactions. By construction, one of the implementations of MB-MDR involves testing one multi-locus genotype cell versus the remaining cells, hereby creating two imbalanced groups for trait distribution comparison. To date, for continuous traits, we have adopted a standard F-test to compare these groups. When normality assumption or homoscedasticity no longer hold, highly inflated results are to be expected. The power and type I error control of MB-MDR under these assumptions has been thoroughly investigated in Mahachie John et al [1]. The aim of this study is to assess, through simulations, the effects of ANOVA model violations on the performance of Model-Based Multifactor Dimensionality Reduction (MB-MDR). We quantify their effect on MB-MDR using default options, but at the same time introduce alternative options with increased performance. The better handling of imbalanced data using robust approaches [2] within a MB-MDR context is exemplified on real data for asthma-related phenotypes. 1. EJHG (2011), Early view 2. David Freedman, Statistical Models: Theory and Practice, Cambridge University Press (2000), ISBN 978-0521671057 [less ▲]

Detailed reference viewed: 17 (6 ULg)