Influence function of the error rate of classification based on clusteringRuwet, Christel ; Haesbroeck, Gentiane ![]() Conference (2009, May 19) Cluster analysis may be performed when one wishes to group similar objects into a given number of clusters. Several algorithms are available in order to construct these clusters. In this talk, focus will ... [more ▼] Cluster analysis may be performed when one wishes to group similar objects into a given number of clusters. Several algorithms are available in order to construct these clusters. In this talk, focus will be on two particular cases of the generalized k-means algorithm : the classical k-means procedure as well as the k-medoids algorithm, while the data of interest are assumed to come from an underlying population consisting of a mixture of two groups. Among the outputs of these clustering techniques, a classification rule is provided in order to classify the objects into one of the clusters. When classification is the main objective of the statistical analysis, performance is often measured by means of an error rate. Two types of error rates can be computed: a theoretical one and a more empirical one. The first one can be written as ER(F, Fm) where F is the distribution of the training sample used to set up the classification rule and Fm (model distribution) is the distribution under which the quality of the rule is assessed (via a test sample). The empirical error rate corresponds to ER(F, F), meaning that the classification rule is tested on the same sample as the one used to set up the rule. This talk will present the results concerning the theoretical error rate. In case there are some outliers in the data, the classification rule may be corrupted. Even if it is evaluated at the model distribution, the theoretical error rate may then be contaminated. To measure the robustness of classification based on clustering, influence functions have been computed. Similar results as those derived by Croux et al (2008) and Croux et al (2008) in discriminant analysis were observed. More specifically, under optimality (which happens when the model distribution is FN = 0.5 N(μ1, σ) + 0.5 N(μ2, σ), Qiu and Tamhane 2007), the contaminated error rate can never be smaller than the optimal value, resulting in a first order influence function identically equal to 0. Second order influence functions need then to be computed. When the optimality does not hold, the first order influence function of the theoretical error rate does not vanish anymore and shows that contamination may improve the error rate achieved under the non-optimal model. The first and, when required, second order influence functions of the theoretical error rate are useful in their own right to compare the robustness of the 2-means and 2-medoids classification procedures. They have also other applications. For example, they may be used to derive diagnostic tools in order to detect observations having an unduly large influence on the error rate. Also, under optimality, the second order influence function of the theoretical error rate can yield asymptotic relative classification efficiencies. [less ▲] Detailed reference viewed: 50 (23 ULg) Influence function of the error rate of the generalized k-meansRuwet, Christel ; Haesbroeck, Gentiane ![]() Scientific conference (2009, March 30) Cluster analysis may be performed when one wishes to group similar objects into a given number of clusters. Several algorithms are available in order to construct these clusters. In this talk, focus will ... [more ▼] Cluster analysis may be performed when one wishes to group similar objects into a given number of clusters. Several algorithms are available in order to construct these clusters. In this talk, focus will be on two particular cases of the generalized k-means algorithm: the classical k-means procedure as well as the k-medoids algorithm. Among the outputs of these clustering techniques, a classification rule is provided in order to classify the objects into one of the clusters. When classification is the main objective of the statistical analysis, performance is often measured by means of an error rate. In the clustering setting, the error rate has to be measured on the training sample while test samples are usually used in other settings like linear discrimination or logistic discrimination. This characteristic of classification resulting from a clustering implies that contamination in the training sample may not only affect the classification rule but also other parameters involved in the error rate. In the talk, influence functions will be used to measure the impact of contamination on the error rate and will show that contamination may decrease the error rate that one would expect under a given model. Moreover, a kind of second-order influence functions will also be derived to measure the bias in error rate the k-means and k-medoids procedures suffer from in finite-samples. Simulations will confirm the results obtained via the first and second-order influence functions. Future research perspectives will conclude the talk. [less ▲] Detailed reference viewed: 8 (0 ULg) The influence function of the TCLUST robust clustering procedureRuwet, Christel ; ; et alin Advances in Data Analysis and Classification [=ADAC] (2012), 6(2), 107-130 The TCLUST procedure performs robust clustering with the aim of finding clusters with different scatter structures and proportions. An Eigenvalue Ratio constraint is considered by TCLUST in order to avoid ... [more ▼] The TCLUST procedure performs robust clustering with the aim of finding clusters with different scatter structures and proportions. An Eigenvalue Ratio constraint is considered by TCLUST in order to avoid finding spurious clusters. In order to guarantee the robustness of the method against the presence of outliers and background noise, the method allows for trimming of a given proportion of observations self determined by the data. This article studies robustness properties of the TCLUST procedure by means of the influence function, obtaining a robustness behavior close to that of the trimmed k-means. [less ▲] Detailed reference viewed: 28 (7 ULg) Influence functions of the error rates of classification based on clusteringRuwet, Christel ; Haesbroeck, Gentiane ![]() Poster (2009, May) Cluster analysis may be performed when one wishes to group similar objects into a given number of clusters. Several algorithms are available in order to construct these clusters. In this poster, focus ... [more ▼] Cluster analysis may be performed when one wishes to group similar objects into a given number of clusters. Several algorithms are available in order to construct these clusters. In this poster, focus will be on two particular cases of the generalized k-means algorithm : the classical k-means procedure as well as the k-medoids algorithm, while the data of interest are assumed to come from an underlying population consisting of a mixture of two groups. Among the outputs of these clustering techniques, a classification rule is provided in order to classify the objects into one of the clusters. When classification is the main objective of the statistical analysis, performance is often measured by means of an error rate. Two types of error rates can be computed : a theoretical one and a more empirical one. The first one can be written as ER(F, Fm) where F is the distribution of the training sample used to set up the classification rule and Fm (model distribution) is the distribution under which the quality of the rule is assessed (via a test sample). The empirical error rate corresponds to ER(F, F), meaning that the classification rule is tested on the same sample as the one used to set up the rule. In case there are some outliers in the data, the classification rule may be corrupted. Even if it is evaluated at the model distribution, the theoretical error rate may then be contaminated, while the effect of contamination on the empirical error rate is two-fold : the rule but also the test sample are contaminated. To measure the robustness of classification based on clustering, influence functions have been computed, both for the theoretical and the empirical error rates. When using the theoretical error rate, similar results as those derived by Croux et al (2008) and Croux et al (2008) in discriminant analysis were observed. More specifically, under optimality (which happens when the model distribution is FN = 0.5N(μ1, ) + 0.5N(μ2, ), Qiu and Tamhane 2007), the contaminated error rate can never be smaller than the optimal value, resulting in a first order influence function identically equal to 0. Second order influence functions would then need to be computed, as this will be done in future research. When the optimality does not hold, the first order influence function of the theoretical error rate does not vanish anymore and shows that contamination may improve the error rate achieved under the non-optimal model. Similar computations have been performed for the empirical error rate, as the poster will show. The first and, when required, second order influence functions of the theoretical and empirical error rates are useful in their own right to compare the robustness of the 2-means and 2-medoids classification procedures. They have also other applications. For example, they may be used to derive diagnostic tools in order to detect observations having an unduly large influence on the error rate. Also, under optimality, the second order influence function of the theoretical error rate can yield asymptotic relative classification efficiencies. [less ▲] Detailed reference viewed: 46 (24 ULg) Influence Lianol Solapro on sow colostrums production; ; Renaville, Robert et alin 21 Inter. Pig Vet Society Congress (2010) Detailed reference viewed: 32 (3 ULg) Influence Molecular Arrangement in Self-assembled Monolayers on Adhesion Forces Measured by Chemical Force MicroscopyDuwez, Anne-Sophie ; ; in Chemphyschem : A European Journal of Chemical Physics and Physical Chemistry (2003), 4 Detailed reference viewed: 3 (0 ULg) Influence of 2D and 3D images on performance and time estimation in minimal invasive surgeryBlavier, Adelaïde ; Nyssen, Anne-Sophie ![]() in Ergonomics (2009), 52(11), 13421349 This study aimed to evaluate the impact of 2D and 3D images on time performance and time estimation during a surgical motor task. 60 subjects without any surgical experience (nurses) and 20 expert ... [more ▼] This study aimed to evaluate the impact of 2D and 3D images on time performance and time estimation during a surgical motor task. 60 subjects without any surgical experience (nurses) and 20 expert surgeons performed a fine surgical task with a new laparoscopic technology (da Vinci robotic system). The 80 subjects were divided into two groups, one using 3D view option and another using 2D view option. We measured time performance and asked subjects to verbally estimate their time performance. Our results showed faster performance in 3D than in 2D view for novice subjects while the performance in 2D and 3D was similar in the expert group. We obtained a significant interaction between time performance and time evaluation: in 2D condition, all subjects accurately estimated their time performance while they overestimated it in the 3D condition. Our results emphasize the role of 3D in improving performance and the contradictory feeling about time evaluation in 2D and 3D. This finding is discussed in regard with the retrospective paradigm and suggests that 2D and 3D images are differently processed and memorised. [less ▲] Detailed reference viewed: 23 (3 ULg) Influence of 3 stages of maturity on total yield and on animal performance of growing fattening bulls offered a maize silage based dietDufrasne, Isabelle ; ; Istasse, Louis et alin Proceedings of the 43th Annual Meeting of E.A.A.P. (1992) Detailed reference viewed: 3 (2 ULg) Influence of a blend of fructo-oligosaccharides and sugar beet fiber on nutrient digestibility and plasma metabolites concentrations in healthy BeaglesDiez, Marianne ; Hornick, Jean-Luc ; Baldwin, Paule et alin American Journal of Veterinary Research (1997), 58 Objective-To evaluate effects of a blend of fructo-oligosaccharides and sugar beet fiber (4:1) at 3 incorporation rates on nutrient digestibility and plasma glucose, insulin, alpha-aminonitrogen, urea ... [more ▼] Objective-To evaluate effects of a blend of fructo-oligosaccharides and sugar beet fiber (4:1) at 3 incorporation rates on nutrient digestibility and plasma glucose, insulin, alpha-aminonitrogen, urea, cholesterol, and triglycerides concentrations measured weekly in nonfed dogs and during a 360-minute period after a meal. Animals-8 castrated 1 to 1.4-year-old young adult male Beagles weighing 10.0 to 13.5 kg. Procedure-Diets containing 2 incorporation rates of a blend of fructo-oligosaccharides and sugar beet fiber (5 and 10% on a dry matter basis [diets B and C, respectively]) were compared with a control diet without additional fiber (diet A). The 3 diets were evaluated for ability to modify digestibility of dry and organic matter, protein, fat, and ash and for effects on plasma glucose, insulin, alpha-aminonitrogen, urea, cholesterol, and triglycerides concentrations. Each diet was fed for 6 weeks; plasma samples were collected weekly before feeding and after feeding on the last day of the period, During 1 week at the end of the 6-week period, dogs were kept in metabolic cages. Each period of the block was followed by a 4-week washout period. Results-Incorporating the blend of fructo-oiigosaccharides and sugar beet fiber in the diet was associated with greater passage of wet feces (diets B and C) and lower protein digestibility (diet C). Postprandial glucose (diet C), urea (diets B and C) and triglyceride (diets B and C) concentrations were significantly (P < 0.01) decreased. Weekly preprandial measurements were characterized by decreased urea (diets B and C), cholesterol (diet C), and triglycerides (diets B and C) concentrations (P < 0.001). Conclusion-Chronic consumption of fermentable fiber is associated with mildly decreased protein digestibility and with metabolic effects in nonfed or fed dogs. Clinical Relevance-A blend of fructo-oligosaccharides and sugar beet fiber should he tested as a dietary aid for treatment of chronic diseases, such as diabetes mellitus or hyperlipidemia, in dogs. [less ▲] Detailed reference viewed: 23 (2 ULg) The influence of a grain boundary on the thermal transport properties of bulk, melt-processed Y-Ba-Cu-O; Fagnard, Jean-François ; et alin Superconductor Science and Technology (2013), 26 We report the dependence of thermal conductivity, thermoelectric power and electrical resistivity on temperature for a bulk, large grain melt-processed Y-Ba-Cu-O (YBCO) high temperature superconductor ... [more ▼] We report the dependence of thermal conductivity, thermoelectric power and electrical resistivity on temperature for a bulk, large grain melt-processed Y-Ba-Cu-O (YBCO) high temperature superconductor (HTS) containing two grains separated by a well-defined grain boundary. Transport measurements at temperatures between 10 and 300 K were carried out both within one single grain (intra-granular properties) and across the grain boundary (inter-granular properties). The influence of an applied external magnetic field of up to 8 T on the measured sample properties was also investigated. The presence of the grain boundary is found to affect strongly the electrical resistivity of the melt-processed bulk sample, but has almost no effect on its thermoelectric power and thermal conductivity, within experimental error. The results of this study provide direct evidence that the heat flow in multi-granular melt-processed YBCO bulk samples should be virtually unaffected by the presence of grain boundaries in the material. © 2013 IOP Publishing Ltd. [less ▲] Detailed reference viewed: 49 (13 ULg) Influence of a hormonal preparation containing glucocorticoids (dexamethasone esters), progestagen (chlormadinone acetate) and oestrogen (ethynyl oestradiol) on testosterone, insulin-like growth factor-1 (IGF-1), IGF binding proteins and spermatogenic cells in finishing bulls.Renaville, Robert ; ; Lognay, Georges et alin Animal Production (1994), 59 Detailed reference viewed: 7 (1 ULg) Influence of a low magnetic field on the thermal diffusivity of Bi2Sr2CaCu2O8Dorbolo, Stéphane ; Ausloos, Marcel ![]() in Physical Review B (2002), 65(21), The thermal diffusivity of a Bi-2212 polycrystalline sample has been measured under a 1 T magnetic field applied perpendicularly to the heat flux. The magnetic contribution to the heat carrier mean free ... [more ▼] The thermal diffusivity of a Bi-2212 polycrystalline sample has been measured under a 1 T magnetic field applied perpendicularly to the heat flux. The magnetic contribution to the heat carrier mean free path has been extracted and is found to behave as a simple power law. This behavior can be attributed to a percolation process of electrons in the vortex lattice created by the magnetic field. [less ▲] Detailed reference viewed: 6 (0 ULg) Influence of a medication history and a pharmaceutical opinion at admission of geriatric hospitalized patients on inappropriate drug prescribingSamalea Suarez, Audrey ; Petermans, Jean ; Van Hees, Thierry ![]() in International Journal of Clinical Pharmacy (2011, April), 33(2), 432-433 Detailed reference viewed: 33 (15 ULg) Influence Of A New Axial Impeller On K(L)A And Xylanase Production By Penicillium Canescens 10-10c; ; et al in Applied Biochemistry and Biotechnology (2002), 98 The effects of a new axial impeller (HTPG4) on oxygen volumetric transfer coefficient, KLa, and xylanase production by Penicillium canescens 10-10c were studied and compared for dual-impeller systems, one ... [more ▼] The effects of a new axial impeller (HTPG4) on oxygen volumetric transfer coefficient, KLa, and xylanase production by Penicillium canescens 10-10c were studied and compared for dual-impeller systems, one with one DT4 impeller below and one HTPG4 above (DT4-HTPG4) and one with two DT4 (DT4-DT4) impellers, in a 5-L bioreactor. The volumetric coefficient of oxygen transfer was measured in culture medium using a gassing-out method at different gassing rates and agitation speeds. We observed that the DT4-HTPG4 combination provided better KLa performance than the DT4-DT4 combination. The two combinations were also tested for their influence on xylanase production by a filamentous microorganism; P. canescens 10-10c. These experiments demonstrated that the DT4-HTPG4 combination impeller enhanced enzyme production up to 23% compared with the DT4-DT4 combination at an aeration rate of 1 vvm and an agitation speed of 600 rpm. The main cause for this difference is thought to be a higher shear stress generated by the DT4-DT4 combination, which damages the mycelium of P. canescens and decreases xylanase production. [less ▲] Detailed reference viewed: 18 (0 ULg) Influence of a nonlinear reference temperature profile on oscillatory Benard-Marangoni convection.; ; Dauby, Pierre ![]() in Physical Review. E : Statistical, Nonlinear, and Soft Matter Physics (2003), 68(6 Pt 2), 066310 We analyze oscillatory instabilities in a fluid layer of infinite horizontal extent, heated from above or cooled from below, taking into account the nonlinearity of the reference temperature profile ... [more ▼] We analyze oscillatory instabilities in a fluid layer of infinite horizontal extent, heated from above or cooled from below, taking into account the nonlinearity of the reference temperature profile during the transient state of heat conduction. The linear stability analysis shows that a nonlinear reference temperature profile can have a strong effect on the system, either stabilizing or destabilizing, depending on the relative importance of buoyancy and surface tension forces. For the nonlinear analysis we use a Galerkin-Eckhaus method leading to a finite set of amplitude equations. In the two-dimensional (2D) case, we show the solution of these amplitude equations are standing waves. [less ▲] Detailed reference viewed: 7 (0 ULg) Influence of a reduced gravity on the volume fraction of a monolayer of spherical grainsDorbolo, Stéphane ; ; Ludewig, François et alin Physical Review. E : Statistical, Nonlinear, and Soft Matter Physics (2011), 84 Detailed reference viewed: 32 (6 ULg) Influence of a short time specific strength training on iso-inertial performancesJidovtseff, Boris ; Croisier, Jean-Louis ; et alin Abstract Book of the 4th International Conference on Strength Training (Serres, Greece) (2004, November) Detailed reference viewed: 23 (1 ULg) Influence of a static wing wake on the stall flutter behavior of a flexible wing; Dimitriadis, Grigorios ![]() in Proceedings of the 13th International Conference on Wind Engineering, ICWE13 (2011, July 13) The subject of this paper is the experimental study of the aeroelastic behavior of a wing undergoing stall flutter in the vicinity of second, static wing. While stall flutter has been the subject of ... [more ▼] The subject of this paper is the experimental study of the aeroelastic behavior of a wing undergoing stall flutter in the vicinity of second, static wing. While stall flutter has been the subject of several investigations, such work has almost always concentrated on isolated wings. Stall flutter is a phenomenon that is mostly encountered in rotating blades, such as wind turbine or helicopter blades. In such cases, the phenomenon is influenced by the wake of the preceding blade. This paper presents a series of experiments carried out at the Goldstein Laboratory of the University of Manchester, concerning the phenomenon of stall flutter influenced by the proximity of a static wing. The work is an extension of the single wing stall flutter experiments presented by Dimitriadis and Li (2009). [less ▲] Detailed reference viewed: 36 (1 ULg) Influence of a water purification unit on the contamination level of salmonella in outcoming water and sludge; Korsak Koulagenko, Nicolas ; et alin Annales de Médecine Vétérinaire (2002), 146(5), 303-310 Foodborne pathogens occasionally harboured in the gastro-intestinal tract of some domestic animals may be retrieved in slaughterhouses waste water and in sludge of water purification units. Salmonella ... [more ▼] Foodborne pathogens occasionally harboured in the gastro-intestinal tract of some domestic animals may be retrieved in slaughterhouses waste water and in sludge of water purification units. Salmonella, athogen common to man and Animals, is often used as a biological risk indicator. The aim of the present study was to assess effectiveness of a recent water purification unit by rapid and semi-quantitative detection of this micro-organism. The water purification unit collects waste water of seven food-processing industries, including a pig slaughterhouse. This latest was the main source of Salmonella contamination with a level of more 103 colony forming unit (CFU) per ml, that was very close to the average level of contamination of total incoming waste water. The unit, whose principle of action is based on biological purification, has permitted a 4 log10 reduction of Salmonella contamination, with a final contamination level of less than 1 CFU/ml. It was observed only a little decrease of contamination levels during the biological treatment steps, in contrast with the one observed during the clarification step. This was due to adsorption of bacteria by material in suspension. Fresh sludge were harboured an average of 102 CFU of Salmonella per gram. In the beginning of December 1999, two sludge piles were let on the ground along a field and sampled microbiologically every month. Seven month later, no Salmonella were recovered from the piles. 309 [less ▲] Detailed reference viewed: 79 (7 ULg) The influence of acid rains on the geochemistry of aluminium; ; Debbaut, Vincent et alin Scope Belgium : proceedings : acid deposition and sulphur cycle (1984, June) Detailed reference viewed: 10 (1 ULg) |
||