In robust analysis, the minimum volume ellipsoid (MVE) estimator is widely used to estimate both multivariate location and scatter. The MVE scatter estimator is defined by the smallest ellipsoid covering half of the observations, and the MVE location estimator is the center of that ellipsoid. Computing the MVE estimators exactly requires minimizing a criterion over a high-dimensional space. In practice, one mostly uses algorithms that minimize the objective function over a sequence of trial estimates. One of these algorithms uses a resampling scheme and yields the (p + 1)-subset estimator. In this note, we show how this estimator can easily be adapted to yield a considerable increase in statistical efficiency in finite samples. This gain in precision is also observed when sampling from contaminated distributions, and it becomes larger as the dimension increases. The adaptation requires no additional computation time and sacrifices no robustness properties. Moreover, only a few lines have to be added to existing computer programs. The key idea is to average over several trials close to the optimum, instead of picking only the trial with the lowest value of the objective function. The resulting estimator keeps the equivariance and robustness properties of the original MVE estimator. This idea can also be applied to several other robust estimators, including least-trimmed-squares regression.
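To make the key idea concrete, the following is a minimal Python sketch of a (p + 1)-subset resampling scheme with averaging over the best trials. It is an illustration under stated assumptions, not the authors' implementation: the function name `averaged_mve`, the parameters `n_trials` and `n_avg`, the coverage count h = floor((n + p + 1)/2), and the plain unweighted mean over the `n_avg` lowest-volume trials are all hypothetical choices, and no consistency factor is applied to the scatter matrix. Setting `n_avg=1` gives the usual rule of keeping only the single best trial.

```python
import numpy as np


def averaged_mve(X, n_trials=500, n_avg=10, seed=0):
    """Sketch of a (p+1)-subset MVE search that averages the best trials.

    Hypothetical helper: the averaging rule (plain mean of the n_avg
    lowest-volume trials) is an assumption, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    h = (n + p + 1) // 2  # number of points the ellipsoid must cover (assumed)
    trials = []
    for _ in range(n_trials):
        # Draw p+1 observations and use their mean and covariance as a trial fit.
        idx = rng.choice(n, size=p + 1, replace=False)
        center = X[idx].mean(axis=0)
        cov = np.atleast_2d(np.cov(X[idx], rowvar=False))
        sign, logdet = np.linalg.slogdet(cov)
        if sign <= 0:
            continue  # degenerate subset, skip this trial
        # Squared Mahalanobis distances of all observations to the trial fit.
        diff = X - center
        d2 = np.einsum("ij,ij->i", diff @ np.linalg.inv(cov), diff)
        # Inflate the trial ellipsoid until it covers h observations.
        m2 = np.partition(d2, h - 1)[h - 1]
        if m2 <= 0:
            continue
        # Log of the inflated ellipsoid's volume, up to an additive constant.
        log_volume = 0.5 * (p * np.log(m2) + logdet)
        trials.append((log_volume, center, m2 * cov))
    if not trials:
        raise ValueError("all trial subsets were degenerate")
    # Average the n_avg trials closest to the optimum instead of keeping one.
    trials.sort(key=lambda t: t[0])
    best = trials[:n_avg]
    location = np.mean([t[1] for t in best], axis=0)
    scatter = np.mean([t[2] for t in best], axis=0)
    return location, scatter
```

Because every trial is already evaluated in the standard algorithm, the averaging step only adds a sort and two means, which is in line with the claim that a few extra lines suffice. A call such as `averaged_mve(X, n_trials=500, n_avg=10)` returns a location vector of length p and a p-by-p scatter matrix.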