Reference : Classification performance resulting from of 2-means
Scientific journals : Article
Physical, chemical, mathematical & earth Sciences : Mathematics
http://hdl.handle.net/2268/81354
Classification performance resulting from of 2-means
English
Ruwet, Christel mailto [Université de Liège - ULg > Département de mathématique > Statistique mathématique >]
Haesbroeck, Gentiane mailto [Université de Liège - ULg > Département de mathématique > Statistique mathématique >]
Feb-2013
Journal of Statistical Planning & Inference
Elsevier Science
143
2
408-418
Yes (verified by ORBi)
International
0378-3758
[en] Asymptotic loss ; Cluster analysis ; Error rate ; k-means ; Influence function ; Principal points ; Robustness
[en] The k-means procedure is probably one of the most common nonhierachical clustering techniques. From a theoretical point of view, it is related to the search for the k principal points of the underlying distribution. In this paper, the classification resulting from that procedure for k=2 is shown to be optimal under a balanced mixture of two spherically symmetric and homoscedastic distributions. Then, the classification efficiency of the 2-means rule is assessed using the second order influence function and compared to the classification efficiencies of the Fisher and logistic discriminations. Influence functions are also considered here to compare the robustness to infinitesimal contamination of the 2-means method w.r.t. the generalized 2-means technique.
http://hdl.handle.net/2268/81354
10.1016/j.jspi.2012.08.004
http://dx.doi.org/10.1016/j.jspi.2012.08.004

File(s) associated to this reference

Fulltext file(s):

FileCommentaryVersionSizeAccess
Restricted access
Classification Performance resulting from a 2-means_R3.pdfAuthor postprint238.96 kBRequest copy
Open access
Classification Performance resulting from a 2-means.pdfAuthor preprint317.65 kBView/Open

Bookmark and Share SFX Query

All documents in ORBi are protected by a user license.