Simultaneous model-based clustering and visualization in the Fisher discriminative subspace

Charles Bouveyron 1 Camille Brunet 2
2 TADIB
IBISC - Informatique, Biologie Intégrative et Systèmes Complexes
Abstract : Clustering in high-dimensional spaces is nowadays a recurrent problem in many scientific domains but remains a difficult task from both the clustering accuracy and the result understanding points of view. This paper presents a discriminative latent mixture (DLM) model which fits the data in a latent orthonormal discriminative subspace with an intrinsic dimension lower than the dimension of the original space. By constraining model parameters within and between groups, a family of 12 parsimonious DLM models is exhibited which allows to fit onto various situations. An estimation algorithm, called the Fisher-EM algorithm, is also proposed for estimating both the mixture parameters and the discriminative subspace. Experiments on simulated and real datasets show that the proposed approach performs better than existing clustering methods while providing a useful representation of the clustered data. The method is as well applied to the clustering of mass spectrometry data.
Complete list of metadatas

Cited literature [51 references]  Display  Hide  Download

https://hal-paris1.archives-ouvertes.fr/hal-00492406
Contributor : Charles Bouveyron <>
Submitted on : Tuesday, April 19, 2011 - 10:16:21 AM
Last modification on : Thursday, February 7, 2019 - 2:45:17 PM
Long-term archiving on : Wednesday, July 20, 2011 - 2:37:10 AM

File

revision_FisherEM_3.pdf
Files produced by the author(s)

Identifiers

Citation

Charles Bouveyron, Camille Brunet. Simultaneous model-based clustering and visualization in the Fisher discriminative subspace. Statistics and Computing, Springer Verlag (Germany), 2012, 22 (1), pp.301--324. ⟨10.1007/s11222-011-9249-9⟩. ⟨hal-00492406v4⟩

Share

Metrics

Record views

772

Files downloads

858