
SMILE: A novel dissimilarity-based procedure for detecting sparse-specific profiles in sparse contingency tables

Abstract: A novel statistical procedure is introduced for clustering individuals characterized by sparse-specific profiles, in the context of data summarized in sparse contingency tables. The procedure relies on single-linkage clustering based on a new dissimilarity measure designed to give equal influence to the sparsity and the specificity of profiles. Theoretical properties of the new dissimilarity are derived by characterizing single-linkage clustering using Minimum Spanning Trees. This characterization makes it possible to describe situations in which the proposed dissimilarity outperforms competing dissimilarities. Simulation studies demonstrate the strength of the new dissimilarity compared with 11 other methods. The analysis of a genomic data set dedicated to the study of molecular signatures of selection illustrates the efficiency of the proposed method in a real situation.
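
The link between single-linkage clustering and Minimum Spanning Trees mentioned in the abstract can be illustrated with a minimal sketch. It relies on the standard result that cutting the k-1 heaviest edges of a minimum spanning tree yields the k single-linkage clusters. This is not the SMILE dissimilarity, which is not defined on this page: the L1 distance between row profiles is only a placeholder assumption, and the function names and the toy table are hypothetical.

```python
def profile_l1(table):
    """Placeholder dissimilarity (an assumption, not SMILE):
    L1 distance between the row profiles of a contingency table."""
    profiles = [[x / sum(row) for x in row] for row in table]
    return [[sum(abs(a - b) for a, b in zip(p, q)) for q in profiles]
            for p in profiles]


def minimum_spanning_tree(d):
    """Prim's algorithm on a symmetric dissimilarity matrix d.
    Returns the MST edges as (weight, i, j) tuples."""
    n = len(d)
    in_tree = [False] * n
    best = [float("inf")] * n  # cheapest known link from the tree to each vertex
    parent = [-1] * n
    best[0] = 0.0
    edges = []
    for _ in range(n):
        # Pick the vertex outside the tree that is closest to it.
        u = min((v for v in range(n) if not in_tree[v]), key=lambda v: best[v])
        in_tree[u] = True
        if parent[u] >= 0:
            edges.append((best[u], parent[u], u))
        for v in range(n):
            if not in_tree[v] and d[u][v] < best[v]:
                best[v] = d[u][v]
                parent[v] = u
    return edges


def single_linkage_from_mst(d, k):
    """Keep the n-k lightest MST edges (i.e. cut the k-1 heaviest);
    the connected components are the k single-linkage clusters."""
    n = len(d)
    kept = sorted(minimum_spanning_tree(d))[: n - k]
    root = list(range(n))  # union-find forest

    def find(x):
        while root[x] != x:
            root[x] = root[root[x]]  # path halving
            x = root[x]
        return x

    for _, i, j in kept:
        root[find(i)] = find(j)
    # Relabel components 0, 1, ... in order of first appearance.
    ids = {}
    return [ids.setdefault(find(i), len(ids)) for i in range(n)]


# Hypothetical toy table: rows are individuals, columns are categories.
toy_table = [[5, 0, 0], [4, 1, 0], [0, 0, 6], [0, 1, 5], [2, 2, 2]]
labels = single_linkage_from_mst(profile_l1(toy_table), k=3)
```

In this toy table the first two rows and the next two rows have nearly identical profiles, so they end up paired, while the uniform last row stays on its own when k=3.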
Document type: Journal articles


https://hal-univ-rennes1.archives-ouvertes.fr/hal-01269901
Contributor: Laurent Jonchère
Submitted on: Thursday, February 11, 2016 - 9:58:51 AM
Last modification on: Thursday, October 8, 2020 - 3:07:02 AM
Long-term archiving on: Saturday, November 12, 2016 - 5:28:25 PM

File

Files produced by the author(s)

Citation

Mathieu Emily, Christophe Hitte, Alain Mom. SMILE: A novel dissimilarity-based procedure for detecting sparse-specific profiles in sparse contingency tables. Computational Statistics and Data Analysis, Elsevier, 2016, 99, pp.171-188. ⟨10.1016/j.csda.2016.01.017⟩. ⟨hal-01269901⟩
