<  Back to the Polytechnique Montréal portal

High-dimensional bayesian clustering with variable selection: The R package bclust

Vahid Partovi Nia and Anthony C. Davison

Article (2012)

Open Acess document in PolyPublie and at official publisher
Show 4 files
Hide files
[img]
Preview
Open Access to the full text of this document
Published Version
Terms of Use: Creative Commons Attribution
Download (412kB)
[img] Open Access to the full text of this document
Archive - Supplemental Material
Terms of Use: GPLv3
Download (49kB)
[img] Open Access to the full text of this document
Supplemental Material
Terms of Use: GPLv3
Download (4kB)
[img] Open Access to the full text of this document
Archive - Supplemental Material
Terms of Use: GPLv3
Download (15kB)
Show abstract
Hide abstract

Abstract

The R package bclust is useful for clustering high-dimensional continuous data. The package uses a parametric spike-and-slab Bayesian model to downweight the effect of noise variables and to quantify the importance of each variable in agglomerative clustering. We take advantage of the existence of closed-form marginal distributions to estimate the model hyper-parameters using empirical Bayes, thereby yielding a fully automatic method. We discuss computational problems arising in implementation of the procedure and illustrate the usefulness of the package through examples.

Uncontrolled Keywords

Subjects: 3000 Statistics and probability > 3000 Statistics and probability
Department: Department of Mathematics and Industrial Engineering
Funders: Swiss SNF Fellowship, NSERC Discovery Grant, VAHSR&D Grant
Grant number: PBELP2-125531S, 341315/200, IIR 07-229
PolyPublie URL: https://publications.polymtl.ca/5058/
Journal Title: Journal of Statistical Software (vol. 47, no. 5)
Publisher: Foundation for Open Access Statistics
DOI: 10.18637/jss.v047.i05
Official URL: https://doi.org/10.18637/jss.v047.i05
Date Deposited: 18 Nov 2022 13:52
Last Modified: 26 Sep 2024 04:05
Cite in APA 7: Partovi Nia, V., & Davison, A. C. (2012). High-dimensional bayesian clustering with variable selection: The R package bclust. Journal of Statistical Software, 47(5), 22 pages. https://doi.org/10.18637/jss.v047.i05

Statistics

Total downloads

Downloads per month in the last year

Origin of downloads

Dimensions

Repository Staff Only

View Item View Item