Efficient Model Selection for Mixtures of Probabilistic PCA via Hierarchical BIC

Source: The 24th International Workshop on Matrices and Statistics
This paper concerns model selection for mixtures of probabilistic principal component analyzers (MPCA). The well-known Bayesian information criterion (BIC) is frequently used for this purpose. However, we find that BIC implausibly penalizes each analyzer using the whole sample size. We present a new criterion for MPCA, called hierarchical BIC, in which each analyzer is penalized using only its own effective sample size. Theoretically, hierarchical BIC is a large-sample approximation of the variational Bayesian (VB) lower bound, and BIC is in turn a further approximation of hierarchical BIC. To learn hierarchical-BIC-based MPCA, we propose two efficient algorithms: a two-stage variant and a one-stage variant. The two-stage algorithm integrates model selection with respect to the subspace dimensions into parameter estimation, and the one-stage variant further integrates the selection of the number of mixture components, so that everything is handled by a single algorithm. Experiments on a number of synthetic and real-world data sets show that (i) hierarchical BIC is more accurate than BIC and several related competitors; (ii) the two proposed algorithms are not only effective but also much more efficient than the classical two-stage procedure commonly used for BIC.
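The difference between the two penalties can be illustrated with a small numerical sketch. The abstract does not state the exact form of hierarchical BIC, so the code below rests on assumptions: each analyzer k with D_k free parameters and mixing weight pi_k is penalized by (D_k / 2) log(N pi_k), the K - 1 free mixing weights are penalized with log N, and D_k follows the usual PPCA parameter count. All function names are hypothetical.

```python
import math

def ppca_free_params(d, q):
    # Assumed parameter count for one PPCA analyzer in d dimensions with
    # q latent dimensions: mean (d) + loadings (d*q) minus rotational
    # redundancy (q*(q-1)/2) + one isotropic noise variance.
    return d + d * q - q * (q - 1) // 2 + 1

def bic_penalty(n, d, qs):
    # Standard BIC: every free parameter is penalized with log(n),
    # where n is the whole sample size.
    k = len(qs)
    total = sum(ppca_free_params(d, q) for q in qs) + (k - 1)
    return 0.5 * total * math.log(n)

def hierarchical_penalty(n, d, qs, weights):
    # Assumed hierarchical form: analyzer k is penalized with
    # log(n * weights[k]), its effective sample size; the k - 1 free
    # mixing weights still use log(n).
    k = len(qs)
    penalty = 0.5 * (k - 1) * math.log(n)
    for q, w in zip(qs, weights):
        penalty += 0.5 * ppca_free_params(d, q) * math.log(n * w)
    return penalty

if __name__ == "__main__":
    n, d = 500, 10
    qs = [2, 3]            # subspace dimension of each analyzer
    weights = [0.9, 0.1]   # estimated mixing proportions
    print("BIC penalty:", bic_penalty(n, d, qs))
    print("Hierarchical penalty:", hierarchical_penalty(n, d, qs, weights))
```

Under these assumptions, a small analyzer (here, weight 0.1) is penalized far less than under BIC, and the two penalties coincide when there is a single analyzer with weight 1.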