Discovery of Real-World Burst Topics and Their Intermittent Evolution (in English)

Source: China Communications
A very large number of documents are now available on online news sites (e.g., CNN and NYT). Making use of these documents, for example to discover a burst topic and its evolution, is therefore a significant challenge. In this paper, a novel topic model called intermittent Evolution LDA (iELDA) is proposed. In iELDA, the time-evolving documents are divided into many small epochs. iELDA uses the detected global topics as priors to guide the detection of an emerging topic and to keep track of its evolution over different epochs. As a natural extension of traditional Latent Dirichlet Allocation (LDA) and the Dynamic Topic Model (DTM), iELDA has an advantage: it can discover the intermittent recurring pattern of a burst topic. We apply iELDA to real-world data from NYT; the results demonstrate that iELDA can appropriately capture a burst topic and track its intermittent evolution, and that it achieves better predictive ability than other related topic models.
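The abstract does not spell out the inference procedure, so the following is only an illustrative sketch of two ideas it mentions: dividing time-stamped documents into epochs, and reusing a previously detected topic as a prior for the next epoch. The function names, the fixed epoch length, and the `base` and `weight` smoothing parameters are assumptions made for illustration; they are not taken from iELDA as published.

```python
from collections import defaultdict


def split_into_epochs(docs, epoch_length):
    """Group (timestamp, tokens) pairs into consecutive fixed-length epochs.

    Epoch k holds the documents whose offset from the earliest timestamp
    falls in [k * epoch_length, (k + 1) * epoch_length).
    The fixed epoch length is an assumption of this sketch.
    """
    if not docs:
        return {}
    t0 = min(t for t, _ in docs)
    epochs = defaultdict(list)
    for t, tokens in docs:
        epochs[int((t - t0) // epoch_length)].append(tokens)
    return dict(epochs)


def topic_word_prior(global_counts, vocab, base=0.01, weight=1.0):
    """Turn the word counts of an earlier topic into Dirichlet pseudo-counts.

    Each word receives `base` plus `weight` times its share of the topic's
    counts, so words that were prominent in the earlier topic get a larger
    prior. `base` and `weight` are hypothetical tuning parameters.
    """
    total = sum(global_counts.get(w, 0) for w in vocab)
    if total == 0:
        # No evidence from the earlier topic: fall back to a symmetric prior.
        return {w: base for w in vocab}
    return {w: base + weight * global_counts.get(w, 0) / total for w in vocab}


# Toy data: timestamps in arbitrary units, one short document each.
docs = [(0, ["quake"]), (5, ["aid"]), (12, ["vote"])]
epochs = split_into_epochs(docs, epoch_length=10)
prior = topic_word_prior({"quake": 3, "aid": 1}, ["quake", "aid", "vote"])
```

A prior of this shape could then be passed to any LDA implementation that accepts a per-word topic prior when fitting the next epoch. Whether iELDA combines priors in this way is not stated in the abstract.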