Finding Deceptive Opinion Spam by Correcting the Mislabeled Instances

来源 :Chinese Journal of Electronics | 被引量 : 0次 | 上传用户:AceAcer
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Assessing the trustworthiness of reviews is a key in natural language processing and computational linguistics. Previous work mainly focuses on some heuristic strategies or simple supervised learning methods, which limit the performance of this task. This paper presents a new approach, from the viewpoint of correcting the mislabeled instances, to find deceptive opinion spam. Partition a dataset into several subsets, construct a classifier set for each subset and select the best one to evaluate the whole dataset. Error variables are defined to compute the probability that the instances have been mislabeled. The mislabeled instances are corrected based on two threshold schemes, ma jority and non-objection. The results display significant improvements in our method in contrast to the existing baselines. Assessing the trustworthiness of reviews is a key in natural language processing and computational linguistics. Previous work mainly focuses on some heuristic strategies or simple supervised learning methods, which limit the performance of this task. This paper presents a new approach, from the viewpoint of correcting the mislabeled instances, to find deceptive opinion spam. Partition a dataset into several subsets, construct a classifier set for each subset and select the best one to evaluate the whole dataset. Error variables are defined to compute the probability that the instances have been mislabeled. The mislabeled instances are based on two threshold schemes, majority and non-objection. The results display significant improvements in our method in contrast to the existing baselines.
其他文献
期刊
The appearance based facial tracking methods, such as active appearance models and candide models, are widely used in intelligent user interface and facial expression recognition. This paper proposes
高血压是中老年人的常见病、多发病.中医中药是根据人体脏腑气血阴阳平衡原理,望、闻、问、切四诊和参,辨证施治,结合患者外在表现,找出内在病因,给予中医汤剂或中成药口服.r
期刊
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
1范围rn本国际标准规定工业X射线和γ射线照相用胶片法检测缺陷的通则,可适用于金属制品和材料.rn
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
期刊
期刊