论文部分内容阅读
基于内容的垃圾邮件过滤方法是垃圾邮件过滤方法的一个重要分支,由于其高准确率,朴素贝叶斯算法更在基于内容的过滤方法中占了一席之地。本文介绍了贝叶斯算法的基本原理及其在邮件过滤中的应用,并写出了其监督训练过程和邮件过滤具体过程,做出了全部过程的进程图。提出了笔者自己的一点想法,建立用户个人邮件训练集可能会更一步增加垃圾邮件过滤的正确度与召回率。
Content-based spam filtering is an important branch of the spam filtering approach, and because of its high accuracy, Naïve Bayes algorithms make a big difference in content-based filtering. This paper introduces the basic principle of Bayesian algorithm and its application in mail filtering, and writes out its supervision of the training process and mail filtering specific process, made the whole process of the process map. I put forward my own idea that the establishment of user personal e-mail training set may be further steps to increase the accuracy of spam filtering and recall rate.