Research on internet traffic classification techniques using supervised machine learning

来源 :High Technology Letters | 被引量 : 0次 | 上传用户:yangtianmei
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Internet traffic classification is vital to the areas of network operation and management. Traditionalclassification methods such as port mapping and payload analysis are becoming increasingly difficult asnewly emerged applications (e.g. Peer-to-Peer) using dynamic port numbers, masquerading techniquesand encryption to avoid detection. This paper presents a machine learning (ML) based traffic classificationscheme, which offers solutions to a variety of network activities and provides a platform of performanceevaluation for the classifiers. The impact of dataset size, feature selection, number of applicationtypes and ML algorithm selection on classification performance is analyzed and demonstrated by the followingexperiments: (1) The genetic algorithm based feature selection can dramatically reduce the costwithout diminishing classification accuracy. (2) The chosen ML algorithms can achieve high classificationaccuracy. Particularly, REPTree and C4.5 outperform the other ML algorithms when computational complexityand accuracy are both taken into account. (3) Larger dataset and fewer application types wouldresult in better classification accuracy. Finally, early detection with only several initial packets is proposedfor real-time network activity and it is proved to be feasible according to the preliminary results. Traditional classification methods such as port mapping and payload analysis are becoming more difficult as newly emerged applications (eg Peer-to-Peer) using dynamic port numbers, masquerading techniques and encryption to avoid detection. This paper presents a machine learning (ML) based traffic classificationscheme, which offers solutions to a variety of network activities and provides a platform of performance evaluation for the classifiers. The impact of dataset size, feature selection, number of applicationtypes and ML algorithm selection on classification Performance, analyzed and demonstrated by the followingexperiments: (1) The genetic algorithm based feature selection can be able to reduce the costwithout diminishing classification accuracy. (2) The chosen ML algorithms can achieve high classificationaccuracy. Particularly, REPTree and C4.5 outperform the other ML algorithms wh en computational complexity and accuracy are both taken into account. (3) Larger dataset and fewer application types wouldresult in better classification accuracy. Finally, early detection with only several initial packets is proposed for real-time network activity and it is proved to be feasible according to the preliminary results.
其他文献
矿用电机车可控硅脉冲调速己在我国矿山得到迅速应用和推广,由于这种电机车节约电力,操纵方便,运行平稳,受到工人们的欢迎。1975年《无线电》杂志第一期刊登了北京矿务局王
电视连续剧《勇士之城》第12集中,负责守卫常德城的国军师长余鹏程说:“计划是委员长亲自批的,如果我们更改,那就是委员长错了。”师参谋长柴志新说:“师座,将在外,军命有所
直径43毫米一字形钎头,是7655型、YT—30型、01—30型凿岩机必用的工具。如何多快好省,低成本,高质量造出来,是矿山机械工人的一项重要任务。一九七○年以来,我们结合本工段
本文介绍了对引进的日本井关HD—2000DB型联合收割机的观察性试验及性能测定的结果。对该机的优缺点进行了初步评价。 This article describes the introduction of the Jap
一个三百年的老字号、一个驰名商标不仅是企业的财富,同时也是国家和民族的财富。前人留给我们的宝贵遗产不能毁在我们这代人手中。然而──老字号,因其精湛的工艺和悠久的历史
一、导料槽安装尺寸的改进运输带与导料槽,原设计安装为两橡胶板相互贴合,如图1所示。工作时橡胶带被导料槽橡胶板压住,以达到停运时料石不至于从两侧散落出来之目的。在实
4月25日,经过近8个月的审批、过渡交割,诺基亚手机并入微软的历史性进程终于画上了句号。(一)惆怅袭身人们还记得两年多前,时任诺基亚CEO的史蒂芬·埃洛普(stephen Elop)曾说
在毛主席革命路线指引下,几年来,我厂以阶级斗争为纲,认真贯彻精料方针,根据本地区铁矿资源分散又小的特点,挖掘潜力,自力更生先后建立起磁选和成球生产线,使高炉入炉料的球
摘 要:本文通过分析“一带一路”的内涵和意义,探讨“一带一路”背景下国际化外语人才的需求特点和培养途径,从而为我国的国际化外语人才的培养提供参考。  关键词:“一带一路”;国际化外语人才;培养途径  随着经济全球化的发展,国际化的外语人才在我国的经济市场中显现出巨大的作用,其不仅助力我国经济和国际经济体的有效对接,也让更多的外国人认识和了解古老的中国文化。特别是随着“一带一路”战略的开展,国家经济
离合词偏误的产生并非某一种或两种原因所致,它是多种因素交织在一起共同作用于学生汉语学习的结果。基于上述探讨的原因,我们应从教学的四大环节(总体设计、教材编写、课堂