Method of data cleaning for network traffic classification

来源 :The Journal of China Universities of Posts and Telecommunica | 被引量 : 0次 | 上传用户:hufeng274240003
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Network traffic classification aims at identifying the application types of network packets.It is important for Internet service providers(ISPs)to manage bandwidth resources and ensure the quality of service for different network applications.However,most classification techniques using machine learning only focus on high flow accuracy and ignore byte accuracy.The classifier would obtain low classification performance for elephant flows as the imbalance between elephant flows and mice flows on Internet.The elephant flows,however,consume much more bandwidth than mice flows.When the classifier is deployed for traffic policing,the network management system cannot penalize elephant flows and avoid network congestion effectively.This article explores the factors related to low byte accuracy,and secondly,it presents a new traffic classification method to improve byte accuracy at the aid of data cleaning.Experiments are carried out on three groups of real-world traffic datasets,and the method is compared with existing work on the performance of improving byte accuracy.Experiment shows that byte accuracy increased by about 22.31%on average.The method outperforms the existing one in most cases. Network traffic classification aims at identifying the application types of network packets. It is important for Internet service providers (ISPs) to manage bandwidth resources and ensure the quality of service for different network applications. However, most classification techniques using machine learning only focus on high flow accuracy and ignore byte accuracy. classifier would obtain low classification performance for elephant flows as the imbalance between elephant flows and mice flows on Internet. elephant flows, however, consuming much more bandwidth than mice flows.When the classifier is deployed for traffic policing, the network management system can not penalize elephant flows and avoid network congestion effectively. This article explores the factors related to low byte accuracy, and secondly, it presents a new traffic classification method to improve byte accuracy at the aid of data cleaning. Experiments are carried out on three groups of real-world traffic datasets, and the method is compared with existing work on the performance of improving byte accuracy. Experiment shows that byte accuracy increased by about 22.31% on average. method outperforms the existing one in most cases.
其他文献
摘要:本文从项目可持续发展角度,根据某市农村现实状况,提出一个简约化信息系统解决方案,协助农户开发农家乐市场,改善营销手段,拓展客源,吸引、留住客户,提升经营管理能力,促使其进入良性发展轨道。  关键词:小型信息化项目;外包及租用服务  近年来,农家乐作为绿色经济项目发展迅速,为促进农村社会经济发展、改善乡村面貌和环境起了良好作用,受到国家政策扶持与政府部门鼓励。如何有效地帮助农户在项目上健康、顺