Traffic anomaly detection and cause identification using flow-level measurements
نویسنده
چکیده
Measurement and analysis of traffic in IP networks are of great interest for network operators as they provide important information about the utilization of network resources, the user behavior, as well as the deployed applications and services. In particular, flow-level traffic measurements have become more and more ubiquitous in the past years. In this context, a flow is defined as a stream of packets which are observed at a given interface in the network and which share a set of common properties called flow keys. For each flow, a flow record is generated containing the flow keys as well as additional flow attributes and statistical information, such as the observation time of the flow, the number of bytes and packets etc. This dissertation deals with the detection of traffic anomalies and the identification of their causes using flow-level measurement data. Traffic anomalies are significant deviations from the pattern of usual network traffic. Possible reasons for traffic anomalies are changes in the network topology (e.g., newly connected hosts, routing changes) or network usage (e.g., changed customer behavior, new applications). Anomalies may also be caused by failures of network devices as well as by malicious worm or attack traffic. The early detection of such events is of particular interest as they may impair the safe and reliable operation of the network. For the detection of traffic anomalies, we convert the flow records into time series of eight different traffic metrics describing the traffic volume as well as the cardinality of certain flow keys. We investigate various statistical change detection methods which have been originally conceived for quality control in manufacturing processes. In particular, we use control charts to detect shifts in the mean, standard deviation, or correlation of traffic measurement variables. As most traffic measurement variables exhibit nonstationarity and serial correlation, residual generation methods need to be applied in order to reduce the effect of systematic changes, such as seasonal variation. For anomaly detection in a single traffic metric, we examine different time-series analysis methods with special focus on robust forecasting techniques. For multi-metric anomaly detection, we study the application of principal component analysis (PCA) which allows modeling the correlation structure between different measurement variables. The investigated change detection and residual generation methods are evaluated and compared based on flow data which was collected in the network of an Internet service provider (ISP). The detected anomalies are clas-
منابع مشابه
Traffic Scene Analysis using Hierarchical Sparse Topical Coding
Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...
متن کاملBehavioral Analysis of Traffic Flow for an Effective Network Traffic Identification
Fast and accurate network traffic identification is becoming essential for network management, high quality of service control and early detection of network traffic abnormalities. Techniques based on statistical features of packet flows have recently become popular for network classification due to the limitations of traditional port and payload based methods. In this paper, we propose a metho...
متن کاملFeature Extraction to Identify Network Traffic with Considering Packet Loss Effects
There are huge petitions of network traffic coming from various applications on Internet. In dealing with this volume of network traffic, network management plays a crucial rule. Traffic classification is a basic technique which is used by Internet service providers (ISP) to manage network resources and to guarantee Internet security. In addition, growing bandwidth usage, at one hand, and limit...
متن کاملMethod of Video-Measurements of Traffic Flow Characteristics at a Road Junction
In the theory of traffic flows the main characteristics are: intensity, speed, and density. They make it possible to use hydrodynamic models. In connection with the development of modern highways and road networks, traffic flows behavior is becoming more and more complex and diverse. In particular, the B.Kerner studies have shown that the laminar solution of hydrodynamic models is poorly corre...
متن کاملAnomaly Extraction Using Efficient-Web Miner Algorithm
Today network security, uptime and performance of network are important and serious issues in computer network. Anomaly is deviation from normal behaviour affecting network security. Anomaly Extraction is identification of unusual flow from network, which is need of network operator. Anomaly extraction aims to automatically find the inconsistencies in large set of data observed during an anomal...
متن کامل