Comparison of Supervised and Unsupervised Learning for Detecting Anomalies in Network Traffic
- Robert McAndrew, Statistics, Colorado State University, Fort Collins, Colorado, United States
- Stephen Hayne, Colorado State University, Fort Collins, Colorado, United States
- Haonan Wang, Statistics, Colorado State University, Fort Collins, Colorado, United States
AbstractAdversaries are always probing for vulnerable spots on the Internet so they can attack their target. By examining traffic at the firewall, we can look for anomalies that may represent these probes. To help select the right techniques we conduct comparisons of supervised and unsupervised machine learning on network flows to find sets of outliers flagged as potential threats. We apply Functional PCA and K-Means together versus Multilayer Perceptron on a real-world dataset of traffic prior to an NTP DDoS attack in January 2014; scanning activity was heightened during this pre-attack period. We partition data to evaluate detection powers of each technique and show that FPCA+Kmeans outperforms MLP. We also present a new variation of the circle plot for visualization of resulting outliers which we suggest excels at displaying multidimensional attributes of an individual IP's behavior over time. In small multiples, circle plots show a gestalt overview of traffic.
Return to previous page