Anomaly Detection Using Ml (1)
Anomaly Detection Using Ml (1)
NETWORK TRAFFIC
ANOMALY DETECTION
USING ML
Muhamad Nizam Azmi
Mataram University
WHY ANOMALY DETECTION ?
Anomaly Detection Overview
Challenges
High-dimensional data
Large-scale distributed systems
Vast amounts of data
advacements in Machine Learning
NEXT- PROJECT
Distinguishing between normal and anomalous
behaviors
Collective anomaly
3 TYPES OF
contextual anomaly ANOMALY
point anomaly
Collective anomaly
3 TYPES OF
contextual anomaly ANOMALY
Point anomaly
detection
focus project
Provide insights and guidelines for future applications in cybersecurity, industrial monitoring, and beyond
ETC.
SUMMARIZE
PREV-
RESEARCH
DATASET INFORMATION
Purpose: The KDD Cup 1999 dataset is used to build a predictive model to distinguish
between "bad" connections (intrusions or attacks) and "good" (normal) connections in
a computer network. It aims to protect the network from unauthorized users, including
potential insiders.
Background: The dataset is based on the 1998 DARPA Intrusion Detection Evaluation
Program, managed by MIT Lincoln Labs. The program's objective was to evaluate
research in intrusion detection using a standard set of data that includes various
intrusions simulated in a military network environment.
Data Collection.....
Environment: Simulated a typical U.S. Air Force LAN with multiple simulated attacks.
Duration: Data was collected over nine weeks (seven weeks for training, two weeks for testing).
Data Size:
Training data: 4 gigabytes of compressed binary TCP dump data, resulting in about five million connection
records.
Test data: Around two million connection records.
Connection Records: Each connection is a sequence of TCP packets between a source IP address and a target IP
address, labeled as either normal or a specific type of attack.
TYPES OF ATTACKS
DOS R2L
Denial of Service e.g., Syn flood Remote to Local e.g., guessing
passwords
U2R Probing
User to Root e.g., Buffer overflow e.g., port scanning
attacks
RESULT
0: Not logged in
1: Successfully logged in
High Performance:
Classes 0 and 1 have perfect AUC scores of
1.00, indicating excellent classification
performance with no false positives.
Class 2 also demonstrates very high
performance with an AUC of 0.99.
Moderate Performance:
Class 3 has an AUC of 0.97, showing strong
performance with minimal false positives.
Class 4 has a lower AUC of 0.82, indicating
room for improvement in distinguishing this
class from others.
ROC CURVE & FEATURE IMPORTANCES RESULT
10
THANK YOU FOR
YOUR ATTENTION