Dos and Don’ts of Machine Learning
in Computer Security
Daniel Arp, Erwin Quiring, Feargus Pendlebury, Alexander Warnecke,
Fabio Pierazzi, Christian Wressnegger, Lorenzo Cavallaro, Konrad Rieck
USENIX Security 2022
Machine Learning has already solved many problems in computer security?
Unfortunately not…
Motivation—Historical Examples
Network intrusion detection: The base rate fallacy
• Intrusion detectors should have low false positive rates (FPR)
• ‘Low’ FPR often still corresponds to large number of false positives
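This effect follows directly from Bayes' theorem. A minimal sketch with illustrative numbers (the FPR, TPR, and base rate below are assumptions, not figures from the paper):

```python
# Base-rate fallacy: when intrusions are rare, even a 'low' FPR means
# most alarms are false. All numbers below are illustrative.
fpr = 0.01        # false positive rate of the detector
tpr = 0.80        # true positive rate (detection rate)
base_rate = 1e-4  # fraction of events that are real intrusions

# Bayes' theorem: probability that a raised alarm is a real intrusion
precision = (tpr * base_rate) / (tpr * base_rate + fpr * (1 - base_rate))
print(f"P(intrusion | alarm) = {precision:.4f}")
```

Under these assumptions, despite a 1% FPR, fewer than 1% of raised alarms correspond to actual intrusions.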
Android malware detection: Spatio-temporal bias inflating performance
• Models trained with access to ‘future’ information
• Unrealistic class balance inflates performance
Axelsson. The base-rate fallacy and the difficulty of intrusion detection. ACM TISSEC, 2000.
Pendlebury et al. TESSERACT: Eliminating Experimental Bias in Malware Classification across Space and Time. USENIX Security, 2019.
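One way to avoid training on 'future' information is to split strictly by time, as TESSERACT recommends: every training sample must predate every test sample. A minimal sketch (the sample format, field names, and dates are hypothetical):

```python
from datetime import datetime

# Temporally consistent train/test split: train only on samples that
# predate the cutoff, so the model never sees 'future' data.
def temporal_split(samples, cutoff):
    train = [s for s in samples if s["timestamp"] < cutoff]
    test = [s for s in samples if s["timestamp"] >= cutoff]
    return train, test

apps = [
    {"name": "app_a", "timestamp": datetime(2013, 5, 1), "label": 0},
    {"name": "app_b", "timestamp": datetime(2014, 2, 1), "label": 1},
    {"name": "app_c", "timestamp": datetime(2015, 7, 1), "label": 0},
]
train, test = temporal_split(apps, datetime(2014, 1, 1))
# train contains only app_a; test contains app_b and app_c
```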
Overview
1. Identification of common pitfalls
• 10 subtle issues affecting ML for security
• Recommendations for avoiding them
2. Survey on the prevalence of pitfalls
• Review of 30 top papers in security
• Pitfalls are widespread
(Timeline: surveyed papers span 2011–2020)
3. Case studies demonstrating impact of pitfalls
• Mobile malware detection
• Vulnerability discovery
• Source code authorship attribution
• Network intrusion detection
Important remark
This work should not be interpreted as a finger-pointing exercise. Any work mentioned as having pitfalls still makes important contributions, and we identify pitfalls in our own work as well.
ML Pipeline and Pitfalls
Data Collection and Labeling
• P1 Sampling bias
• P2 Label inaccuracy
System Design and Learning
• P3 Data snooping
• P4 Spurious correlations
• P5 Biased parameters
Performance Evaluation
• P6 Inappropriate baselines
• P7 Inappropriate measures
• P8 Base rate fallacy
Deployment and Operation
• P9 Lab-only evaluation
• P10 Inappropriate threat model
Prevalence Study
1. Paper Selection → 2. Review Process → 3. Author Feedback
Pitfall is either…
• present (or present but discussed)
• partly present (or partly present but discussed)
• not present
• unclear from text
Prevalence Study
[Figure: stacked bars showing, for each of the 10 pitfalls, in how many of the 30 surveyed papers it is present, partly present, or discussed.]
Pitfalls are prevalent even in top research!
Impact Analysis
• Android Malware Detection: P1 Sampling Bias, P4 Spurious Correlations
• Network Intrusion Detection: P6 Inappropriate Baselines, P7 Inappropriate Performance Measures, P9 Lab-Only Evaluation
• Authorship Attribution: P1 Sampling Bias, P4 Spurious Correlations
• Vulnerability Discovery: P2 Label Inaccuracy, P4 Spurious Correlations, P6 Inappropriate Baselines
Impact Study: Mobile Malware Detection
(P1 Sampling Bias, P4 Spurious Correlations, P7 Inappropriate Performance Measures)
What is the problem?
• Merging data from different sources leads to sampling bias
• Different origins of malware and benign apps can introduce unwanted shortcuts
[Figure: sampling probability vs. number of AV detections (0–25) for apps from the Google Play Store, Chinese markets, and other origins; ≈80% and ≈70% annotations mark the dominant shares.]
Allix et al. AndroZoo: Collecting Millions of Android Apps for the Research Community. ACM MSR, 2016.
Arp et al. DREBIN: Effective and Explainable Detection of Android Malware in Your Pocket. NDSS, 2014.
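The shortcut can be illustrated with synthetic data: if app origin correlates with the label, a 'detector' that only checks the origin already looks accurate. The 90/10 correlation and balanced classes below are assumptions for illustration, not the paper's measured proportions:

```python
import random

random.seed(0)

# Synthetic sampling bias: most malware comes from market B and most
# benign apps from market A (illustrative 90/10 split). A classifier
# that only looks at the origin scores well -- a shortcut, not detection.
def make_dataset(n):
    data = []
    for _ in range(n):
        label = random.random() < 0.5  # 1 = malware, balanced classes
        if label:
            origin = "B" if random.random() < 0.9 else "A"
        else:
            origin = "A" if random.random() < 0.9 else "B"
        data.append((origin, int(label)))
    return data

dataset = make_dataset(10_000)
# Predict 'malware' whenever the app comes from market B.
correct = sum((origin == "B") == bool(label) for origin, label in dataset)
accuracy = correct / len(dataset)
print(f"origin-only accuracy: {accuracy:.2f}")  # close to 0.90
```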
Impact Study: Mobile Malware Detection (continued)
What is the impact?
• Comparison on datasets with (D1) and without (D2) the artifact
• Training of an SVM on two different feature sets
True positive rate:
• Drebin: 0.96 (D1) → 0.85 (D2), a relative drop of ≈11%
• Opseqs: 0.88 (D1) → 0.73 (D2), a relative drop of ≈17%
Results
• The experimental results show how sampling bias affects performance (P1)
• The URL "play.google.com" is among the top features in D1 (P4)
• Using accuracy would have underestimated the presence of bias (P7)
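To see why accuracy can mask such a drop (P7): with imbalanced test data, the benign class dominates the metric. A toy calculation (the class ratio and TNR are assumptions; only the two TPR values come from the slide):

```python
# Assumed test set: 1,000 malware and 9,000 benign samples, with a
# fixed true negative rate of 0.99 -- illustrative numbers only.
n_mal, n_ben, tnr = 1_000, 9_000, 0.99

def accuracy(tpr):
    # Overall accuracy over both classes
    return (tpr * n_mal + tnr * n_ben) / (n_mal + n_ben)

acc_d1 = accuracy(0.96)  # with artifact (D1)
acc_d2 = accuracy(0.85)  # without artifact (D2)
print(f"TPR 0.96 -> 0.85, but accuracy only {acc_d1:.3f} -> {acc_d2:.3f}")
```

Under these assumptions, an 11-point drop in TPR shrinks to roughly one point of accuracy, hiding the bias.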
Dos and Don’ts of Machine Learning in Computer Security
• We identify 10 subtle pitfalls affecting the field
• Find that they are prevalent throughout top research
• Demonstrate their impact through case studies
Updates on pitfalls and recommendations:
• https://dodo-mlsec.org/