Robin: A Novel Method to Produce Robust Interpreters for Deep Learning-Based Code Classifiers

Li, Zhen; Zhang, Ruqian; Zou, Deqing; Wang, Ning; Li, Yating; Xu, Shouhuai; Chen, Chen; Jin, Hai

Computer Science > Software Engineering

arXiv:2309.10644 (cs)

[Submitted on 19 Sep 2023]

Title:Robin: A Novel Method to Produce Robust Interpreters for Deep Learning-Based Code Classifiers

Authors:Zhen Li, Ruqian Zhang, Deqing Zou, Ning Wang, Yating Li, Shouhuai Xu, Chen Chen, Hai Jin

View PDF

Abstract:Deep learning has been widely used in source code classification tasks, such as code classification according to their functionalities, code authorship attribution, and vulnerability detection. Unfortunately, the black-box nature of deep learning makes it hard to interpret and understand why a classifier (i.e., classification model) makes a particular prediction on a given example. This lack of interpretability (or explainability) might have hindered their adoption by practitioners because it is not clear when they should or should not trust a classifier's prediction. The lack of interpretability has motivated a number of studies in recent years. However, existing methods are neither robust nor able to cope with out-of-distribution examples. In this paper, we propose a novel method to produce \underline{Rob}ust \underline{in}terpreters for a given deep learning-based code classifier; the method is dubbed Robin. The key idea behind Robin is a novel hybrid structure combining an interpreter and two approximators, while leveraging the ideas of adversarial training and data augmentation. Experimental results show that on average the interpreter produced by Robin achieves a 6.11\% higher fidelity (evaluated on the classifier), 67.22\% higher fidelity (evaluated on the approximator), and 15.87x higher robustness than that of the three existing interpreters we evaluated. Moreover, the interpreter is 47.31\% less affected by out-of-distribution examples than that of LEMNA.

Comments:	To be published in the 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)
Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2309.10644 [cs.SE]
	(or arXiv:2309.10644v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2309.10644

Submission history

From: Ruqian Zhang [view email]
[v1] Tue, 19 Sep 2023 14:27:59 UTC (534 KB)

Computer Science > Software Engineering

Title:Robin: A Novel Method to Produce Robust Interpreters for Deep Learning-Based Code Classifiers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Robin: A Novel Method to Produce Robust Interpreters for Deep Learning-Based Code Classifiers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators