research-article

DENAS: automated rule generation by knowledge extraction from neural networks

Authors:

Wei YangAuthors Info & Claims

ESEC/FSE 2020: Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering

Pages 813 - 825

https://doi.org/10.1145/3368089.3409733

Published: 08 November 2020 Publication History

Get Access

Abstract

Deep neural networks (DNNs) have been widely applied in the software development process to automatically learn patterns from massive data. However, many applications still make decisions based on rules that are manually crafted and verified by domain experts due to safety or security concerns. In this paper, we aim to close the gap between DNNs and rule-based systems by automating the rule generation process via extracting knowledge from well-trained DNNs. Existing techniques with similar purposes either rely on specific DNNs input instances or use inherently unstable random sampling of the input space. Therefore, these approaches either limit the exploration area to a local decision-space of the DNNs or fail to converge to a consistent set of rules. The resulting rules thus lack representativeness and stability.

In this paper, we address the two aforementioned shortcomings by discovering a global property of the DNNs and use it to remodel the DNNs decision-boundary. We name this property as the activation probability, and show that this property is stable. With this insight, we propose an approach named DENAS including a novel rule-generation algorithm. Our proposed algorithm approximates the non-linear decision boundary of DNNs by iteratively superimposing a linearized optimization function.

We evaluate the representativeness, stability, and accuracy of DENAS against five state-of-the-art techniques (LEMNA, Gradient, IG, DeepTaylor, and DTExtract) on three software engineering and security applications: Binary analysis, PDF malware detection, and Android malware detection. Our results show that DENAS can generate more representative rules consistently in a more stable manner over other approaches. We further offer case studies that demonstrate the applications of DENAS such as debugging faults in the DNNs and generating signatures that can detect zero-day malware.

Supplementary Material

Auxiliary Teaser Video (fse20main-p454-p-teaser.mp4)

This is a presentation video of my talk at ESEC/FSE 2020 on our paper accepted in the research track. In this paper, we propose DENAS, a tool can generate rules through extracting knowledge from a well-trained deep neural networks.

Download
4.14 MB

Auxiliary Presentation Video (fse20main-p454-p-video.mp4)

Download
66.42 MB

References

[1]

O cial Report of the Special Committee to Review the Federal Aviation Administrations Aircraft Certi cation Process Executive Summary.https://www. transportation.gov/sites/dot.gov/ les/2020-01/executive-summary.pdf, 2020.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Detect and Remove Watermark in Deep Neural Networks via Generative Adversarial Networks

An investigation of a deep learning based malware detection system

Symmetric Power Activation Functions for Deep Neural Networks

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations