research-article

TargetFuzz: Using DARTs to Guide Directed Greybox Fuzzers

Authors:

Sadullah Canakci, Nikolay Matyunin, Manuel Egele

ASIA CCS '22: Proceedings of the 2022 ACM on Asia Conference on Computer and Communications Security

Pages 561 - 573

https://doi.org/10.1145/3488932.3501276

Published: 30 May 2022

Abstract

Software development is a continuous and incremental process. Developers continuously improve their software in small batches rather than in one large batch. The high frequency of small batches makes it essential to use effective testing methods that detect bugs under limited testing time. To this end, researchers have proposed directed greybox fuzzing (DGF), which aims to generate test cases that stress certain target sites. Unlike coverage-based greybox fuzzing (CGF), which aims to maximize code coverage across the whole program, DGF aims to cover potentially buggy code regions (e.g., a recently modified program region). While prior works improve several aspects of DGF (such as power scheduling, input prioritization, and target selection), little attention has been given to improving the seed selection process. Existing DGF tools use seed corpora mainly tailored for CGF (i.e., a set of seeds that cover different regions of the program). We observe that using CGF-based corpora limits the bug-finding capability of a directed greybox fuzzer. To mitigate this shortcoming, we propose TargetFuzz, a mechanism that provides a DGF tool with a target-oriented seed corpus. We refer to this corpus as the DART corpus, which contains only seeds that are 'close' to the targets. This way, the DART corpus guides DGF to the targets, thereby exposing bugs even under limited fuzzing time. Evaluations on 34 real bugs show that AFLGo (a state-of-the-art directed greybox fuzzer), when equipped with the DART corpus, finds 10 additional bugs and achieves a 4.03x speedup, on average, in the time-to-exposure compared to a generic CGF-based corpus.
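The abstract does not say how TargetFuzz measures closeness, so the following is only a minimal sketch of the general idea of a target-oriented corpus, not the authors' implementation. It assumes that each seed comes with the set of basic blocks it executes and that a per-block distance to the target is already known; a seed's distance is taken as the mean over the blocks it covers, which is one common choice in distance-based DGF. The names `Seed`, `seed_distance`, `select_close_seeds`, and all block IDs and distance values are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Seed:
    name: str
    covered_blocks: frozenset  # IDs of basic blocks this seed executes (hypothetical)


def seed_distance(seed: Seed, block_distance: Dict[str, float]) -> float:
    """Mean distance-to-target over the covered blocks that can reach the target.

    Blocks missing from block_distance are assumed unable to reach the target.
    Returns infinity when the seed covers no block that can reach it.
    """
    reachable = [block_distance[b] for b in seed.covered_blocks if b in block_distance]
    if not reachable:
        return float("inf")
    return sum(reachable) / len(reachable)


def select_close_seeds(seeds: List[Seed],
                       block_distance: Dict[str, float],
                       keep: int) -> List[Seed]:
    """Keep at most `keep` seeds, closest first, dropping seeds that never get near the target."""
    ranked = sorted(seeds, key=lambda s: (seed_distance(s, block_distance), s.name))
    return [s for s in ranked[:keep] if seed_distance(s, block_distance) != float("inf")]


# Illustrative data: block "T" is the target, lower numbers are closer.
block_distance = {"A": 3.0, "B": 1.0, "T": 0.0}
seeds = [
    Seed("s1", frozenset({"A"})),        # distance 3.0
    Seed("s2", frozenset({"B", "T"})),   # distance 0.5
    Seed("s3", frozenset({"X"})),        # cannot reach the target
    Seed("s4", frozenset({"A", "B"})),   # distance 2.0
]
corpus = select_close_seeds(seeds, block_distance, keep=2)
print([s.name for s in corpus])
```

A generic CGF-style corpus would tend to keep `s1` and `s3` as well, because they add coverage elsewhere in the program; the target-oriented selection discards them because they do not bring the fuzzer nearer to the target.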

Index Terms

TargetFuzz: Using DARTs to Guide Directed Greybox Fuzzers
1. Security and privacy
  1. Software and application security
    1. Software security engineering

Published In

ASIA CCS '22: Proceedings of the 2022 ACM on Asia Conference on Computer and Communications Security

May 2022

1291 pages

ISBN:9781450391405

DOI:10.1145/3488932

General Chairs: Yuji Suga (Internet Initiative Japan Inc., Japan), Kouichi Sakurai (Kyushu University, Japan)

Program Chairs: Xuhua Ding (Singapore Management University, Singapore), Kazue Sako (Waseda University, Japan)

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGSAC: ACM Special Interest Group on Security, Audit, and Control

Publisher

Association for Computing Machinery

New York, NY, United States

Funding Sources

BU Hariri Research Incubation Award

Conference

ASIA CCS '22

Sponsor:

SIGSAC

ASIA CCS '22: ACM Asia Conference on Computer and Communications Security

May 30 - June 3, 2022

Nagasaki, Japan

Acceptance Rates

Overall Acceptance Rate 418 of 2,322 submissions, 18%

Cited By

Zeng Q, Xiong D, Wu Z, Qian K, Wang Y, Su Y. 2024. WolfFuzz: A Dynamic, Adaptive, and Directed Greybox Fuzzer. Electronics 13:11 (2096). Online publication date: 28-May-2024.
https://doi.org/10.3390/electronics13112096

Jiang Z, Wen M, Cao J, Shi X, Jin H, Filkov V, Ray B, Zhou M. 2024. Towards Understanding the Effectiveness of Large Language Models on Directed Test Input Generation. Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 1408-1420. Online publication date: 27-Oct-2024.
https://dl.acm.org/doi/10.1145/3691620.3695513

Sharma A, Cadar C, Metzman J, Böhme M, Noller Y, Szekeres L. 2024. Effective Fuzzing within CI/CD Pipelines (Registered Report). Proceedings of the 3rd ACM International Fuzzing Workshop, 52-60. Online publication date: 13-Sep-2024.
https://dl.acm.org/doi/10.1145/3678722.3685534

Wen T, Li Y, Zhang L, Ma H, Pan Z. 2024. An Empirical Study on the Distance Metric in Guiding Directed Grey-box Fuzzing. 2024 IEEE 35th International Symposium on Software Reliability Engineering (ISSRE), 307-318. Online publication date: 28-Oct-2024.
https://doi.org/10.1109/ISSRE62328.2024.00038

Lan W, Zhang J, Yang H, Cui Z. 2024. A directed greybox fuzzing tool for continuous integration. SoftwareX 27 (101824). Online publication date: Sep-2024.
https://doi.org/10.1016/j.softx.2024.101824

Benahmed S, Qasem A, Lounis A, Debbabi M. 2024. Modularizing Directed Greybox Fuzzing for Binaries over Multiple CPU Architectures. Detection of Intrusions and Malware, and Vulnerability Assessment, 84-103. Online publication date: 9-Jul-2024.
https://doi.org/10.1007/978-3-031-64171-8_5

Luo C, Meng W, Li P. 2023. SelectFuzz: Efficient Directed Fuzzing with Selective Path Exploration. 2023 IEEE Symposium on Security and Privacy (SP), 2693-2707. Online publication date: May-2023.
https://doi.org/10.1109/SP46215.2023.10179296

Paduraru C, Cernat M, Staicu A. 2023. Concolic execution for RPA testing. 2023 27th International Conference on Engineering of Complex Computer Systems (ICECCS), 187-196. Online publication date: 14-Jun-2023.
https://doi.org/10.1109/ICECCS59891.2023.00031

Bai W, Wu K, Wu Q, Lu K. 2023. Guiding Directed Fuzzing with Feasibility. 2023 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW), 42-49. Online publication date: Jul-2023.
https://doi.org/10.1109/EuroSPW59978.2023.00010

Ganz T, Rall P, Härterich M, Rieck K. 2023. Hunting for Truth: Analyzing Explanation Methods in Learning-based Vulnerability Discovery. 2023 IEEE 8th European Symposium on Security and Privacy (EuroS&P), 524-541. Online publication date: Jul-2023.
https://doi.org/10.1109/EuroSP57164.2023.00038
