Abstract
This paper proposes a black-box adversarial attack method to automatic speech recognition systems. Some studies have attempted to attack neural networks for speech recognition; however, these methods did not consider the robustness of generated adversarial examples against timing lag with a target speech. The proposed method in this paper adopts Evolutionary Multi-objective Optimization (EMO) that allows it generating robust adversarial examples under black-box scenario. Experimental results showed that the proposed method successfully generated adjust-free adversarial examples, which are sufficiently robust against timing lag so that an attacker does not need to take the timing of playing it against the target speech.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
Tested 60 adversarial examples can be available at https://mediaeng.ics.kagoshima-u.ac.jp/adjustFreeAE.html.
Pairred t test with 95% confidence level was performed for each comparison.
References
Alzantot M, Balaji B, Srivastava M (2018) Did you hear that? adversarial examples against automatic speech recognition. arXiv preprint arXiv:1801.00554
Athalye A, Engstrom L, Ilyas Aa (2017) Synthesizing robust adversarial examples. arXiv preprint arXiv:1707.07397
Carlini N, Wagner D (2018) Audio adversarial examples: targeted attacks on speech-to-text. In: 2018 IEEE security and privacy workshops (SPW), pp 1–7. IEEE
Goodfellow IJ, Shlens J, Szegedy C (2014) Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572
Ittichaichareon C, Suksri S, Yingthawornsuk T (2012) Speech recognition using mfcc. In: International conference on computer graphics, simulation and modeling, pp 135–138
Khare S, Aralikatte R, Mani S (2018) Adversarial black-box attacks on automatic speech recognition systems using multi-objective evolutionary optimization. arXiv preprint arXiv:1811.01312
Ono S, Hirotani Y, Nakayama S (2009) A memetic algorithm for robust optimal solution search-hybridization of multi-objective genetic algorithm and quasi-newton method. Int J Innov Comput Inf Control 5(12):5011–5019
Qin Y, Carlini N, Goodfellow IA (2019) Imperceptible, robust, and targeted adversarial examples for automatic speech recognition. arXiv preprint arXiv:1903.10346
Sainath TN, Parada C (2015) Convolutional neural networks forsmall-footprint keyword spotting. In: Proceedings of the sixteenth annual conference of the international speech communication association
Storn R, Price K (1997) Differential evolution-a simple and efficient heuristic for global optimization over continuous spaces. J Global Optim 11(4):341–359
Su J, Vargas DV, Sakurai K (2019) One pixel attack for fooling deep neural networks. IEEE Trans Evol Comput 23:828–41
Suzuki T, Takeshita S, Ono S (2019) Adversarial example generation using evolutionary multi-objective optimization. In: 2019 IEEE Congress on evolutionary computation (CEC), pp 2136–2144. IEEE
Taori R, Kamsetty A, Chu B (2019) Targeted adversarial examples for black box audio systems. In: 2019 IEEE security and privacy workshops (SPW), pp 15–20. IEEE
Yakura H, Sakuma J (2018) Robust audio adversarial example for a physical attack. arXiv preprint arXiv:1810.11793
Zhang Q, Liu W, Li H (2009) The performance of a new version of moea/d on cec09 unconstrained mop test instances. In: 2009 IEEE congress on evolutionary computation, pp 203–208. IEEE
Acknowledgements
This study was partially supported by the Kayamori Foundation of Informational Science Advancement.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work was presented in part at the 25th International Symposium on Artificial Life and Robotics (Beppu, Oita, January 22–24, 2020).
About this article
Cite this article
Ishida, S., Ono, S. Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition. Artif Life Robotics 26, 243–249 (2021). https://doi.org/10.1007/s10015-020-00671-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10015-020-00671-x