Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition

Shoma Ishida¹ &
Satoshi Ono¹

283 Accesses
1 Altmetric
Explore all metrics

Abstract

This paper proposes a black-box adversarial attack method to automatic speech recognition systems. Some studies have attempted to attack neural networks for speech recognition; however, these methods did not consider the robustness of generated adversarial examples against timing lag with a target speech. The proposed method in this paper adopts Evolutionary Multi-objective Optimization (EMO) that allows it generating robust adversarial examples under black-box scenario. Experimental results showed that the proposed method successfully generated adjust-free adversarial examples, which are sufficiently robust against timing lag so that an attacker does not need to take the timing of playing it against the target speech.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

Fig. 3

IMPGA: An Effective and Imperceptible Black-Box Attack Against Automatic Speech Recognition Systems

An approach for speech enhancement with dysarthric speech recognition using optimization based machine learning frameworks

Article 21 February 2023

Adversarial Examples Attack and Countermeasure for Speech Recognition System: A Survey

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Notes

Tested 60 adversarial examples can be available at https://mediaeng.ics.kagoshima-u.ac.jp/adjustFreeAE.html.
Pairred t test with 95% confidence level was performed for each comparison.

References

Alzantot M, Balaji B, Srivastava M (2018) Did you hear that? adversarial examples against automatic speech recognition. arXiv preprint arXiv:1801.00554
Athalye A, Engstrom L, Ilyas Aa (2017) Synthesizing robust adversarial examples. arXiv preprint arXiv:1707.07397
Carlini N, Wagner D (2018) Audio adversarial examples: targeted attacks on speech-to-text. In: 2018 IEEE security and privacy workshops (SPW), pp 1–7. IEEE
Goodfellow IJ, Shlens J, Szegedy C (2014) Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572
Ittichaichareon C, Suksri S, Yingthawornsuk T (2012) Speech recognition using mfcc. In: International conference on computer graphics, simulation and modeling, pp 135–138
Khare S, Aralikatte R, Mani S (2018) Adversarial black-box attacks on automatic speech recognition systems using multi-objective evolutionary optimization. arXiv preprint arXiv:1811.01312
Ono S, Hirotani Y, Nakayama S (2009) A memetic algorithm for robust optimal solution search-hybridization of multi-objective genetic algorithm and quasi-newton method. Int J Innov Comput Inf Control 5(12):5011–5019
Google Scholar
Qin Y, Carlini N, Goodfellow IA (2019) Imperceptible, robust, and targeted adversarial examples for automatic speech recognition. arXiv preprint arXiv:1903.10346
Sainath TN, Parada C (2015) Convolutional neural networks forsmall-footprint keyword spotting. In: Proceedings of the sixteenth annual conference of the international speech communication association
Storn R, Price K (1997) Differential evolution-a simple and efficient heuristic for global optimization over continuous spaces. J Global Optim 11(4):341–359
Article MathSciNet Google Scholar
Su J, Vargas DV, Sakurai K (2019) One pixel attack for fooling deep neural networks. IEEE Trans Evol Comput 23:828–41
Article Google Scholar
Suzuki T, Takeshita S, Ono S (2019) Adversarial example generation using evolutionary multi-objective optimization. In: 2019 IEEE Congress on evolutionary computation (CEC), pp 2136–2144. IEEE
Taori R, Kamsetty A, Chu B (2019) Targeted adversarial examples for black box audio systems. In: 2019 IEEE security and privacy workshops (SPW), pp 15–20. IEEE
Yakura H, Sakuma J (2018) Robust audio adversarial example for a physical attack. arXiv preprint arXiv:1810.11793
Zhang Q, Liu W, Li H (2009) The performance of a new version of moea/d on cec09 unconstrained mop test instances. In: 2009 IEEE congress on evolutionary computation, pp 203–208. IEEE

Download references

Acknowledgements

This study was partially supported by the Kayamori Foundation of Informational Science Advancement.

Author information

Authors and Affiliations

Department of Information Science and Biomedical Engineering, Graduate School of Science and Engineering, Kagoshima University, Kagoshima, Japan
Shoma Ishida & Satoshi Ono

Authors

Shoma Ishida
View author publications
You can also search for this author in PubMed Google Scholar
Satoshi Ono
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shoma Ishida.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was presented in part at the 25th International Symposium on Artificial Life and Robotics (Beppu, Oita, January 22–24, 2020).

About this article

Cite this article

Ishida, S., Ono, S. Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition. Artif Life Robotics 26, 243–249 (2021). https://doi.org/10.1007/s10015-020-00671-x

Download citation

Received: 15 April 2020
Accepted: 20 November 2020
Published: 06 January 2021
Issue Date: May 2021
DOI: https://doi.org/10.1007/s10015-020-00671-x

Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

IMPGA: An Effective and Imperceptible Black-Box Attack Against Automatic Speech Recognition Systems

An approach for speech enhancement with dysarthric speech recognition using optimization based machine learning frameworks

Adversarial Examples Attack and Countermeasure for Speech Recognition System: A Survey

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

IMPGA: An Effective and Imperceptible Black-Box Attack Against Automatic Speech Recognition Systems

An approach for speech enhancement with dysarthric speech recognition using optimization based machine learning frameworks

Adversarial Examples Attack and Countermeasure for Speech Recognition System: A Survey

Explore related subjects

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation