- Research Article
- Open access
Wavelets in Recognition of Bird Sounds
EURASIP Journal on Advances in Signal Processing volume 2007, Article number: 051806 (2006)
Abstract
This paper presents a novel method for efficiently recognizing inharmonic and transient bird sounds. The recognition algorithm consists of feature extraction using wavelet decomposition, followed by recognition using either a supervised or an unsupervised classifier. The proposed method was tested on the sounds of eight bird species, of which five have inharmonic sounds and three reference species have harmonic sounds. Inharmonic sounds are poorly matched to conventional spectral analysis methods, because the spectral domain does not contain any visible trajectories that a computer can track and identify. Wavelet analysis was therefore selected for its ability to preserve both frequency and temporal information and to analyze signals that contain discontinuities and sharp spikes. The shift-invariant feature vectors calculated from the wavelet coefficients were used as inputs to two neural networks: the unsupervised self-organizing map (SOM) and the supervised multilayer perceptron (MLP). The results were encouraging: the SOM network recognized 78% and the MLP network 96% of the test sounds correctly.
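As a rough illustration of the feature-extraction idea, the sketch below decomposes a signal with a plain Haar wavelet and summarizes each sub-band by its normalized energy. The abstract does not say which wavelet, decomposition depth, or feature definition the method uses, so the Haar filter, the three levels, the function names `haar_step` and `band_energy_features`, and the two synthetic test signals are all assumptions made for this example. Band energies from a decimated transform are also only approximately insensitive to time shifts, so they stand in for, rather than reproduce, the shift-invariant features mentioned above.

```python
import math


def haar_step(signal):
    """One level of the orthonormal Haar transform: returns (approximation, detail)."""
    s = 1.0 / math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) * s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) * s for i in pairs]
    return approx, detail


def band_energy_features(signal, levels=3):
    """Normalized energy per sub-band: [detail_1, ..., detail_levels, approx_levels]."""
    energies = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_step(approx)
        energies.append(sum(c * c for c in detail))
    energies.append(sum(c * c for c in approx))
    total = sum(energies) or 1.0  # guard against an all-zero signal
    return [e / total for e in energies]


# Hypothetical test signals (not data from the paper).
n = 64
tone = [math.sin(2 * math.pi * 2 * t / n) for t in range(n)]  # slow, smooth oscillation
click = [0.0] * n
click[20] = 1.0  # a single sharp spike

features_tone = band_energy_features(tone)
features_click = band_energy_features(click)
print("tone :", [round(f, 3) for f in features_tone])
print("click:", [round(f, 3) for f in features_click])
```

The smooth tone concentrates its energy in the coarse approximation band, while the spike puts most of its energy in the finest detail band. That difference between sub-band profiles is the kind of information a classifier such as an SOM or MLP could take as input.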
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Selin, A., Turunen, J. & Tanttu, J.T. Wavelets in Recognition of Bird Sounds. EURASIP J. Adv. Signal Process. 2007, 051806 (2006). https://doi.org/10.1155/2007/51806
DOI: https://doi.org/10.1155/2007/51806