Abstract
Musical Instrument Identification research is an important problem in Music Information Retrieval (MIR) in which most of the research going on now is using signal processing method. In this paper, at first a model that uses harmonic structured Gaussian mixture for modeling instrument is described and EM algorithm is used to estimate parameters in the model. Therefore features such as Harmonic Temporal Timbre Energy Ratio (HTTER) and Harmonic Temporal Timbre Envelope Similarity (HTTES) are generated from the model. To utilize the features efficiently, a new boosting algorithm based on Probabilistic Decisions is proposed for musical instrument identification. In contrast to the conventional boosting algorithm, which uses a deterministic decision method during the iterations and which does not consider the noise in the data set sufficiently, the new boosting algorithm is proposed to use probabilistic decisions for every hypothesis at the iterations of the boosting scheme, selecting the data events from a dataset, and then combines them. It improves the musical instrument classifier without using boosting approach and the conventional boosting algorithm significantly, which was proved by the experimental.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Brown, R.J.C.: Computer identification of musical instruments using pattern recognition with cepstral coefficients as features. Journal of the Acoustical Society of America 105(3), 1933–1941 (1999)
Eronen, A., Klapuri, A.: Musical instrument recognition using cepstral coefficients and temporal features. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2000), Istanbul, Turkey, vol. 2, pp. 753–756 (2000)
Agostini, G., Longari, M., Pollastri, E.: Musical instrument timbres classification with spectral features. EURASIP Journal on Applied Signal Processing 2003(1), 5–14 (2003)
Kinoshita, T., Sakai, S., Tanaka, H.: Musical sound source identification based on frequency component adaptation. In: Proceedings of IJCAI Workshop on Computational Auditory Scene Analysis (IJCAI-CASA 1999), Stockholm, Sweden, pp. 18–24 (1999)
Eggink, J., Brown, G.J.: Application of missing feature theory to the recognition of musical instruments in polyphonic audio. In: Proceedings of International Symposium on Music Information Retrieval (ISMIR 2003), Baltimore, Md, USA (2003)
Kitahara, T., Goto, M., Komatani, K., Ogata, T., Okuno, H.G.: Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps. EURASIP Journal on Advances in Signal Processing 2007, Article ID 51979, 15 pages (2007)
Kameoka, H., Nishimoto, T., Sagayama, S.: A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering. IEEE Trans. on Audio, Speech and Language Processing 15(3), 982–994 (2007)
Klapuri, A.: Multiple fundamental frequency estimation based on harmonicity and spectral smoothness. IEEE Trans. Speech and Audio Processing 11(6), 804–816 (2003)
Goto, M.: A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals. ISCA J. 43(4), 311–329 (2004)
Miyamoto, K., Kameoka, H., Nishimoto, T., Ono, N., Sagayama, S.: Harmonic-Temporal-Timbral Clustering (HTTC) For the Analysis of Multi-instrument Polyphonic Music Signals. In: Proc. of ICASSP, pp. 113–116 (2008)
Healy, M., Sourabh, R., Anderson, D.: Effects of Varying Parameters in Asymmetric AdaBoost on the Accuracy of a Cascade Audio Classifier. In: Proceedings of the Southeastern Conference of the Institute of Electrical and Electronics Engineers, pp. 169–172 (2004)
Dixon, S., Gouyon, F., Widmer, G.: Towards characterization of music via rhythmic patterns. In: Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR), pp. 509–516 (2004)
McKay, C., Fiebrink, R., McEnnis, D., Li, B., Fujinaga, I.: A framework for optimizing music classification. In: Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR), pp. 42–49 (2005)
Bergstra, J., Casagrande, N., Erhan, D., Eck, D., Kégl, B.: Aggregate features and AdaBoost for music classification. Machine Learning 65(2-3), 473–484 (2006)
Turnbull, D., Lanckriet, G., Pampalk, E., Goto, M.: A Supervised Approach for Detecting Boundaries in Music Using Difference Features and Boosting. In: Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR), pp. 42–49 (2007)
Yang, Y., Lin, Y., Su, Y., Chen, H.: Music Emotion Classification: A Regression Approach. In: Proceedings IEEE Int. Conf. Multimedia and Expo. (ICME 2007), pp. 208–211 (2007)
Goto, M., Hashiguchi, H., Nishimura, T., Oka, R.: RWC music database: Popular, classical, and jazz music database. In: Proceedings. ISMIR, pp. 287–288 (2002)
Wu, J., Kim, Y., Song, C., Lee, W.: A New Classifier to Deal with Incomplete Data. In: The Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2008), Thailand, pp. 105–110 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, J., Sagayama, S. (2012). Musical Instrument Identification Based on New Boosting Algorithm with Probabilistic Decisions. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K., Mohanty, S. (eds) Speech, Sound and Music Processing: Embracing Research in India. CMMR FRSM 2011 2023. Lecture Notes in Computer Science, vol 7172. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31980-8_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-31980-8_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31979-2
Online ISBN: 978-3-642-31980-8
eBook Packages: Computer ScienceComputer Science (R0)