Abstract
We propose a new generative model for polyphonic music based on nonlinear Independent Subspace Analysis (ISA) and factorial Hidden Markov Models (HMM). ISA represents chord spectra as sums of note power spectra and note spectra as sums of instrument-dependent log-power spectra. HMM models note duration. Instrument-dependent parameters are learnt on solo excerpts and used to transcribe musical recordings as collections of notes with time-varying power and other descriptive parameters such as vibrato. We prove the relevance of our modeling assumptions by comparing them with true data distributions and by giving satisfying transcriptions of two duo recordings.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Vincent, E., Févotte, C., Gribonval, R.: A tentative typology of audio source separation tasks. In: Proc. ICA, pp. 715–720 (2003)
Eggink, J., Brown, G.: Application of missing feature theory to the recognition of musical instruments in polyphonic audio. In: Proc. ISMIR, pp. 125–131 (2003)
Abdallah, S., Plumbley, M.: An ICA approach to automatic music transcription. In: Proc. 114th AES Convention (2003)
Virtanen, T.: Sound source separation using sparse coding with temporal continuity objective. In: Proc. ICMC (2003)
Eronen, A.: Musical instrument recognition using ICA-based transform of features and discriminatively trained HMMs. In: Proc. ISSPA (2003)
Mitianoudis, N., Davies, M.: Intelligent audio source separation using Independent Component Analysis. In: Proc. 112th AES Convention (2002)
Roweis, S.: One microphone source separation. In: Proc. NIPS, pp. 793–799 (2000)
Ghahramani, Z., Jordan, M.: Factorial hidden Markov models. Machine Learning 29, 245–273 (1997)
Penny, W., Everson, R., Roberts, S.: Hidden Markov Independent Components Analysis. In: Advances in Independent Component Analysis, Springer, Heidelberg (2000)
Hand, D., Yu, K.: Idiot’s bayes - not so stupid after all? International Statistical Review 69, 385–398 (2001)
Ostendorf, M., Digalakis, V., Kimball, O.: From HMMs to segment models: a unified view of stochastic modeling for speech recognition. IEEE Trans. on Speech and Audio Processing 4, 360–378 (1996)
Vincent, E., Rodet, X.: Underdetermined source separation with structured source priors. In: Proc. ICA (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vincent, E., Rodet, X. (2004). Music Transcription with ISA and HMM. In: Puntonet, C.G., Prieto, A. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2004. Lecture Notes in Computer Science, vol 3195. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30110-3_151
Download citation
DOI: https://doi.org/10.1007/978-3-540-30110-3_151
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23056-4
Online ISBN: 978-3-540-30110-3
eBook Packages: Springer Book Archive