Abstract
This paper discusses two algorithms for restoring movie soundtracks. The first computes an unpredictability measure and applies it to broadband noise attenuation based on a psychoacoustic model. A learning decision algorithm based on a neural network determines which useful audio signal components act as maskers of the noisy spectral parts. The application of a rough set decision system to this task is also considered, and an iterative method for calculating the sound masking pattern is presented. The second algorithm precisely evaluates parasitic frequency modulation (wow) using sinusoidal components extracted from the sound spectrum. The results obtained with the proposed intelligent signal processing algorithms, and the relationship between the two routines, are presented and discussed.
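As an illustration of the first idea, the following is a minimal sketch of a spectral unpredictability measure. The abstract does not give the formula, so the sketch assumes the formulation commonly associated with MPEG-1 psychoacoustic model 2: the magnitude and phase of each spectral bin are linearly extrapolated from the two preceding frames, and the normalised distance between the predicted and the observed bin is taken as the measure. The function names `dft` and `unpredictability`, the frame length, and the hop size are hypothetical choices made for this example only.

```python
import cmath
import math


def dft(frame):
    """Naive DFT of a real frame; returns bins 0..N/2 (assumed helper)."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame))
        for k in range(n // 2 + 1)
    ]


def unpredictability(prev2, prev1, cur, eps=1e-12):
    """Per-bin unpredictability from three consecutive spectra.

    Assumed formulation: magnitude and phase are extrapolated linearly
    from the two previous frames, then compared with the current frame.
    Values near 0 suggest a predictable (tonal) bin, values near 1 a
    noise-like bin.
    """
    result = []
    for x2, x1, x0 in zip(prev2, prev1, cur):
        r_pred = 2 * abs(x1) - abs(x2)                    # predicted magnitude
        phi_pred = 2 * cmath.phase(x1) - cmath.phase(x2)  # predicted phase
        predicted = cmath.rect(r_pred, phi_pred)
        result.append(abs(x0 - predicted) / (abs(x0) + abs(r_pred) + eps))
    return result


if __name__ == "__main__":
    n, hop = 32, 5  # illustrative frame length and hop size
    tone = [math.cos(2 * math.pi * 4 * i / n) for i in range(n + 2 * hop)]
    spectra = [dft(tone[s:s + n]) for s in (0, hop, 2 * hop)]
    c = unpredictability(*spectra)
    print("unpredictability at the tone's bin:", c[4])
```

In a noise-reduction setting, bins with a low value would be candidates for useful (masking) components, while bins with a high value would be treated as noise-like. How the paper's neural network or rough set system uses the measure is not specified in the abstract and is not modelled here.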
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Czyżewski, A., Dziubiński, M., Litwic, Ł., Maziewski, P. (2006). Intelligent Algorithms for Movie Sound Tracks Restoration. In: Peters, J.F., Skowron, A. (eds) Transactions on Rough Sets V. Lecture Notes in Computer Science, vol 4100. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11847465_6
DOI: https://doi.org/10.1007/11847465_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39382-5
Online ISBN: 978-3-540-39383-2
eBook Packages: Computer Science (R0)