[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2980258.2980296acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiciaConference Proceedingsconference-collections
research-article

Dimension Compactness in Speaker Identification

Published: 25 August 2016 Publication History

Editorial Notes

NOTICE OF CONCERN: ACM has received evidence that casts doubt on the integrity of the peer review process for the ICIA 2016 Conference. As a result, ACM is issuing a Notice of Concern for all papers published and strongly suggests that the papers from this Conference not be cited in the literature until ACM's investigation has concluded and final decisions have been made regarding the integrity of the peer review process for this Conference.

Abstract

The automatic speaker identification procedure is used to extract features that help to identify the components of the acoustic signal by discarding all the other stuff like background noise, emotion, hesitation, etc. The acoustic signal is generated by a human that is filtered by the shape of the vocal tract, including tongue, teeth, etc. The shape of the vocal tract determines and produced, what signal comes out in real time. The analytically develops shape of the vocal tract, which exhibits envelop for the short time power spectrum. The ASR needs efficient way of extracting features from the acoustic signal that is used effectively to makes the shape of the individual vocal tract. To identify any acoustic signal in the large collection of acoustic signal i.e. corpora, it needs dimension compactness of total variability space by using the GMM mean supervector. This work presents the efficient way to implement dimension compactness in total variability space and using cosine distance scoring to predict a fast output score for small size utterance.

References

[1]
Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P. 2001. Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech lang. Process. 19, 788--798.
[2]
Kenny, P., Gupta, V., Stafylakis, T., Ouellet, P and J. Alam. 2014. Deep Neural Networks for extracting Baum-Welch statistics for Speaker. In Proc. of Odyssey-2014.
[3]
Kobayashi, T., Ye, J. 2014. Acoustic feature extraction by statistics based local binary pattern for environmental sound classification. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP- 2014), 3052--3056.
[4]
Kumar, K, Kim, C,. Stern, R. 2011. Delta-spectral cepstral coefficients for robust speech recognition. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP- 2011). 4784--4787. 2011.
[5]
Kenney, P., Boulianne, G., Dumouchel, P. 2005. Eigen voice modeling with sparse training data. IEEE transaction on speech and audio processing. Vol 13, No 3, 345--354.
[6]
Bahari, M., Saeid, R., Hamme, H., Leeuwen, D. 2013. Accent recognition using i-vector, Gaussian Mean Supervector and Gaussian posterior probability supervector for spontaneous telephone speech. IEEE International Conference on Acoustics, Speech and Signal Processing. 7344--7348.
[7]
Kanrar, S., et al. 2015. Text and Language Independent Speaker Identification by GMM based i Vector. In Proceedings of the Sixth International Conference on Computer and Communication Technology ICCCT '15. 95--100.
[8]
Kanrar, S. 2015. Impact of Threshold to Identify Vocal Tract. Advances in Intelligent Systems and Computing. Vol 404, 97--105. 2015.
[9]
Reynolds, D., Quatieri, T., Dunn, R. 2000.Speaker verification using adapted Gaussian mixture models. Digital Signal processing, Vol. 10, 19--41.
[10]
Xiang, B., et.al. 2002. Short-time gaussianization for robust speaker verification. In Proc. ICASSP, Vol. 1, 9681--684.
[11]
Reynolds, D. 1995. Automatic speaker recognition using Gaussian mixture speaker models. Lincoln Laboratory J. 8(2), 173--191.
[12]
Reynolds, D. 2003. Channel robust speaker verification via feature mapping. Proc, Internet. Conf. Acoustics Speech Signal Process, 53--56.
[13]
Martinez, D., Plchot, O., Burget, L., Glembek, O., Matejka, P. 2011. Language recognition in i vectors space. In: Annual Conference of the International Speech Communication Association (Inter speech), 861--864.
[14]
Kenny, P., Boulianne, G., Ouellet, P., Dumouchel, P. 2007. Joint factor analysis versus eigen channels in speaker recognition. IEEE Trans. Audio, Speech, Lang, Process. vol. 15, no.4, 1435--1447.
[15]
Kanrar S., et al. 2015. Detect Mimicry by speaker Recognition system. Adv. Intell. Syst. Comput. 339, 21--31.
[16]
Kanrar S., et al. 2016. E-health monitoring system enhancement with Gaussian mixture model. Multimedia Tools and Applications, Springer US. 1--23.
[17]
Kanrar S., et al. 2016. Video traffic analytics for large scale surveillance. Multimedia Tools and Applications, Springer US. 1--28.

Cited By

View all
  • (2023)Secure digital documents sharing using blockchain and attribute-based cryptosystemMultiagent and Grid Systems10.3233/MGS-22136118:3-4(365-379)Online publication date: 3-Feb-2023
  • (2022)A novel model to enhance the data security in cloud environmentMultiagent and Grid Systems10.3233/MGS-22036118:1(45-63)Online publication date: 1-Jan-2022
  • (2022)Robust Threshold Selection for Environment Specific Voice in Speaker RecognitionWireless Personal Communications: An International Journal10.1007/s11277-022-09852-2126:4(3071-3092)Online publication date: 1-Oct-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICIA-16: Proceedings of the International Conference on Informatics and Analytics
August 2016
868 pages
ISBN:9781450347563
DOI:10.1145/2980258
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 August 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Cepstral coefficient
  2. Cosine scoring
  3. Dimension Compactness
  4. Feature Vector
  5. Spectral Analysis
  6. Supervector

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICIA-16

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Secure digital documents sharing using blockchain and attribute-based cryptosystemMultiagent and Grid Systems10.3233/MGS-22136118:3-4(365-379)Online publication date: 3-Feb-2023
  • (2022)A novel model to enhance the data security in cloud environmentMultiagent and Grid Systems10.3233/MGS-22036118:1(45-63)Online publication date: 1-Jan-2022
  • (2022)Robust Threshold Selection for Environment Specific Voice in Speaker RecognitionWireless Personal Communications: An International Journal10.1007/s11277-022-09852-2126:4(3071-3092)Online publication date: 1-Oct-2022
  • (2021)Vehicle Detection and Count in the Captured Stream Video Using Machine LearningMachine Learning Approaches for Urban Computing10.1007/978-981-16-0935-0_5(79-112)Online publication date: 29-Apr-2021
  • (2020)Pedestrian localisation in the typical indoor environmentsMultimedia Tools and Applications10.1007/s11042-020-09291-w79:37-38(27833-27866)Online publication date: 1-Oct-2020
  • (2016)Fast load balancing approach for growing clusters by Bioinformatics2016 International Conference on Signal Processing, Communication, Power and Embedded System (SCOPES)10.1109/SCOPES.2016.7955857(382-385)Online publication date: Oct-2016

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media