default search action
19. SPECOM 2017: Hatfield, UK
- Alexey Karpov, Rodmonga Potapova, Iosif Mporas:
Speech and Computer - 19th International Conference, SPECOM 2017, Hatfield, UK, September 12-16, 2017, Proceedings. Lecture Notes in Computer Science 10458, Springer 2017, ISBN 978-3-319-66428-6
Invited Talks
- Mark J. F. Gales, Kate M. Knill, Anton Ragni:
Low-Resource Speech Recognition and Keyword-Spotting. 3-19 - Björn W. Schuller:
Big Data, Deep Learning - At the Edge of X-Ray Speaker Analysis. 20-34
Conference Papers
- Niksa Jakovljevic, Ivan D. Jokic, Slobodan Josic, Vlado Delic:
A Comparison of Covariance Matrix and i-vector Based Speaker Recognition. 37-45 - Oliver Jokisch, Horst-Udo Hain:
A Trainable Method for the Phonetic Similarity Search in German Proper Names. 46-55 - Michaela Strinzel, Vasilisa Verkhodanova, Fedor Jalvingh, Roel Jonkers, Matt Coler:
Acoustic and Perceptual Correlates of Vowel Articulation in Parkinson's Disease With and Without Mild Cognitive Impairment: A Pilot Study. 56-64 - Ingo Siegert, Oliver Jokisch, Alicia Flores Lotz, Franziska Trojahn, Martin Meszaros, Michael Maruschke:
Acoustic Cues for the Perceptual Assessment of Surround Sound. 65-75 - Ivan Medennikov, Aleksei Romanenko, Alexey Prudnikov, Valentin Mendelev, Yuri Y. Khokhlov, Maxim Korenevsky, Natalia A. Tomashenko, Alexander Zatvornitskiy:
Acoustic Modeling in the STC Keyword Search System for OpenKWS 2016 Evaluation. 76-86 - Federico Landini, Luciana Ferrer, Horacio Franco:
Adaptation Approaches for Pronunciation Scoring with Sparse Training Data. 87-97 - Sri Harsha Dumpala, K. N. R. K. Raju Alluri:
An Algorithm for Detection of Breath Sounds in Spontaneous Speech with Application to Speaker Recognition. 98-108 - Fahim A. Salim, Fasih Haider, Owen Conlan, Saturnino Luz:
An Alternative Approach to Exploring a Video. 109-118 - Jan Svec, Lubos Smídl, Josef V. Psutka:
An Analysis of the RNN-Based Spoken Term Detection Training. 119-129 - Anastasiia Spirina, Olesia Vaskovskaia, Tatiana Karaseva, Alina Skorokhod, Iana Polonskaia, Maxim Sidorov:
Analysis of Interaction Parameter Levels in Interaction Quality Modelling for Human-Human Conversation. 130-140 - Jindrich Matousek, Daniel Tihelka:
Annotation Error Detection: Anomaly Detection vs. Classification. 141-151 - Oleg Akhtiamov, Dmitrii Ubskii, Evgeniia Feldina, Aleksei Pugachev, Alexey Karpov, Wolfgang Minker:
Are You Addressing Me? Multimodal Addressee Detection in Human-Human-Computer Conversations. 152-161 - Otilia Kocsis, Basilis Kladis, Anastasios Tsopanoglou, Nikos Fakotakis:
Assessing Spoken Dialog Services from the End-User Perspective: Usability and Experience. 162-170 - Galina Lavrentyeva, Sergey Novoselov, Egor Malykh, Alexander Kozlov, Oleg Kudashev, Vadim Shchemelinin:
Audio-Replay Attack Detection Countermeasures. 171-181 - Abualsoud Hanani, Mohammad Al-Amleh, Waseem Bazbus, Saleem Salameh:
Automatic Estimation of Presentation Skills Using Speech, Slides and Gestures. 182-191 - Vera Evdokimova, Pavel A. Skrelin, Tatiana Chukaeva:
Automatic Phonetic Transcription for Russian: Speech Variability Modeling. 192-199 - Amir Hossein Poorjam, Soheila Hesaraki, Saeid Safavi, Hugo Van hamme, Mohamad Hasan Bahari:
Automatic Smoker Detection from Telephone Speech Signals. 200-210 - Eugene Luckyanets, Aleksandr Melnikov, Oleg Kudashev, Sergey Novoselov, Galina Lavrentyeva:
Bimodal Anti-Spoofing System for Mobile Security. 211-220 - Tatiana Shevchenko, Daria Pozdeeva:
Canadian English Word Stress: A Corpora-Based Study of National Identity in a Multilingual Community. 221-232 - István Szekrényes, György Kovács:
Classification of Formal and Informal Dialogues Based on Turn-Taking and Intonation Using Deep Neural Networks. 233-243 - Andrey Shulipa, Aleksey Sholohov, Yuri Matveev:
Clustering Target Speaker on a Set of Telephone Dialogs. 244-252 - Rodmonga Potapova, Vsevolod Potapov:
Cognitive Entropy in the Perceptual-Auditory Evaluation of Emotional Modal States of Foreign Language Communication Partner. 253-261 - Eugeny U. Kostyuchenko, Roman V. Meshcheryakov, Dariya Ignatieva, Alexander Pyatkov, Evgeniy L. Choynzonov, Lidiya N. Balatskaya:
Correlation Normalization of Syllables and Comparative Evaluation of Pronunciation Quality in Speech Rehabilitation. 262-271 - Markéta Juzová:
CRF-Based Phrase Boundary Detection Trained on Large-Scale TTS Speech Corpora. 272-281 - Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Deep Recurrent Neural Networks in Speech Synthesis Using a Continuous Vocoder. 282-291 - Andrey Barabanov, Evgenij Vikulov:
Design of Online Echo Canceller in Duplex Mode. 292-301 - Maria Skeppstedt, Vasiliki Simaki, Carita Paradis, Andreas Kerren:
Detection of Stance and Sentiment Modifiers in Political Blogs. 302-311 - Josef Chaloupka:
Digits to Words Converter for Slavic Languages in Systems of Automatic Speech Recognition. 312-321 - Halim Sayoud, Siham Ouamour, Zohra Hamadache:
Discriminating Speakers by Their Voices - A Fusion Based Approach. 322-331 - Aitzol Astigarraga, José María Martínez-Otzeta, Igor Rodriguez Rodriguez, Basilio Sierra, Elena Lazkano:
Emotional Poetry Generation. 332-342 - Branislav M. Popovic, Edvin Pakoci, Darko Pekar:
End-to-End Large Vocabulary Speech Recognition for the Serbian Language. 343-352 - Nikolaos Spatiotis, Michael Paraskevas, Isidoros Perikos, Iosif Mporas:
Examining the Impact of Feature Selection on Sentiment Analysis for the Greek Language. 353-361 - Irina S. Kipyatkova:
Experimenting with Hybrid TDNN/HMM Acoustic Models for Russian Speech Recognition. 362-369 - Emer Gilmartin, Benjamin R. Cowan, Carl Vogel, Nick Campbell:
Exploring Multiparty Casual Talk for Social Human-Machine Dialogue. 370-378 - Cédric Fayet, Arnaud Delhay, Damien Lolive, Pierre-François Marteau:
First Experiments to Detect Anomaly Using Personality Traits vs. Prosodic Features. 379-388 - Purvi Agrawal, Hemant A. Patil:
Fusion of a Novel Volterra-Wiener Filter Based Nonlinear Residual Phase and MFCC for Speaker Verification. 389-397 - Vasilisa Verkhodanova, Vladimir Shapranov, Irina S. Kipyatkova:
Hesitations in Spontaneous Speech: Acoustic Analysis and Detection. 398-406 - Rodmonga Potapova, Vsevolod Potapov:
Human as Acmeologic Entity in Social Network Discourse (Multidimensional Approach). 407-416 - Thai Son Nguyen, Kevin Kilgour, Matthias Sperber, Alex Waibel:
Improved Speaker Adaptation by Combining I-vector and fMLLR with Deep Bottleneck Networks. 417-426 - Petr Mizera, Petr Pollák:
Improving of LVCSR for Causal Czech Using Publicly Available Language Resources. 427-437 - Saeid Safavi, Iosif Mporas:
Improving Performance of Speaker Identification Systems Using Score Level Fusion of Two Modes of Operation. 438-444 - Ingo Siegert, Alicia Flores Lotz, Olga Egorow, Andreas Wendemuth:
Improving Speech-Based Emotion Recognition by Using Psychoacoustic Modeling and Analysis-by-Synthesis. 445-455 - Natalia Bogdanova-Beglarian:
In Search of Sentence Boundaries in Spontaneous Speech. 456-463 - Gábor Pintér, Oliver Jokisch, Shinobu Mizuguchi:
Investigating Acoustic Correlates of Broad and Narrow Focus Perception by Japanese Learners of English. 464-472 - Markus Müller, Sebastian Stüker, Alex Waibel:
Language Adaptive Multilingual CTC Speech Recognition. 473-482 - Edvin Pakoci, Branislav M. Popovic, Darko Pekar:
Language Model Optimization for a Deep Neural Network Based Speech Recognition System for Serbian. 483-492 - Rodmonga Potapova, Liliya Komalova:
Lexico-Semantical Indices of "Deprivation - Aggression" Modality Correlation in Social Network Discourse. 493-502 - Natalia Bogdanova-Beglarian, Tatiana Y. Sherstinova, Olga Blinova, Gregory Y. Martynenko:
Linguistic Features and Sociolinguistic Variability in Everyday Spoken Russian. 503-511 - Erik Edwards, Wael Salloum, Greg Finley, James Fone, Greg Cardiff, Mark Miller, David Suendermann-Oeft:
Medical Speech Recognition: Reaching Parity with Humans. 512-524 - Sergey I. Salishev, Ilya Klotchkov, Andrey Barabanov:
Microphone Array Post-filter in Frequency Domain for Speech Recognition Using Short-Time Log-Spectral Amplitude Estimator and Spectral Harmonic/Noise Classifier. 525-534 - Abhimanyu Popli, Arun Kumar:
Multimodal Keyword Search for Multilingual and Mixlingual Speech Corpus. 535-545 - Natalia E. Maslova, Vsevolod Potapov:
Neural Network Doc2vec in Automated Sentiment Analysis for Short Informal Texts. 546-554 - Zbynek Zajíc, Jan Zelinka, Ludek Müller:
Neural Network Speaker Descriptor in Speaker Diarization of Telephone Speech. 555-563 - Ami Gandhi, Hemant A. Patil:
Novel Linear Prediction Temporal Phase Based Features for Speaker Recognition. 564-571 - Apeksha J. Naik, Rishabh Tak, Hemant A. Patil:
Novel Phase Encoded Mel Cepstral Features for Speaker Verification. 572-581 - Boris Lobanov, Yelena Karnevskaya, Vladimir Zhitko:
On a Way to the Computer Aided Speech Intonation Training. 582-592 - Egor Malykh, Sergey Novoselov, Oleg Kudashev:
On Residual CNN in Text-Dependent Speaker Verification Task. 593-601 - Elena E. Lyakso, Olga V. Frolova, Aleksey Grigorev:
Perception and Acoustic Features of Speech of Children with Autism Spectrum Disorders. 602-612 - Marek Hrúz, Petr Salajka:
Phase Analysis and Labeling Strategies in a CNN-Based Speaker Change Detection System. 613-622 - Tatiana Y. Sherstinova:
Preparing Audio Recordings of Everyday Speech for Prosody Research: The Case of the ORD Corpus. 623-631 - Kohei Mukaihara, Sakriani Sakti, Satoshi Nakamura:
Recognizing Emotionally Coloured Dialogue Speech Using Speaker-Adapted DNN-CNN Bottleneck Features. 632-641 - Ryohei Ohno, Masanori Morise, Tetsuro Kitahara:
Relationship Between Perception of Cuteness in Female Voices and Their Durations. 642-650 - Li Meng, Aruna Shenoy:
Retaining Expression on De-identified Faces. 651-661 - Miroslav Hlavác, Ivan Gruber, Milos Zelezný, Alexey Karpov:
Semi-automatic Facial Key-Point Dataset Creation. 662-668 - Athanasios Koutras:
Song Emotion Recognition Using Music Genre Information. 669-679 - Maxim Tkachenko, Alexander Yamshinin, Nikolay Lyubimov, Mikhail Kotov, Marina Nastasenko:
Speech Enhancement for Speaker Recognition Using Deep Recurrent Neural Networks. 690-699 - Vasiliki Simaki, Carita Paradis, Andreas Kerren:
Stance Classification in Texts from Blogs on the 2016 British Referendum. 700-709 - Arto Mustajoki, Tatiana Y. Sherstinova:
The "Retrospective Commenting" Method for Longitudinal Recordings of Everyday Speech. 710-718 - Pavel Golik, Zoltán Tüske, Kazuki Irie, Eugen Beck, Ralf Schlüter, Hermann Ney:
The 2016 RWTH Keyword Search System for Low-Resource Languages. 719-730 - Anton Stepikhov, Anastassia Loukina:
The Effect of Morphological Factors on Sentence Boundaries in Russian Spontaneous Speech. 731-740 - Arman Kaliyev, Sergey V. Rybin, Yuri N. Matveev:
The Pausing Method Based on Brown Clustering and Word Embedding. 741-747 - Jaromír Novotný, Pavel Ircing:
Unsupervised Document Classification and Topic Detection. 748-756 - Denis Ivanko, Alexey Karpov, Dmitry Ryumin, Irina S. Kipyatkova, Anton I. Saveliev, Victor Budkov, Dmitriy Ivanko, Milos Zelezný:
Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy Conditions. 757-766 - Karel Palecek:
Utilizing Lipreading in Large Vocabulary Continuous Speech Recognition. 767-776 - Susmitha Vekkot, Shikha Tripathi:
Vocal Emotion Conversion Using WSOLA and Linear Prediction. 777-787 - Vadim Zahariev, Elias Azarov, Alexander A. Petrovsky:
Voice Conversion for TTS Systems with Tuning on the Target Speaker Based on GMM. 788-798 - Ladan Baghai-Ravary, Steve W. Beet:
VoiScan: Telephone Voice Analysis for Health and Biometric Applications. 799-808 - Alaa Mohasseb, Mohamed Bader-El-Den, Andreas Kanavos, Mihaela Cocea:
Web Queries Classification Based on the Syntactical Patterns of Search Types. 809-819 - Yang Chao, Marie-Luce Bourguet:
What Speech Recognition Accuracy is Needed for Video Transcripts to be a Useful Search Interface? 820-828
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.