default search action
24th TSD 2021: Olomouc, Czech Republic
- Kamil Ekstein, Frantisek Pártl, Miloslav Konopík:
Text, Speech, and Dialogue - 24th International Conference, TSD 2021, Olomouc, Czech Republic, September 6-9, 2021, Proceedings. Lecture Notes in Computer Science 12848, Springer 2021, ISBN 978-3-030-83526-2
Keynote Talks
- Diyi Yang, Lucie Flek:
Towards User-Centric Text-to-Text Generation: A Survey. 3-22 - Amirpasha Ghabussi, Lili Mou, Olga Vechtomova:
Wasserstein Autoencoders with Mixture of Gaussian Priors for Stylized Text Generation. 23-31
Text
- Sidney Evaldo Leal, Edresson Casanova, Gustavo Paetzold, Sandra M. Aluísio:
Evaluating Semantic Similarity Methods to Build Semantic Predictability Norms of Reading Data. 35-47 - Tomás Jelínek, Jan Krivan, Vladimír Petkevic, Hana Skoumalová, Jana Sindlerová:
SYN2020: A New Corpus of Czech with an Innovated Annotation. 48-59 - Juan S. Lara, Mario Ezra Aragón, Fabio A. González, Manuel Montes-y-Gómez:
Deep Bag-of-Sub-Emotions for Depression Detection in Social Media. 60-72 - Dou Liu, Tingting Zhu, Jörg Schlötterer, Christin Seifert, Shenghui Wang:
Rewriting Fictional Texts Using Pivot Paraphrase Generation and Character Modification. 73-85 - Jan Svec, Jan Lehecka, Lubos Smídl, Pavel Ircing:
Transformer-Based Automatic Punctuation Prediction and Word Casing Reconstruction of the ASR Output. 86-94 - Gábor Bella, Khuyagbaatar Batsuren, Fausto Giunchiglia:
A Database and Visualization of the Similarity of Contemporary Lexicons. 95-104 - Manfred Klenner, Anne Göhring:
The Detection of Actors for German. 105-110 - Ander Cejudo, Owen Trigueros, Alicia Pérez, Arantza Casillas, Daniel Cobos:
Verbal Autopsy: First Steps Towards Questionnaire Reduction. 111-123 - Wen-Ting Tseng, Yung-Chang Hsu, Berlin Chen:
Effective FAQ Retrieval and Question Matching Tasks with Unsupervised Knowledge Injection. 124-134 - Ashwin Geet D'Sa, Irina Illina, Dominique Fohr, Dietrich Klakow, Dana Ruiter:
Exploring Conditional Language Model Based Data Augmentation Approaches for Hate Speech Classification. 135-146 - Jackylyn Beredo, Carlo Migel Bautista, Macario O. Cordel, Ethel Ong:
Generating Empathetic Responses with a Pre-trained Conversational Model. 147-158 - Klára Bendová, Silvie Cinková:
Adaptation of Classic Readability Metrics to Czech. 159-171 - Maksim Duszkin, Danuta Roszko, Roman Roszko:
New Parallel Corpora of Baltic and Slavic Languages - Assumptions of Corpus Construction. 172-183 - Anton Golubev, Natalia V. Loukachevitch:
Use of Augmentation and Distant Supervision for Sentiment Analysis in Russian. 184-196 - Milan Straka, Jakub Náplava, Jana Straková, David Samuel:
RobeCzech: Czech RoBERTa, a Monolingual Contextualized Language Representation Model. 197-209 - Prateek Saxena, Soma Paul:
Labelled EPIE: A Dataset for Idiom Sense Disambiguation. 210-221 - Eszter Simon, Noémi Vadász:
Introducing NYTK-NerKor, A Gold Standard Hungarian Named Entity Annotated Corpus. 222-234 - Samiran Pal, Avinash Kumar Singh, Soham Datta, Sangameshwar Patil, Indrajit Bhattacharya, Girish Keshav Palshikar:
Semantic Templates for Generating Long-Form Technical Questions. 235-247 - Gil Rocha, Henrique Lopes Cardoso:
Rethinking Adversarial Training for Language Adaptation. 248-260 - Maarten Janssen:
A Corpus with Wavesurfer and TEI: Speech and Video in TEITOK. 261-268 - Samuel Pecar, Marián Simko:
Exploiting Subjectivity Knowledge Transfer for End-to-End Aspect-Based Sentiment Analysis. 269-280 - Ondrej Sotolár, Jaromír Plhák, David Smahel:
Towards Personal Data Anonymization for Social Messaging. 281-292 - Matyás Kopp, Vladislav Stankov, Jan Oldrich Kruza, Pavel Stranák, Ondrej Bojar:
ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata. 293-304 - Kamran Ibiyev, Attila Novák:
Using Zero-Shot Transfer to Initialize azWikiNER, a Gold Standard Named Entity Corpus for the Azerbaijani Language. 305-317 - Melika Golestani, Seyedeh Zahra Razavi, Zeinab Borhanifard, Farnaz Tahmasebian, Hesham Faili:
Using BERT Encoding and Sentence-Level Language Model for Sentence Ordering. 318-330 - Kentaro Kamiya, Takuya Kawase, Ryuichiro Higashinaka, Katashi Nagao:
Using Presentation Slides and Adjacent Utterances for Post-editing of Speech Recognition Results for Meeting Recordings. 331-340 - Nima Nabizadeh, Heiko Wersing, Dorothea Kolossa:
Leveraging Inter-step Dependencies for Information Extraction from Procedural Task Instructions. 341-353
Speech
- Irina Illina, Dominique Fohr:
DNN-Based Semantic Rescoring Models for Speech Recognition. 357-370 - Petr Cerva, Lukás Mateju, Frantisek Kynych, Jindrich Zdánský, Jan Nouza:
Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-Vectors. 371-381 - Tamás Grósz, Mikko Kurimo:
LSTM-XL: Attention Enhanced Long-Term Memory for LSTM Cells. 382-393 - Swayambhu Nath Ray, Soumyajit Mitra, Raghavendra Bilgi, Sri Garimella:
Improving RNN-T ASR Performance with Date-Time and Location Awareness. 394-404 - Brett Drury, Samuel Morais Drury:
BrAgriSpeech: A Corpus of Brazilian-Portuguese Agricultural Reported Speech. 405-412 - Jing Liu, Rupak Vignesh Swaminathan, Sree Hari Krishnan Parthasarathi, Chunchuan Lyu, Athanasios Mouchtaris, Siegfried Kunzmann:
Exploiting Large-Scale Teacher-Student Training for On-Device Acoustic Models. 413-424 - F. Amal Jude Ashwin, V. Srinivasa Chakravarthy, Sunil Kumar Kopparapu:
An AI-Based Detection System for Mudrabharati: A Novel Unified Fingerspelling System for Indic Scripts. 425-434 - Cristian D. Ríos-Urrego, Juan Camilo Vásquez-Correa, Juan Rafael Orozco-Arroyave, Elmar Nöth:
Is There Any Additional Information in a Neural Network Trained for Pathological Speech Classification? 435-447 - Michal Vrastil, Jindrich Matousek:
On Comparison of XGBoost and Convolutional Neural Networks for Glottal Closure Instant Detection. 448-456 - Paula Andrea Pérez-Toro, Juan Camilo Vásquez-Correa, Tomás Arias-Vergara, Philipp Klumpp, Maria Schuster, Elmar Nöth, Juan Rafael Orozco-Arroyave:
Emotional State Modeling for the Assessment of Depression in Parkinson's Disease. 457-468 - Dejan Porjazovski, Juho Leinonen, Mikko Kurimo:
Attention-Based End-to-End Named Entity Recognition from Speech. 469-480 - Aidar Khusainov, Dzhavdet Suleymanov, Ilnur Muhametzyanov:
Incorporation of Iterative Self-supervised Pre-training in the Creation of the ASR System for the Tatar Language. 481-488 - Zdenek Hanzlícek, Jakub Vít, Markéta Rezácková:
Speakers Talking Foreign Languages in a Multi-lingual TTS System. 489-498 - Amin Honarmandi Shandiz, László Tóth:
Voice Activity Detection for Ultrasound-Based Silent Speech Interfaces Using Convolutional Neural Networks. 499-510 - Daniel Tihelka, Jindrich Matousek, Alice Tihelková:
How Much End-to-End is Tacotron 2 End-to-End TTS System. 511-522 - Josef V. Psutka, Jan Svec, Ales Prazák:
CNN-TDNN-Based Architecture for Speech Recognition Using Grapheme Models in Bilingual Czech-Slovak Task. 523-533
Dialogue
- Auriane Boudin, Roxane Bertrand, Stéphane Rauzy, Magalie Ochs, Philippe Blache:
A Multimodal Model for Predicting Conversational Feedbacks. 537-549 - Pavel Kholiavin, Alla Menshikova, Tatiana Kachkovskaia, Daniil Kocharov:
Estimating Social Distance Between Interlocutors with MFCC-Based Acoustic Models for Vowels. 550-557 - Taisei Najima, Tsuneo Kato, Akihiro Tamura, Seiichi Yamamoto:
Remote Learning of Speaking in Syntactic Forms with Robot-Avatar-Assisted Language Learning System. 558-566
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.