default search action
24th O-COCOSDA 2021: Singapore
- 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2021, Singapore, November 18-20, 2021. IEEE 2021, ISBN 978-1-6654-0870-7
- Nobuya Tachimori, Sakriani Sakti, Satoshi Nakamura:
Multi-Encoder Sequential Attention Network for Context-Aware Speech Recognition in Japanese Dialog Conversation. 1-6 - Md Mahbub E. Noor, Yen-Ju Lu, Syu-Siang Wang, Supratip Ghose, Chia-Yu Chang, Ryandhimas E. Zezario, Shafique Ahmed, Wei-Ho Chung, Yu Tsao, Hsin-Min Wang:
Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions. 7-12 - Abhayjeet Singh, Achuth Rao MV, Rakesh Vaideeswaran, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
A Study on Native American English Speech Recognition by Indian Listeners with Varying Word Familiarity Level. 13-18 - Dinda Yora Islami, Dessi Puji Lestari:
Speech Recognition System for Writing Dentist Medical Records. 19-25 - Erbaz Khan, Sahar Rauf, Farah Adeeba, Sarmad Hussain:
A Multi-Genre Urdu Broadcast Speech Recognition System. 25-30 - Amita Dev, Shweta A. Bansal, Shyam S. Agrawal:
An Empirical Study of Speaker Identification System for Mono and Traverse Linguistic Background Using EM and SMEM. 31-36 - Keisuke Toyoda, Yusuke Kimura, Mingxin Zhang, Kent Hino, Kosuke Mori, Takahiro Shinozaki:
Self-Supervised Spoken Question Understanding and Speaking with Automatic Vocabulary Learning. 37-42 - Hikaru Oishi, Mika Enomoto, Keiko Ochi, Yasunari Obuchi:
Design and Basic Analysis of the TUT Emotional Storytelling Corpus. 43-48 - Xinyi Zhang, Wenhuan Lu, Xinyue Zhao, Yi Zhu, Jianguo Wei:
Construction and Analysis of Tibetan AMDO Dialect Speech Dataset for Speech Synthesis. 49-52 - Aijun Li, Ziyu Xiong:
Into-Cass: A Corpus for the Study of Intonation and Prosody in Chinese Dialects and Ethnic Languages. 53-58 - Jue Yu, Qianwen Jin:
Discourse Timing in Children's Rhyme Speech Produced by Prelingually Deaf Mandarin-Speaking Children with Cochlear Implants. 59-64 - Ronald John Cabatic, Angelica H. De La Cruz:
Towards the Development of Segment Level Speech Overlap Detection Using Convolutional Neural Network. 65-69 - Taiga Mori, Kristiina Jokinen, Yasuharu Den:
On The Use of Gestures in Dialogue Breakdown Detection. 70-75 - Ian Michael Urriza, Maria Art Antonette D. Clariño:
Aspect-Based Sentiment Analysis of User Created Game Reviews. 76-81 - Yizhou Lan, Tongtong Xie:
L2 Accent and Intelligibility by Chinese L2 Speakers of English. 82-87 - Tong Li, Hui Feng:
A Study on English Word-Final Coronal Stop Deletion by Chinese EFL Learners. 88-93 - Qianxi Yu, Ping Tang:
The Role of High Variability Phonetic Training on Chinese EFL Learners' Perception of English Vowels in Noisy Environment. 94-99 - Yanan Shen, Ping Tang:
The Effect of Overnight Consolidation on English Vowel Perception by Chinese Learners After High Speaker Variability Phonetic Training. 100-105 - Chia-Wei Chuang:
Mandarin Speakers' Acquisitions and Representations of Flapping in American English in An ESL Context: A Perception and Production Study. 106-110 - Yuan Jia, Bin Li:
Tonal Patterns of Tri-Syllabic Words in the Production of Standard Chinese of Bilingual Teachers. 111-115 - Tilak Purohit, Tejas Umesh, Shankar Narayanan, Minulakshmi S, Prasanta Kumar Ghosh:
SPIRE VCV: An Acoustic-Articulatory Corpus with Three Different Speaking Rates. 116-121 - Soky Kak, Masato Mimura, Tatsuya Kawahara, Sheng Li, Chenchen Ding, Chenhui Chu, Sethserey Sam:
Khmer Speech Translation Corpus of the Extraordinary Chambers in the Courts of Cambodia (ECCC). 122-127 - Xinyuan Qian, Bidisha Sharma, Amine El Abridi, Haizhou Li:
SLoClas: A Database for Joint Sound Localization and Classification. 128-133 - Shinnosuke Isobe, Ryuichi Hirose, Takumi Nishiwaki, Tomohiro Hattori, Satoshi Tamura, Yuuto Gotoh, Masaki Nose:
GAMVA: A Japanese Audio-Visual Multi-Angle Speech Corpus. 134-139 - Tiankai Zhi, Ying Shi, Wenqiang Du, Guanyu Li, Dong Wang:
M2ASR-MONGO: A Free Mongolian Speech Database and Accompanied Baselines. 140-145 - Bhavuk Singhal, Abinay Reddy Naini, Prasanta Kumar Ghosh:
wSPIRE: A Parallel Multi-Device Corpus in Neutral and Whispered Speech. 146-151 - Xuefei Liu, Jianhua Tao, Yurong Han, Chenglong Wang, Xueying Zheng, Zhengqi Wen:
Which Phonemes Will Distinguish the Different Regions Within the Same Dialect? 152-157 - Huaijin Deng, Takehito Utsuro, Akio Kobayashi, Hiromitsu Nishizaki:
Comparison of Static and Time-Sequential Features in Automatic Fluency Detection of Spontaneous Speech. 158-163 - Michiko Watanabe, Yuma Shirahata, Ralph Rose, Kikuo Maekawa:
How Do Speakers Pause and Hesitate in English and Japanese? - A Comparison Using Parallel Corpora of English and Japanese Presentation Speeches -. 164-167 - Jooyoung Lee, Kyungwha Kim, Minhwa Chung:
Korean Dialect Identification Based on Intonation Modeling. 168-173 - Quang Tien Duong, Van Hai Do:
Development of Accent Recognition Systems for Vietnamese Speech. 174-179 - Dac-Thang Hoang, Tat-Thang Vu:
A Blind Method for Phone Segmentation and Its Evaluation on Vietnamese Speech Corpus. 180-185 - Ryo Fukuda, Sashi Novitasari, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Tomoya Yanagita, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Simultaneous Speech-to-Speech Translation System with Transformer-Based Incremental ASR, MT, and TTS. 186-192 - Rui Jiang, Chengsi Chen, Xin Shan, Hongwu Yang:
Using Speech Enhancement to Realize Speech Synthesis of Low-Resource Dungan Languages. 193-198 - Pham Ngoc Phuong, Chung Tran Quang, Quoc Truong Do, Mai Chi Luong:
A Study on Neural-Network-Based Text-to-Speech Adaptation Techniques for Vietnamese. 199-205 - Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura:
Using Local Phrase Dependency Structure Information in Neural Sequence-to-Sequence Speech Synthesis. 206-211 - Edsel Jedd Renovalles, Crisron Rudolf Lucas, Franz A. de Leon, Angelina Aquino, Izza Jalandoni:
Text-to-Speech Systems for Filipino Using Unit Selection and Deep Learning. 212-217 - Pongsathon Janyoi, Ausdang Thangthai:
Investigation of an Input Sequence on Thai Neural Sequence-to-Sequence Speech Synthesis. 218-223
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.