10. ISCSLP 2016: Tianjin, China
- 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016, Tianjin, China, October 17-20, 2016. IEEE 2016, ISBN 978-1-5090-4294-4
- Zhiying Huang, Shaofei Xue, Zhijie Yan, Li-Rong Dai:
Unsupervised speaker adaptation of BLSTM-RNN for LVCSR based on speaker code. 1-5
- Hao Ni, Jiangyan Yi, Zhengqi Wen, Bin Liu, Jianhua Tao:
Improving accented Mandarin speech recognition by using recurrent neural network based language model adaptation. 1-5
- Jinguang Zhang, Xiyu Wu, Jiangping Kong:
Tongue shape variation model for simulating Mandarin Chinese articulation. 1-5
- Jingyong Hou, Lei Xie, Zhonghua Fu:
Investigating neural network based query-by-example keyword spotting approach for personalized wake-up word detection in Mandarin Chinese. 1-5
- Cheng-Hsien Lin, Chung-Long You, Chen-Yu Chiang, Yih-Ru Wang, Sin-Horng Chen:
Rich prosodic information exploration on spontaneous Mandarin speech. 1-5
- Jing Wang, Yahui Shan, Shequan Jiang, Xiang Xie:
Microphone array speech denoising modeled by tensor filtering. 1-5
- Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu:
Long short-term memory recurrent neural network based segment features for music genre classification. 1-5
- Yuanyuan Zhao, Jie Li, Shuang Xu, Bo Xu:
Investigating gated recurrent neural networks for acoustic modeling. 1-5
- Biswajit Das, Ashish Panda:
Vector Taylor series expansion with auditory masking for noise robust speech recognition. 1-5
- Pengrui Wang, Jie Li, Bo Xu:
Applying connectionist temporal classification objective function to Chinese Mandarin speech recognition. 1-5
- Chen-Yu Chiang, Yu-Ping Hung, Guan-Ting Liou, Yih-Ru Wang:
Improvements on punctuation generation inspired linguistic features for Mandarin prosody generation. 1-5
- Shaofei Xue, Zhijie Yan, Zhiying Huang, Li-Rong Dai:
Rapid speaker adaptation based on D-code extracted from BLSTM-RNN in LVCSR. 1-5
- Jen-Tzung Chien, Alim Misbullah:
Deep long short-term memory networks for speech recognition. 1-5
- Wentao Gu, Jiao Yin, James J. Mahshie:
Categorical perception in two pairs of Mandarin tones among bimodal cochlear implanted children. 1-5
- Nan Chen, Changchun Bao, Feng Deng:
Speech enhancement with binaural cues derived from a priori codebook. 1-5
- Dean Luo, Wentao Gu, Ruxin Luo, Lixin Wang:
Investigation of the effects of automatic scoring technology on human raters' performances in L2 speech proficiency assessment. 1-5
- Sheng Zhang, Wu Guo, Guoping Hu:
First investigation of universal speech attributes for speaker verification. 1-5
- Lantian Li, Dong Wang, Chao Xing, Thomas Fang Zheng:
Max-margin metric learning for speaker recognition. 1-4
- Lantian Li, Chao Xing, Dong Wang, Kaimin Yu, Thomas Fang Zheng:
Binary speaker embedding. 1-4
- Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Dictionary update for NMF-based voice conversion using an encoder-decoder network. 1-5
- Yimeng Zhuang, Sibo Tong, Maofan Yin, Yanmin Qian, Kai Yu:
Multi-task joint-learning for robust voice activity detection. 1-5
- Peixin Chen, Wu Guo, Guoping Hu:
Digit-dependent local i-vector for text-prompted speaker verification with random digit sequences. 1-5
- Pengyu Cong, Chaomin Wang, Zhijie Ren, Huixin Wang, Yanmeng Wang, Junlan Feng:
Unsatisfied customer call detection with deep learning. 1-5
- Mengjie Qian, Ian McLoughlin, Wu Guo, Li-Rong Dai:
Mismatched training data enhancement for automatic recognition of children's speech using DNN-HMM. 1-5
- Nana Fan, Jun Du, Li-Rong Dai:
A regression approach to binaural speech segregation via deep neural network. 1-5
- Yang Liu, Naushin Nower, Shota Morita, Masashi Unoki:
Robust front-end for speech recognition by human and machine in noisy reverberant environments: The effect of phase information. 1-5
- Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji, Jinsong Zhang:
Pronunciation error detection using DNN articulatory model based on multi-lingual and multi-task learning. 1-5
- Chao-yu Su, Chiu-yu Tseng:
L1/L2 difference in phonological sensitivity and information planning - Evidence from F0 patterns. 1-5
- Vinay Kumar Mittal:
Discriminating features of infant cry acoustic signal for automated detection of cause of crying. 1-5
- Changhao Shan, Lei Xie, Kaisheng Yao:
A bi-directional LSTM approach for polyphone disambiguation in Mandarin Chinese. 1-5
- Vinay Kumar Mittal, Anil Kumar Vuppala:
Significance of automatic detection of vowel regions for automatic shout detection in continuous speech. 1-5
- Vinay Kumar Mittal, B. Yegnanarayana:
A sparse representation of the excitation source characteristics of nonnormal speech sounds. 1-5
- Peng Song, Xinran Zhang, Shifeng Ou, Jingjing Liu, Yanwei Yu, Wenming Zheng:
Cross-corpus speech emotion recognition using transfer semi-supervised discriminant analysis. 1-5
- Zhong-Hua Fu:
Interferences suppression using two closely-spaced microphones. 1-5
- Wai-Kim Leung, Jia Jia, Yu-Hao Wu, Jiayu Long, Lianhong Cai:
THear: Development of a mobile multimodal audiometry application on a cross-platform framework. 1-5
- Wei Lai, Mark Liberman, Jiahong Yuan, Xiaoying Xu:
Prosodic strength intrinsic to lexical items: A corpus study on tone reduction in Tone4+Tone4 words in Mandarin Chinese. 1-5
- Junhua Liu, Zhen-Hua Ling, Si Wei, Guoping Hu, Li-Rong Dai:
Cluster-based senone selection for the efficient calculation of deep neural network acoustic models. 1-5
- Zhipeng Xie, Jun Du, Ian McLoughlin, Yong Xu, Feng Ma, Haikun Wang:
Deep neural network for robust speech recognition with auxiliary features from laser-Doppler vibrometer sensor. 1-5
- Minghui Wu, Marjoleine Sloos, Jeroen van de Weijer:
The perception of the English alveolar-velar nasal coda contrast by monolingual versus bilingual Chinese speakers. 1-5
- Liang He, Yao Tian, Yi Liu, Fang Dong, Weiqiang Zhang, Jia Liu:
A study of variational method for text-independent speaker recognition. 1-5
- Hsin-Ju Hsieh, Berlin Chen, Jeih-weih Hung:
Employing median filtering to enhance the complex-valued acoustic spectrograms in modulation domain for noise-robust speech recognition. 1-5
- Zhi-Ping Zhou, Zhen-Hua Ling:
DNN-based unit selection using frame-sized speech segments. 1-5
- Yong Feng, Xinyuan Cai, Ruifang Ji:
Evaluation of the deep nonlinear metric learning based speaker identification on the large scale of voiceprint corpus. 1-4
- Zhan Wang, Jian Yang, Xin Yang:
The design and implementation of HMM-based Dai speech synthesis. 1-5
- Lin Li, Wenhao Xu, Qingyang Hong, Feng Tong, Jinzhun Wu:
Classification between normal and adventitious lung sounds using deep neural network. 1-5
- Peng Shen, Xugang Lu, Hisashi Kawai:
Comparison of regularization constraints in deep neural network based speaker adaptation. 1-5
- Peng Shen, Xugang Lu, Hisashi Kawai:
Automatic acoustic segmentation in N-best list rescoring for lecture speech recognition. 1-5
- Junfeng Hou, Shiliang Zhang, Li-Rong Dai:
Learning FOFE based FNN-LMs with noise contrastive estimation and part-of-speech features. 1-5
- Ming Xiu, Camille Fauth, Béatrice Vaxelaire, Jean-François Rodier, Pierre-Philippe Volkmar, Rudolph Sock:
A post-thyroidectomy voice quality study in patients suffering or not from recurrent laryngeal paralysis. 1-4
- Helen Kai-Yun Chen, Wei-te Fang, Chiu-yu Tseng:
Advance prosodic indexing - Acoustic realization of prompted information projection in continuous speeches and discourses. 1-5
- Hao Zhang, Fei Chen, Nan Yan, Lan Wang, Yu Chen, Feng Shi:
The effects of tone categories on the perception of Mandarin vowels. 1-5
- Bogu Li, Zhilei Liu, Jianwu Dang:
Study on the relation of fundamental and formant frequencies for affective speech synthesis. 1-5
- Jing Li, Kiyoshi Honda, Ju Zhang, Jianguo Wei:
Individual difference and acoustic effect of female laryngeal cavities. 1-5
- Yi-Ting Chen, Tzu-Hao Chen, Mao-Chang Huang, Tai-Shih Chi:
Interaural coherence induced ideal binary mask for binaural speech separation and dereverberation. 1-5
- Zhan Shen, Jianguo Wei, Wenhuan Lu, Jianwu Dang:
Voice activity detection based on sequential Gaussian mixture model with maximum likelihood criterion. 1-5
- Xueyang Wu, Su Zhu, Yue Wu, Kai Yu:
Rich punctuations prediction using large-scale deep learning. 1-5
- Sheng Li, Xugang Lu, Shinsuke Mori, Yuya Akita, Tatsuya Kawahara:
Confidence estimation for speech recognition systems using conditional random fields trained with partially annotated data. 1-5
- Jiangyan Yi, Hao Ni, Zhengqi Wen, Bin Liu, Jianhua Tao:
CTC regularized model adaptation for improving LSTM RNN based multi-accent Mandarin speech recognition. 1-5
- Xiao Wang, Gang Peng:
Cantonese spoken word retention by speakers with and without congenital amusia: Implications from phonological similarity and cognitive load effects. 1-5
- Huangmei Liu, Jie Liang:
Vowels as acoustic cues for sub-dialect identification in Chinese. 1-5
- Tianxing He, Yu Zhang, Jasha Droppo, Kai Yu:
On training bi-directional neural network language model with noise contrastive estimation. 1-5
- Congcong Zhang, Kiyoshi Honda, Ju Zhang, Jianguo Wei:
Contributions of the piriform fossa of female speakers to vowel spectra. 1-5
- Yibin Zheng, Ya Li, Zhengqi Wen, Bin Liu, Jianhua Tao:
Text-based sentential stress prediction using continuous lexical embedding for Mandarin speech synthesis. 1-5
- Yibin Zheng, Ya Li, Zhengqi Wen, Bin Liu, Jianhua Tao:
Investigating deep neural network adaptation for generating exclamatory and interrogative speech in Mandarin. 1-5
- Lin Li, Jiawen Wu, Xinghao Ding, Qingyang Hong, Delu Zeng:
Speech enhancement based on nonparametric factor analysis. 1-5
- Leyuan Qu, Yanlu Xie, Jinsong Zhang:
Senone log-likelihood ratios based articulatory features in pronunciation erroneous tendency detecting. 1-5
- Michael C. W. Yip:
Recognition of spoken words in L2 speech using L1 probabilistic phonotactics: Evidence from Cantonese-English bilinguals. 1-4
- Ying-Hui Lai, Syu-Siang Wang, Yu-Ting Su, Cheng Han-Che, Fan Kang Fu, Yu Tsao:
Improving the performance of speech perception in noisy environment based on an FAME strategy. 1-5
- Liang Zhang, Yuan Jia, Aijun Li:
An interface research on rhetorical structure and prosody features in Chinese reading texts. 1-5
- Yanhui Tu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A speaker-dependent deep learning approach to joint speech separation and acoustic modeling for multi-talker automatic speech recognition. 1-5
- Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai:
A pseudo-task design in multi-task learning deep neural network for speaker recognition. 1-5
- Licheng Liu, Yan Ji, Hongcui Wang, Bruce Denby:
Comparison of DCT and autoencoder-based features for DNN-HMM multimodal silent speech recognition. 1-5
- Zhaofeng Zhang, Xiong Xiao, Longbiao Wang, Jianwu Dang, Masahiro Iwahashi, Eng Siong Chng, Haizhou Li:
Multi-channel feature adaptation for robust speech recognition. 1-5
- Wei Rao, Xiong Xiao, Chenglin Xu, Haihua Xu, Kong-Aik Lee, Eng Siong Chng, Haizhou Li:
Neural networks based channel compensation for i-vector speaker verification. 1-5
- Peidong Guo, Heyan Huang, Ping Jian, Yuhang Guo:
Prosodic annotation enriched statistical machine translation. 1-5
- Ming-Hsiang Su, Chung-Hsien Wu, Kun-Yi Huang, Tsung-Hsien Yang, Tsui-Ching Huang:
Dialog state tracking for interview coaching using two-level LSTM. 1-5
- Tsung-Hsien Yang, Chung-Hsien Wu, Kun-Yi Huang, Ming-Hsiang Su:
Detection of mood disorder using speech emotion profiles and LSTM. 1-5
- Quan Zhou, Yu Chen, Yanting Chen, Hao Zhang, Jianguo Wei, Jianwu Dang:
Tongue performance in articulating Mandarin apical syllables by prelingual deaf adults using ultrasonic technology: Two case studies. 1-5
- Zhengqi Wen, Kehuang Li, Zhen Huang, Jianhua Tao, Chin-Hui Lee:
Learning auxiliary categorical information for speech synthesis based on deep and recurrent neural networks. 1-5
- Siyuan Feng, Tan Lee, Haipeng Wang:
Exploiting language-mismatched phoneme recognizers for unsupervised acoustic modeling. 1-5
- Yuke Si, Jianwu Dang, Gaoyan Zhang:
Investigation of the spatiotemporal dynamics of the brain during perceiving words. 1-5
- Ju Zhang, Kiyoshi Honda, Jianguo Wei, Jianrong Wang, Jianwu Dang:
Spatial co-variation of lip and tongue at strong and weak syllables. 1-5
- Ying Qin, Tan Lee, Anthony Pak-Hin Kong, Sam-Po Law:
Towards automatic assessment of aphasia speech using automatic speech recognition techniques. 1-4
- Da Zheng, Zhehuai Chen, Yue Wu, Kai Yu:
Directed automatic speech transcription error correction using bidirectional LSTM. 1-5
- Jian Li, Hongcui Wang, Longbiao Wang, Jianwu Dang, Kuntharrgyal Khyuru, Gyaltsen Lobsang:
Exploring tonal information for Lhasa dialect acoustic modeling. 1-5
- Zhaoqiong Huang, Ge Zhan, Dongwen Ying, Ruohua Zhou, Jielin Pan, Yonghong Yan:
Robust multiple speech source localization based on phase difference regression. 1-5
- Yingke Zhu, Brian Mak:
An investigation of adaptation techniques for building acoustic models for hearing-impaired children in a CAPT application. 1-5
- Na Hu, Pengfei Shao, Yiqing Zu, Zuyan Wang, Wei Huang, Shijin Wang:
Discourse prosody and its application to speech synthesis. 1-5
- Ying Zhou, Fei Chen, Hui Chen, Lan Wang, Nan Yan:
Evaluation of a multimodal 3-D pronunciation tutor for learning Mandarin as a second language: An eye-tracking study. 1-5
- Viet-Hang Duong, Yuan-Shan Lee, Bach-Tung Pham, Seksan Mathulaprangsan, Pham The Bao, Jia-Ching Wang:
Spatial dispersion constrained NMF for monaural source separation. 1-4
- Jun Yu, Rongfeng Su, Lan Wang, Wenpeng Zhou:
A multi-channel/multi-speaker interactive 3D audio-visual speech corpus in Mandarin. 1-5
- Ying Chen, Li Liu, Xueqin Zhao:
Perceptual evaluation of natural and synthesized speech with prosodic focus in Mandarin production of American learners. 1-5
- Jian Kang, Wei-Qiang Zhang, Jia Liu:
Lattice based transcription loss for end-to-end speech recognition. 1-5
- Jian Kang, Wei-Qiang Zhang, Jia Liu:
Gated recurrent units based hybrid acoustic models for robust speech recognition. 1-5
- Kaile Zhang, Yonghong Li, Gang Peng:
Cognitive representation of phonological categories: The evidence from Mandarin speakers' learning of Cantonese tones. 1-5
- Na Zhao, Hongwu Yang:
Realizing speech to gesture conversion by keyword spotting. 1-5
- Wenzhi He, Nengheng Zheng, Qinglin Meng:
The effect of gain thresholds on speech intelligibility for statistical model based noise reduction for cochlear implants: A simulation based verification. 1-4
- Ye Bai, Jiangyan Yi, Hao Ni, Zhengqi Wen, Bin Liu, Ya Li, Jianhua Tao:
End-to-end keywords spotting based on connectionist temporal classification for Mandarin. 1-5
- Ge Zhan, Zhaoqiong Huang, Dongwen Ying, Jielin Pan, Yonghong Yan:
Improvement of mask-based speech source separation using DNN. 1-5
- Zhili Tan, Yingke Zhu, Man-Wai Mak, Brian Kan-Wing Mak:
Senone I-vectors for robust speaker verification. 1-5
- Ka-Ho Wong, Hoi Kiu Kristy Mok, Helen Meng:
Exploratory data analysis on nuclei in Cantonese dysarthric speech. 1-5
- Zhiping Zhang, Zhiqiang Wu:
An adaptive filter with gain and time-shift parameters for echo cancellation. 1-5
- Bin Zhao, Jianwu Dang, Gaoyan Zhang:
EEG evidence for a three-phase recurrent process during spoken word processing. 1-5
- Runnan Li, Zhiyong Wu, Helen M. Meng, Lianhong Cai:
DBLSTM-based multi-task learning for pitch transformation in voice conversion. 1-5
- Wei Shan, Keiichi Funaki:
F0 estimation of speech based on IRAPT using WLP-based TV-CAR analysis. 1-4
- Yanping Li, Yanlu Xie, Luoduo Feng, Jinsong Zhang:
The perceptual cues for nasal finals in standard Chinese. 1-5
- Dan Hu, Hui Feng, Tongyu Wu:
English stress acquisition by native speakers of Tibetan. 1-5
- Yue Chen, Yanlu Xie, Bin Wu, Jinsong Zhang:
A study on functional load of Chinese prosodic boundaries under reduction of syllable information. 1-5
- Yu Chen, Weifeng Kong, Yujie Chi, Yanting Chen, Jianguo Wei, Jianwu Dang:
The singing voice before and after vocal warm-up by students of Chinese national singing. 1-5
- Ju Lin, Yanlu Xie, Yingming Gao, Jinsong Zhang:
Improving Mandarin tone recognition based on DNN by combining acoustic and articulatory features. 1-5
- Lei Wang, Fei Chen:
Effects of background noise and tonal target stimulus on human auditory evoked potential. 1-4
- Yuan Jia:
The effect of information structure on the distribution of stress degree in Chinese reading texts. 1-7
- Chong Cao, Yanlu Xie, Ju Lin, Qian Li, Jinsong Zhang:
The preliminary study of influence on tone perception from segments. 1-5
- Shuju Shi, Chiharu Tsurutani, Xiaoli Feng, Jinsong Zhang, Nobuaki Minematsu:
Acoustic correlates and gender effects in production and perception of Japanese polite speech. 1-5
- Tianyan Zhou, Weicheng Cai, Xiaoyan Chen, Xiaobing Zou, Shilei Zhang, Ming Li:
Speaker diarization system for autism children's real-life audio data. 1-5
- Yali Liu, Zihou Meng:
The examination of the relationship between perception and production of Mandarin tone of Kazak students. 1-5
- Aihui Zhang, Hui Feng, Siyu Wang, Jianwu Dang:
Relationship between perception and production of English vowels by Chinese English learners. 1-5
- Zhipeng Chen, Ji Wu:
Exploiting noisy web data by OOV ranking for low-resource keyword search. 1-5
- Shuju Shi, Yanlu Xie, Xiaoli Feng, Jinsong Zhang:
Automatic detection of rhythmic patterns in native and L2 speech: Chinese, Japanese, and Japanese L2 Chinese. 1-4
- Huijun Ding, Chenxi Xie, Lei Zeng, Yang Xu, Guo Dan:
The correlation between signal distance and consonant pronunciation in Mandarin words. 1-5
- Xiaxia Zhang, Bei Wang:
Production and perception of focus in L2 Mandarin of Qiang speakers. 1-5
- Feiya Li, Yanlu Xie, Xiaomin Yu, Jinsong Zhang:
A study on perceptual training of Japanese CSL learners to discriminate Mandarin lexical tones. 1-5
- Ju Lin, Yanlu Xie, Wei Zhang, Jinsong Zhang:
Automatic Mandarin prosody boundary detecting based on tone nucleus features and DNN model. 1-5
- Yue Zhao, Rui Zhao, Xiaona Xu, Licheng Wu, Qiang Ji:
Phone recognition for Lhasa-Tibetan based on articulatory features augmentation learning. 1-4
- Xunan Huang, Caicai Zhang, Fei Chen, Jonathan Sieg, Lan Wang, Feng Shi:
Effects of preceding vocabulary context on the perception of Mandarin vowels. 1-5
- Dongdong Li, Jianyu Wang, Yingchun Yang:
PVD: A new pathological voice dataset for intra-speaker recognition research interest. 1-5
- Chia-Yung Hsu, Ryandhimas E. Zezario, Jia-Ching Wang, Chin-Wen Ho, Xugang Lu, Yu Tsao:
Incorporating local environment information with ensemble neural networks to robust automatic speech recognition. 1-5
- Yuan Jia, Aijun Li:
A linguistic annotation scheme of Chinese discourse structures and study of prosodic interactions. 1-5
- Ping Fan, Wentao Gu:
Prosodic cues in polite and rude Mandarin speech. 1-4
- Kaituo Xu, Lei Xie, Kaisheng Yao:
Investigating LSTM for punctuation prediction. 1-5
- Bijun Ling, Jie Liang:
The influence of syllable structure and prosodic strengthening on consonant production in Shanghai Chinese. 1-5
- Zhihua Xia, Qiu Wu Ma:
Gender and prosodic entrainment in Mandarin conversations. 1-4
- Lei Liu, Nan Huang, Wentao Gu:
Mandarin neutral tone by native speakers and Cantonese L2 learners. 1-5
- Feng Deng, Changchun Bao, Mao-shen Jia:
HMM-based cue parameters estimation for speech enhancement. 1-4
- Junfeng Li, Risheng Xia, Qiang Fang, Aijun Li, Yonghong Yan:
Speech intelligibility enhancement in noisy reverberant conditions. 1-5