


default search action
11th ISCSLP 2018: Taipei City, Taiwan
- 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018, Taipei City, Taiwan, November 26-29, 2018. IEEE 2018, ISBN 978-1-5386-5627-3

- Haiwei Wu, Ming Li, Zexin Cai, Haibin Zhong:

Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances. 1-5 - Yi-Yang Ding, Ya-Jun Hu, Zhen-Hua Ling:

GTDNN-Based Voice Conversion Using DAEs with Binary Distributed Hidden Units. 1-5 - Haikun Wang, Zhongfu Ye, Jingdong Chen:

A Front-End Speech Enhancement System for Robust Automotive Speech Recognition. 1-5 - Yupeng Shi, Weicong Rong, Nengheng Zheng:

Speech Enhancement using Convolutional Neural Network with Skip Connections. 6-10 - Bin Liu, Jianhua Tao, Yibin Zheng:

A Novel Unified Framework for Speech Enhancement and Bandwidth Extension Based on Jointly Trained Neural Networks. 11-15 - Shih-Kuang Lee

, Syu-Siang Wang, Yu Tsao
, Jeih-weih Hung:
Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via DiscreteWavelet Transform. 16-20 - Quandong Wang, Sicheng Wang, Fengpei Ge, Chang Woo Han, Jaewon Lee, Lianghao Guo, Chin-Hui Lee:

Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech. 21-25 - Cunhang Fan, Bin Liu, Jianhua Tao, Zhengqi Wen, Jiangyan Yi, Ye Bai:

Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation. 26-30 - Xiaoyong Lu, Yanqin Li, Hongwu Yang

:
A Method for Emotional Speech Synthesis Based on Speaker Adaptive Training. 31-35 - Xurong Xie, Xunying Liu, Tan Lee

, Lan Wang:
Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion. 36-40 - Weizhao Zhang, Hongwu Yang

, Pengpeng Zhi:
Emotional speech synthesis based on DNN and PAD emotional state model. 41-45 - Lijia Chen, Hongwu Yang

, Hui Wang:
Research on Dungan speech synthesis based on Deep Neural Network. 46-50 - Wen-Chin Huang, Hsin-Te Hwang, Yu-Huai Peng, Yu Tsao

, Hsin-Min Wang
:
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders. 51-55 - Feng-Long Xie, Frank K. Soong, Xi Wang, Lei He, Haifeng Li:

Frame Selection in SI-DNN Phonetic Space with WaveNet Vocoder for Voice Conversion without Parallel Training Data. 56-60 - Yuanyuan Liu

, Ying Qin, Siyuan Feng
, Tan Lee
, P. C. Ching:
Disordered Speech Assessment Using Kullback-Leibler Divergence Features with Multi-Task Acoustic Modeling. 61-65 - Ying Qin, Tan Lee

, Yuzhong Wu, Anthony Pak-Hin Kong
:
An End-to-End Approach to Automatic Speech Assessment for People with Aphasia. 66-70 - Yahui Shan, Jing Wang, Xiang Xie, Liuchen Meng, Jingming Kuang:

Non-intrusive Speech Quality Assessment Using Deep Belief Network and Backpropagation Neural Network. 71-75 - Xin Wang

, Jun Du, Lei Sun, Qing Wang, Chin-Hui Lee:
A Progressive Deep Learning Approach to Child Speech Separation. 76-80 - Jen-Tzung Chien

, Kai-Wei Tsou:
Convolutional Neural Turing Machine for Speech Separation. 81-85 - Danyang Liu, Xinxin Wan, Ji Xu, Pengyuan Zhang:

Multilingual Speech Recognition Training and Adaptation with Language-Specific Gate Units. 86-90 - Yuan Jia, Cuiping Li:

Acquisition of English Tense-lax Vowels by Chinese EFL Learners from Different Dialectal Regions. 91-95 - Yuan Jia, Huimin Zhang:

An Acoustic Study of English Monophthongs Acquisition by Chinese EFL Learners from Northeast Region. 96-100 - Yuan Jia, Xinyin Sun:

Chinese EFL Learners' Acquisition of English Monophthongs-A Typological Study of Fuzhou, Ningbo, and Beijing. 101-105 - Bin Li, Yuan Jia:

An Empirical Study of English Vowels Acquisition of EFL Learners in Tianjin and Zibo. 106-110 - Jingyong Hou, Wenping Hu, Frank K. Soong, Lei Xie:

A Refined Query-by-Example Approach to Spoken-Term-Detection on ESL learners' Speech. 111-115 - Wei Wang, Wei Wei, Yanlu Xie, Minghao Guo, Jinsong Zhang

:
Improve the Accuracy of Non-native Speech Annotation with a Semi-automatic Approach. 116-120 - Peiyao Sheng, Zhuolin Yang, Hu Hu, Tian Tan, Yanmin Qian:

Data Augmentation using Conditional Generative Adversarial Networks for Robust Speech Recognition. 121-125 - Jie Li, Yahui Shan, Xiaorui Wang, Yan Li:

Improving Gated Recurrent Unit Based Acoustic Modeling with Batch Normalization and Enlarged Context. 126-130 - Yuan-Fu Liao, Matús Pleva

, Daniel Hládek
, Ján Stas, Peter Viszlay, Martin Lojka, Jozef Juhár
:
Gated Module Neural Network for Multilingual Speech Recognition. 131-135 - Lahiru Samarakoon, Brian Mak

, Albert Y. S. Lam:
Subspace Based Sequence Discriminative Training of LSTM Acoustic Models with Feed-Forward Layers. 136-140 - Hengguan Huang, Brian Mak

:
WaveNet MH-SRU: Deep and Wide Multiple-history Simple Recurrent Unit for Speech Recognition. 141-145 - Zhangyu Xiao, Zhijian Ou, Wei Chu, Hui Lin:

Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units. 146-150 - Srinivas Kantheti, Rohan Kumar Das

, Hemant A. Patil:
Combining Phase-based Features for Replay Spoof Detection System. 151-155 - Meng Ge, Longbiao Wang, Seiichi Nakagawa, Yuta Kawakami, Jianwu Dang, Xiangang Li:

Pitch Synchronized Relative Phase with Peak Error Detection For Noise-robust Speaker Recognition. 156-160 - Lei Wang, Fei Chen:

Visual Information Affects Auditory Frequency Discrimination with Random Stimulus Sequences: Evidence from ERPs. 161-164 - Di Zhou

, Jinfeng Huang
, Jianwu Dang:
Investigation of the Comprehension Process during Silent Reading based on Eye Movements. 165-169 - Ju Lin, Wei Zhang, Linxuan Wei, Yanlu Xie, Jinsong Zhang

:
A Multi-modal Soft Targets Approach for Pronunciation Erroneous Tendency Detection. 170-174 - Zhenyu Wang, Qi Zhang, Shuang Zheng, Jinsong Zhang

, Yanlu Xie:
A Study on Landmark Verification of Mandarin Alveolar-palatal Consonants. 175-179 - Jinghua Zhong, Helen Meng:

DNN i-vector based Fishervoice and PLDA SVM scoring for NIST SRE 2016. 180-184 - Madhu R. Kamble, Hemant A. Patil:

Novel Amplitude Weighted Frequency Modulation Features for Replay Spoof Detection. 185-189 - Yutian Li, Feng Gao, Zhijian Ou, Jiasong Sun:

Angular Softmax Loss for End-to-end Speaker Verification. 190-194 - Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:

Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. 195-199 - Yi Liu, Liang He

, Weiwei Liu, Jia Liu:
Exploring a Unified Attention-Based Pooling Framework for Speaker Verification. 200-204 - Yexin Yang, Shuai Wang, Man Sun, Yanmin Qian, Kai Yu:

Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification. 205-209 - Long Zhang, Jia Jia, Fanbo Meng, Suping Zhou, Wei Chen, Cunjun Zhang, Runnan Li:

Emphasis Detection for Voice Dialogue Applications Using Multi-channel Convolutional Bidirectional Long Short-Term Memory Network. 210-214 - Yueheng Li, Biao Luo:

Topic and Prosody Interaction in Chinese Discourse. 215-219 - Xuanda Chen, Yuan Jia, Ziyu Xiong:

Measuring Prosodic Transfer in Vector Space by Weighted Tonal Events. 220-224 - Fang Yu, Chin-Tuan Tan

, Fei Chen:
An ERP Study to Evaluate the Quality of Speech Processed by Wiener Filtering. 225-229 - Yongwei Li, Ken-Ichi Sakakibara, Masato Akagi:

Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model. 230-234 - Zexin Cai, Xiaoyi Qin, Danwei Cai, Ming Li, Xinzhong Liu, Haibin Zhong:

The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion. 235-239 - Ivan Fung, Brian Mak

:
Multi-Head Attention for End-to-End Neural Machine Translation. 250-254 - Zhaoheng Ni, Rutuja Ubale, Yao Qian, Michael I. Mandel, Su-Youn Yoon, Abhinav Misra, David Suendermann-Oeft:

Unusable Spoken Response Detection with BLSTM Neural Networks. 255-259 - Mu Wang, Zhiyong Wu, Shiyin Kang, Xixin Wu, Jia Jia, Dan Su, Dong Yu, Helen Meng:

Speech Super-Resolution Using Parallel WaveNet. 260-264 - Kun-Yi Huang, Chung-Hsien Wu

, Qian-Bei Hong, Ming-Hsiang Su, Yuan-Rong Zeng:
Speech Emotion Recognition using Convolutional Neural Network with Audio Word-based Embedding. 265-269 - Yuan-Fu Liao, Wu-Hua Hsu, Yu-Chen Lin, Yung-Hsiang Shawn Chang, Matús Pleva

, Jozef Juhár
, Guang-Feng Deng:
Formosa Speech Recognition Challenge 2018: Data, Plan and Baselines. 270-274 - Ye Bai, Jianhua Tao, Jiangyan Yi, Zhengqi Wen, Cunhang Fan:

CLMAD: A Chinese Language Model Adaptation Dataset. 275-279 - Yao Qian, Rutuja Ubale, Patrick L. Lange, Keelan Evanini, Frank K. Soong:

From Speech Signals to Semantics - Tagging Performance at Acoustic, Phonetic and Word Levels. 280-284 - Minglu Liu, Miao Li

, Ji Wu, Xiangling Fu, Ji Gao:
Using Dempster-Shafer Evidence Theory for Dialog State Tracking. 285-289 - Yuanyuan Liu

, Tan Lee
, Thomas K. T. Law, Kathy Y. S. Lee
, P. C. Ching:
Prediction of Voice Disorder Severity: Contributions from Sustained Vowels and Continuous Speech. 290-294 - Qing Wang, Jun Du, Li Chai, Li-Rong Dai, Chin-Hui Lee:

A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network. 295-299 - Hongcui Wang, Dongxiao He, Jianwu Dang, Xi Liang:

Manifold-based incremental community detection method for online speaker identification. 300-303 - Ruifang Ji, Junhua Cao, Xinyuan Cai, Bo Xu:

Max Margin Cosine Loss for Speaker Identification on Short Utterances. 304-308 - Minxian Zhu, Xiang Xie, Liqiang Zhang, Jing Wang:

Automatic Personality Perception from Speech in Mandarin. 309-313 - Shengyu Yao, Houjun Huang, Ruohua Zhou

, Yonghong Yan:
Text-dependent Speaker Verification Using Word-based Scoring. 314-318 - Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li:

End-to-end Language Identification using NetFV and NetVLAD. 319-323 - Meghna Pandharipande, Rupayan Chakraborty

, Ashish Panda, Sunil Kumar Kopparapu
:
Robust Front-End Processing For Emotion Recognition In Noisy Speech. 324-328 - Meng Liu, Longbiao Wang, Zeyan Oo, Jianwu Dang, Dongbo Li, Seiichi Nakagawa:

Replay Attacks Detection Using Phase and Magnitude Features with Various Frequency Resolutions. 329-333 - Madhu R. Kamble, Hemlata Tak, Maddala Venkata Siva Krishna, Hemant A. Patil:

Novel Demodulation-Based Features using Classifier-level Fusion of GMM and CNN for Replay Detection. 334-338 - Liang Zhang, Aijun Li, Yingyi Luo:

Chinese Causal Relation: Conjunction, Order and Focus-to-Stress Assignment. 339-343 - Tianyu Liang

, Xianhong Chen, Can Xu, Liang He
:
Parallel Double Audio Fingerprinting. 344-348 - Wei Zhang, Qi Zhang, Yanlu Xie, Jinsong Zhang

:
LSTM-Based Pitch Range Estimation from Spectral Information of Brief Speech Input. 349-353 - Yue Sun, Manwa L. Ng, Chongyuan Lian, Lan Wang, Feng Yang, Nan Yan:

Acoustic and Kinematic Examination of Dysarthria in Cantonese Patients of Parkinson's Disease. 354-358 - Sonal Joshi

, Ashish Panda, Biswajit Das:
Enhanced Denoising Auto-Encoder for Robust Speech Recognition in Unseen Noise Conditions. 359-363 - Gaofeng Cheng, Lu Huang, Jiasong Sun, Yonghong Yan:

Bidirectional LSTM with Extended Input Context. 364-368 - Wei Zou, Dongwei Jiang, Shuaijiang Zhao, Guilin Yang, Xiangang Li:

Comparable Study Of Modeling Units For End-To-End Mandarin Speech Recognition. 369-373 - Yiyan Wang, Yanhua Long:

Keyword Spotting Based On CTC and RNN For Mandarin Chinese Speech. 374-378 - Long Wu, Li Wang, Pengyuan Zhang, Ta Li, Yonghong Yan:

Space-Time Residual LSTM Architechture for Distant Speech Recognition. 379-383 - Dongwei Jiang, Wei Zou, Shuaijiang Zhao, Guilin Yang, Xiangang Li:

An Analysis of Decoding for Attention-Based End-to-End Mandarin Speech Recognition. 384-388 - Jiarui Wang, Si Ioi Ng, Dehua Tao, Wing Yee Ng, Tan Lee

:
A Study on Acoustic Modeling for Child Speech Based on Multi-Task Learning. 389-393 - Dongbo Li, Longbiao Wang, Jianwu Dang, Meng Ge, Haotian Guan:

Distant-talking Speech Recognition Based on Multi-objective Learning using Phase and Magnitude-based Feature. 394-398 - Shuaishuai Ye, Ting Jiang, Shan Qin, Weixia Zou, Chengyun Deng:

Speech Enhancement Based on A New Architecture of Wasserstein Generative Adversarial Networks. 399-403 - Hengshun Zhou, Xue Bai, Jun Du:

An Investigation of Transfer Learning Mechanism for Acoustic Scene Classification. 404-408 - Junhao Ding, Bin Ren, Nengheng Zheng:

Microphone Array Acoustic Source Localization system based on Deep Learning. 409-413 - Chang Liu, Yike Zhang, Pengyuan Zhang, Yaofeng Wang:

Evaluating Modeling Units and Sub-word Features in Language Models for Turkish ASR. 414-418 - Jiyuan Zhang, Dong Wang:

Chinese Poetry Generation with Flexible Styles. 419-423 - Chao-yu Su, Chiu-yu Tseng:

Perceivable information structure in discourse prosody-Detecting prominent prosodic words in spoken discourse using F0 contour. 424-428 - Chunyu Ge, Aijun Li:

Declination and boundary effect in Cantonese declarative sentence. 429-433 - Xinyi Wen, Yuan Jia, Aijun Li:

Interaction of Syntax, Semantics and Pragmatics on Discourse Prosody in Standard Chinese. 434-438 - Wei Zhang, Yanlu Xie, Jinsong Zhang

:
A Preliminary Study on Quantitative Calculation of Prosodic Strength in Mandarin Speech. 439-443 - Zhenyu Wang, Jinsong Zhang

, Yanlu Xie:
L2 Mispronunciation Verification Based on Acoustic Phone Embedding and Siamese Networks. 444-448 - Lei Liu, Xuemei Zhai, Wentao Gu

:
Comparing Mandarin Lexical Stress Produced by Native Speakers and L2 Learners in Hong Kong. 449-453 - Ziyu Xiong, Maolin Wang:

A study on the pitch realization of focus in Chinese. 454-457 - Ziyu Xiong, Maolin Wang:

Effect of Anticipatory Vowel-to-Vowel Coarticulation at Different Prosodic Boundaries in Chinese. 458-462 - Wai-Sum Lee, Yueh-Chin Chang, Feng-fan Hsieh

:
Co-articulation between Consonant and Vowel in Cantonese and Taiwanese CVC Syllables. 463-467 - Qian Li, Yingyi Luo, Aijun Li:

Cross-Dialectal Perception of the Third-Tone Sandhi in Standard Chinese - Evidence from Eye Movements. 468-472 - Xin Li, Rene Kager

:
An Acoustic Comparison between Two Pairs of Assimilatory and Dissimilatory Tone Sandhi Processes in Nanjing Mandarin in Categoricalness/Gradience. 473-477 - Aijun Li:

Response Acts in Chinese Conversation: the Coding Scheme and Analysis. 478-482 - Jingdong Li, Hui Zhang, Rui Liu

, Xueliang Zhang, Feilong Bao:
End-to-End Mongolian Text-to-Speech System. 483-487 - Gan Huang, Lin Zhu, Aijun Li:

Syntactic Structure and Communicative Function of Echo Questions in Chinese Dialogues. 488-492 - Si Ioi Ng, Dehua Tao, Jiarui Wang, Yi Jiang, Wing Yee Ng, Tan Lee

:
An Automated Assessment Tool for Child Speech Disorders. 493-494 - Ji-Yan Han, Wei-Zhong Zheng, Ren-Jie Huang, Yu Tsao

, Ying-Hui Lai:
Hearing aids APP design based on deep learning technology. 495-496 - Wen-Huei Liao, Pei-Chun Li, Shuenn-Tsong Young, Ying-Hui Lai, Yu Tsao

:
IOS-based Ear Scale application for Clinical Audiology and Otology Usage. 497-498

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














