ISCSLP 2006


Technical Program

Summary of the Program

Dec 13 (WED)
  • From 0900: Registration
  • 1000--1200: Tutorial 1
  • 1330--1530: Tutorial 2
  • 1600--1630: Opening Ceremony
  • 1630--1730: Plenary 1
  • From 1800: Reception (HALL)

Dec 14 (THU)
  • 0830--0930: Plenary 2
  • 1000--1200: L1 (AUD), WP1 (BIG), L2 (ISS), EX (HALL)
  • 1330--1530: L3 (AUD), WP2 (BIG), SPE1 (ISS), P1 (HALL)
  • 1600--1800: L4 (AUD), WP3 (BIG), SPE2 (ISS), P2 (HALL)
  • From 1830: Banquet, Asia Europe Foundation

Dec 15 (FRI)
  • 0830--0930: Plenary 3 (AUD)
  • 1000--1200: L5 (AUD), L6 (BIG), WP4 (ISS), P3 (HALL)
  • 1330--1530: L7 (AUD), SPE3 (BIG), WP5 (ISS), P4 (HALL)
  • 1600--1800: L8 (AUD), SPE4 (BIG), WP6 (ISS), SC Mtg
  • From 1800: ISCA-SIG-CSLP General Assembly
  • From 1900: SC Business Dinner

Dec 16 (SAT)
  • 0830--0930: Plenary 4 (AUD)
  • 1000--1200: SPE5 (AUD), L9 (BIG)
  • 1200--1230: Closing Ceremony
  • From 1230: Lunch

Breaks
  • 0930--1000 and 1530--1600: Refreshment
  • 1200--1330: Lunch (Dec 13 to Dec 15)

Notations in the summary:

  • AUD: Auditorium (up to 400 seats)
  • BIG: The Big One (up to 60 seats)
  • ISS: ISS 3-03 (up to 50 seats)
  • HALL: Multipurpose Hall (up to 20 posters)
  • SPE: Special Session
  • L: Oral Session
  • P: Poster Session
  • EX: Exhibition Session
  • WP: Virtual Emotions Workshop Session
  • SC: Steering Committee

Summary of the Sessions

Tutorials and Plenaries

Special Sessions

Lecture Sessions

Poster Sessions

TUTORIALS

Tutorial 1 (10:00-12:00 Dec 13)
An HMM-Based Approach to Flexible Speech Synthesis
Keiichi Tokuda
Department of Computer Science and Engineering, Nagoya Institute of Technology

Tutorial 2 (13:30-15:30 Dec 13)
Text Information Extraction and Retrieval
Hang Li
Microsoft Research Asia

PLENARY TALKS

Plenary 1 (16:30-17:30 Dec 13)
Interactive Computer Aids for Acquiring Proficiency in Mandarin
Stephanie Seneff
Computer Science and Artificial Intelligence Laboratory (CSAIL), MIT

Plenary 2 (8:30-9:30 Dec 14)
The Affective and Pragmatic Coding of Prosody
Klaus R. Scherer
Swiss Center for Affective Sciences, University of Geneva, Switzerland

Plenary 3 (8:30-9:30 Dec 15)
Challenges in Machine Translation
Franz Josef Och
Google Research

Plenary 4 (8:30-9:30 Dec 16)
Automatic Indexing and Retrieval of Large Broadcast News Video Collections - the TRECVID Experience
Tat-Seng Chua
School of Computing, National University of Singapore

SPECIAL SESSIONS

SPE1 RICH INFORMATION ANNOTATION AND SPOKEN LANGUAGE PROCESSING
Time: 13:30-15:30 Dec 14

SPE1.1 - Nonlinear Emotional Prosody Generation and Annotation
Author(s): Jianhua Tao, Jian Yu, Yongguo Kang

SPE1.2 - Rhythmic Organization of Mandarin Utterances --- A Two-Stage Process
Author(s): Min Chu, Yunjia Wang

SPE1.3 - Contextual Maximum Entropy Model for Edit Disfluency Detection of Spontaneous Speech
Author(s): Jui-Feng Yeh, Chung-Hsien Wu, Wei-Yen Wu

* SPE1.4 - The Breath Segment in Expressive Speech
Author(s): Chu Yuan, Aijun Li

* SPE1.5 - Applying SFC Model for Chinese Expressive Speech Synthesis
Author(s): Bufan Zhang, Zhenhua Ling, Long Qin, Renhua Wang

SPE1.6 - HMM-Based Emotional Speech Synthesis using Average Emotion Model
Author(s): Long Qin, Zhen-Hua Ling, Yi-Jian Wu, Bu-Fan Zhang, Ren-Hua Wang

SPE2 SPEAKER RECOGNITION
Time: 16:00-18:00 Dec 14

SPE2.1 - CCC Speaker Recognition Evaluation 2006: Overview, Methods, Data, Results and Perspective
Author(s): Thomas Fang Zheng, Zhanjiang Song, Lihong Zhang, Michael Brasser, Wei Wu, Jing Deng

SPE2.2 - The IIR Submission to CSLP 2006 Speaker Recognition Evaluation
Author(s): Kong-Aik Lee, Hanwu Sun, Rong Tong, Bin Ma, Minghui Dong, Changhuai You, Donglai Zhu, Chin-Wei Eugene Koh, Lei Wang, Tomi Kinnunen, Eng-Siong Chng, Haizhou Li

SPE2.3 - A Novel Alternative Hypothesis Characterization Using Kernel Classifiers for LLR-based Speaker Verification
Author(s): Yi-Hsiang Chao, Hsin-Min Wang, Ruei-Chuan Chang

SPE2.4 - Speaker Verification Using Complementary Information from Vocal Source and Vocal Tract
Author(s): Nengheng Zheng, Ning Wang, Tan Lee, P. C. Ching

SPE2.5 - ISCSLP SR Evaluation, UVA--CSes System Description. A System Based on ANNs
Author(s): Carlos E. Vivaracho

SPE2.6 - Evaluation of EMD-based Speaker Recognition using ISCSLP2006 Chinese Speaker Recognition Evaluation Corpus
Author(s): Shingo Kuroiwa, Satoru Tsuge, Masahiko Kita, Fuji Ren

SPE3 MULTILINGUAL CORPUS DEVELOPMENT - I
Time: 13:30-15:30 Dec 15

SPE3.1 - The Contribution of Lexical Resources to Natural Language Processing of CJK Languages
Author(s): Jack Halpern

SPE3.2 - Multilingual Spoken Language Corpus Development for Communication Research
Author(s): Toshiyuki Takezawa

SPE3.3 - The Paradigm for Creating Multi-lingual Text-to-Speech Voice Databases
Author(s): Min Chu, Yong Zhao, Yining Chen, Lijuan Wang, Frank Soong

* SPE3.4 - Recent Advances of Speech Databases Development Activity for Indian Languages
Author(s): S. S. Agrawal, K. Samudravijaya, Karunesh Arora

SPE3.5 - HKUST/MTS: A Very Large Scale Mandarin Telephone Speech Corpus
Author(s): Yi Liu, Pascale Fung, Yongsheng Yang, Christopher Cieri, Shudong Huang, David Graff

SPE3.6 - Development of Multi-lingual Spoken Corpora of Indian Languages
Author(s): K Samudravijaya

SPE4 MULTILINGUAL CORPUS DEVELOPMENT - II
Time: 16:00-18:00 Dec 15

* SPE4.1 - Design of Vietnamese Speech Corpus and Current Status
Author(s): Chi Mai Luong, Ngoc Duc Dang

SPE4.2 - Multilingual Speech Corpora for TTS System Development
Author(s): Hsi-Chun Hsiao, Hsiu-Min Yu, Yih-Ru Wang, Sin-Horng Chen

SPE4.3 - Construct Trilingual Parallel Corpus on Demand
Author(s): Muyun Yang, Hongfei Jiang, Tiejun Zhao, Sheng Li

* SPE4.4 - Multilingual Text - Speech Corpus of Mongolian
Author(s): I. Dawa, Husal, Liu Yue, Yue Yao Ming, Uulang, Bai Shuang Cheng, Batsaihan, Y. Arai, M. Mitsunaga, H. Isahara, S. Nakamura

* SPE4.5 - Design of Cross-lingual and Multilingual Corpora for Speaker Recognition Research and Evaluation in Indian Languages
Author(s): Hemant A. Patil, S. Ghosh, A. Si, T. K. Basu

* SPE4.6 - Multi-lingual TTS Speech Corpus Development
Author(s): Yiqing Zu, Zhenhai Cao, Guilin Chen, Kesong Han, Peng Lu, Runqiang Yan, Kaizhi Wang, Zhenli Yu, Dongjian Yue, Aijun Li, Zhigang Yin

SPE5 ROBUST TECHNIQUES FOR ORGANIZING AND RETRIEVING SPOKEN DOCUMENTS
Time: 10:00-12:00 Dec 16

SPE5.1 - A Multi-layered Summarization System for Multi-media Archives by Understanding and Structuring of Chinese Spoken Documents
Author(s): Lin-shan Lee, Sheng-Yi Kong, Yi-Cheng Pan, Yi-Sheng Fu, Yu-Tsun Huang, Chien-Chih Wang

SPE5.2 - Extractive Chinese Spoken Document Summarization Using Probabilistic Ranking Models
Author(s): Yi-Ting Chen, Suhan Yu, Hsin-Min Wang, Berlin Chen

SPE5.3 - Initial Experiments on Automatic Story Segmentation in Chinese Spoken Documents using Lexical Cohesion of Extracted Named Entities
Author(s): Devon Li, Wai-Kit Lo, Helen Meng

* SPE5.4 - Syllable Based Audio Search Using Confusion Network Arc as Indexing Unit
Author(s): Jian Shao, Pengyuan Zhang, Jiang Han, Jun Yang, Yonghong Yan

SPE5.5 - Meeting Segmentation Using Two-Layer Cascaded Subband Filters
Author(s): Manuel Giuliani, Tin Lay Nwe, Haizhou Li

SPE5.6 - Speaker-and-environment Change Detection in Broadcast News using Maximum Divergence Common Component GMM
Author(s): Yih-Ru Wang

LECTURE SESSIONS

L1 ROBUST SPEECH RECOGNITION
Time: 10:00-12:00 Dec 14

L1.1 - Vector Autoregressive Model for Missing Feature Reconstruction
Author(s): Xiong Xiao, Haizhou Li, Eng-Siong Chng

L1.2 - Auditory Contrast Spectrum for Robust Speech Recognition
Author(s): Xugang Lu, Jianwu Dang

L1.3 - Signal Trajectory Based Noise Compensation for Robust Speech Recognition
Author(s): Zhi-Jie Yan, Jian-Lai Zhou, Frank Soong, Ren-Hua Wang

L1.4 - An HMM Compensation Approach Using Unscented Transformation For Noisy Speech Recognition
Author(s): Yu Hu, Qiang Huo

L1.5 - Noisy Speech Recognition Performance of Discriminative HMMs
Author(s): Jun Du, Peng Liu, Frank Soong, Jian-Lai Zhou, Ren-Hua Wang

L1.6 - Distributed Speech Recognition of Mandarin Digits String
Author(s): Yih-Ru Wang, Bo-Xuan Lu, Yuan-Fu Liao, Sin-Horng Chen

L2 SPEECH ANALYSIS AND ENHANCEMENT
Time: 10:00-12:00 Dec 14

L2.1 - A Robust Voice Activity Detection based on Noise Eigenspace Projection
Author(s): Dongwen Ying, Yu Shi, Frank Soong, Jianwu Dang, Xugang Lu

L2.2 - Pitch Mean Based Frequency Warping
Author(s): Jian Liu, Thomas Fang Zheng, Wenhu Wu

L2.3 - A Study of Knowledge-based Features for Obstruent Detection and Classification in Continuous Mandarin Speech
Author(s): Kuang-Ting Sung, Hsiao-Chuan Wang

L2.4 - Adaptive Null-Forming Algorithm with Auditory Sub-bands
Author(s): Heng Zhang, Qiang Fu, Yonghong Yan

L2.5 - Multi-channel Noise Reduction in Noisy Environments
Author(s): Junfeng Li, Masato Akagi, Yoiti Suzuki

L2.6 - A Minimum Boundary Error Framework for Automatic Phonetic Segmentation
Author(s): Jen-Wei Kuo, Hsin-Min Wang

L3 LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
Time: 13:30-15:30 Dec 14

L3.1 - Advances in Mandarin Broadcast Speech Transcription at IBM under the DARPA GALE Program
Author(s): Yong Qin, Qin Shi, Yi Y. Liu, Hagai Aronowitz, Stephen M. Chu, Hong-Kwang Kuo, Geoffrey Zweig

L3.2 - Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-based Consensus Networks
Author(s): Yi-Sheng Fu, Yi-Cheng Pan, Lin-shan Lee

L3.3 - All-Path Decoding Algorithm for Segmental Based Speech Recognition
Author(s): Yun Tang, Wenju Liu, Bo Xu

L3.4 - Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models
Author(s): Huanliang Wang, Yao Qian, Frank Soong, Jian-Lai Zhou, Jiqing Han

L3.5 - On Using Entropy Information to Improve Posterior Probability-based Confidence Measures
Author(s): Tzan-Hwei Chen, Berlin Chen, Hsin-Min Wang

L3.6 - Vietnamese Automatic Speech Recognition: the FLaVoR Approach
Author(s): Quan Vu, Kris Demuynck, Dirk Van Compernolle

L4 ACOUSTIC MODELING AND SPEAKER ADAPTATION
Time: 16:00-18:00 Dec 14

L4.1 - Minimum Phone Error (MPE) Model and Feature Training on Mandarin Broadcast News Task
Author(s): Jia-Yu Chen, Chia-Yu Wan, Yi Chen, Berlin Chen, Lin-shan Lee

L4.2 - State-Dependent Phoneme-Based Model Merging for Dialectal Chinese Speech Recognition
Author(s): Linquan Liu, Thomas Fang Zheng, Wenhu Wu

L4.3 - Non-uniform Kernel Allocation based Parsimonious HMM
Author(s): Peng Liu, Jian-Lai Zhou, Frank Soong

L4.4 - Consistent Modeling of the Static and Time-Derivative Cepstrums for Speech Recognition Using HSPTM
Author(s): Yiu-Pong Lai, Man-Hung Siu

L4.5 - Unsupervised Speaker Adaptation using Reference Speaker Weighting
Author(s): Tsz-Chung Lai, Brian Mak

L4.6 - Automatic Construction of Regression Class Tree for MLLR via Model-based Hierarchical Clustering
Author(s): Shih-Sian Cheng, Yeong-Yuh Xu, Hsin-Min Wang, Hsin-Chia Fu

L5 SPEECH SYNTHESIS
Time: 10:00-12:00 Dec 15

L5.1 - Predicting Prosody From Text
Author(s): Keh-Jiann Chen, Chiu-Yu Tseng, Chia-Hung Tai

L5.2 - Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification
Author(s): Xiaonan Zhang, Jun Xu, Lianhong Cai

L5.3 - Prosodic Word Prediction using a Maximum Entropy Approach
Author(s): Honghui Dong, Jianhua Tao, Bo Xu

L5.4 - Prosodic Words Prediction from Lexicon Words with CRF and TBL Joint Method
Author(s): Heng Kang, Wenju Liu

L5.5 - An HMM-Based Mandarin Chinese Text-to-Speech System
Author(s): Yao Qian, Frank Soong, Yining Chen, Min Chu

L5.6 - A Hakka Text-To-Speech System
Author(s): Hsiu-Min Yu, Hsin-Te Hwang, Dong-Yi Lin, Sin-Horng Chen

L6 RECOGNITION OF SPEAKERS AND LANGUAGES
Time: 10:00-12:00 Dec 15

L6.1 - Integrating Complementary Features with a Confidence Measure for Speaker Identification
Author(s): Nengheng Zheng, P. C. Ching, Ning Wang, Tan Lee

L6.2 - Discriminative Transformation for Sufficient Adaptation in Text-Independent Speaker Verification
Author(s): Hao Yang, Yuan Dong, Xianyu Zhao, Jian Zhao, Haila Wang

L6.3 - Fusion of Acoustic and Tokenization Features for Speaker Recognition
Author(s): Rong Tong, Bin Ma, Kong-Aik Lee, Changhuai You, Donglai Zhu, Tomi Kinnunen, Hanwu Sun, Minghui Dong, Eng-Siong Chng, Haizhou Li

L6.4 - UBM based Speaker Segmentation and Clustering for 2-Speaker Detection
Author(s): Jing Deng, Thomas Fang Zheng, Wenhu Wu

L6.5 - Design of Cubic Spline Wavelet for Open Set Speaker Classification in Marathi
Author(s): Hemant A. Patil, T. K. Basu

L6.6 - Language Identification by Using Syllable-based Duration Classification on Code-switching Speech
Author(s): Dau-Cheng Lyu, Ren-Yuan Lyu, Yuang-Chin Chiang, Chun-Nan Hsu

L7 TOPICS IN SPEECH SCIENCE
Time: 13:30-15:30 Dec 15

L7.1 - Mechanisms of Question Intonation in Mandarin
Author(s): Jiahong Yuan

L7.2 - Comparison of Perceived Prosodic Boundaries and Global Characteristics of Voice Fundamental Frequency Contours in Mandarin Speech
Author(s): Wentao Gu, Keikichi Hirose, Hiroya Fujisaki

L7.3 - Linguistic Markings of Units in Spontaneous Mandarin
Author(s): Shu-Chuan Tseng

L7.4 - Phonetic and Phonological Analysis of Focal Accents of Disyllabic Words in Standard Chinese
Author(s): Yuan Jia, Ziyu Xiong, Aijun Li

L7.5 - Focus, Lexical Stress and Boundary Tone: Interaction of Three Prosodic Features
Author(s): Lu Zhang, Yi-Qing Zu, Run-Qiang Yan

L7.6 - Speech Synthesis Based on a Physiological Articulatory Model
Author(s): Qiang Fang, Jianwu Dang

L8 SPOKEN AND MULTIMODAL DIALOG SYSTEMS AND APPLICATIONS
Time: 16:00-18:00 Dec 15

L8.1 - A Corpus-based Approach for Cooperative Response Generation in a Dialog System
Author(s): Zhiyong Wu, Helen Meng, Hui Ning, Sam C. Tse

L8.2 - A Cantonese Speech-Driven Talking Face Using Translingual Audio-to-Visual Conversion
Author(s): Lei Xie, Helen Meng, Zhi-Qiang Liu

L8.3 - The Implementation of Service Enabling with Spoken Language of a Multi-modal System Ozone
Author(s): Sen Zhang, Yves Laprie

L8.4 - Spoken Correction for Chinese Text Entry
Author(s): Bo-June Paul Hsu, James Glass

L8.5 - Automatic Detection of Tone Mispronunciation in Mandarin
Author(s): Li Zhang, Chao Huang, Min Chu, Frank Soong, Xianda Zhang, Yudong Chen

L8.6 - Towards Automatic Tone Correction in Non-native Mandarin
Author(s): Mitchell Peabody, Stephanie Seneff

L9 MACHINE TRANSLATION AND LANGUAGE MODELING FOR SPOKEN LANGUAGE PROCESSING
Time: 10:00-12:00 Dec 16

L9.1 - Some Improvements in Phrase-Based Statistical Machine Translation
Author(s): Zhendong Yang, Wei Pang, Jinhua Du, Wei Wei, Bo Xu

L9.2 - Automatic Spoken Language Translation Template Acquisition Based on Boosting Structure Extraction and Alignment
Author(s): Rile Hu, Xia Wang

* L9.3 - A Feasibility Study for Chinese-Spanish Statistical Machine Translation
Author(s): Rafael E. Banchs, Josep M. Crego, Patrik Lambert, Jose B. Marino

L9.4 - A Unified Framework for Text Analysis in Chinese TTS
Author(s): Guohong Fu, Min Zhang, GuoDong Zhou, Kang-Kuong Luke

* L9.5 - Chinese Character-based Segmentation & POS-tagging and Named Entity Identification with a CRF Chunker
Author(s): Xinhui Hu, Hideki Kashioka

* L9.6 - Automatic Chinese Dialogue Text Summarization Based On LSA and Segmentation
Author(s): Chuanhan Liu, Yongcheng Wang, Fei Zheng, Derong Liu

POSTER SESSIONS

* P1 SPEECH ANALYSIS, ENHANCEMENT, CODING AND SYNTHESIS
Time: 13:30-15:30 Dec 14

P1.1 - Acoustic Analysis of Emotional Speech in Mandarin Chinese
Author(s): Sheng Zhang, P.C. Ching, Fanrang Kong

P1.2 - Investigation on Pleasure Related Acoustic Features of Affective Speech
Author(s): Dandan Cui, Lianhong Cai, Yongxin Wang, Xiaozhou Zhang

P1.3 - Comparison of News Announcing and Talking Styles in Broadcast Speech
Author(s): Yu Zou, Xiaohua Li, Min Hou, Anna

P1.4 - Multi-Pitch Detection for Co-Channel Speech Utilizing Frequency Channel Piecewise Integration and Morphological Feedback Verification Tracking
Author(s): Yong Guan, Peng Li, Wenju Liu, Bo Xu

P1.5 - A New Approach for Speech/Music Discrimination Based on Cepstral Distance
Author(s): Mu-Yeol Choi, Seul-Han Park, Hwa Jeon Song, Hyung Soon Kim

P1.6 - Speaker Diarization System Based on GMM and BIC
Author(s): Tantan Liu, Xiaoxing Liu, Yonghong Yan

P1.7 - A Robust Acoustic Echo Canceller for Noisy Environment
Author(s): Shenghao Qin, Sha Meng, Jia Liu

P1.8 - Short-Time ICA for Blind Separation of Noisy Speech
Author(s): Jing Zhang, P.C. Ching

P1.9 - A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder
Author(s): Jing Wang, Jingming Kuang, Shenghui Zhao

P1.10 - A Low-complexity Improved WI Speech Coding at 2kbps
Author(s): Fengyan Qi, Changchun Bao

P1.11 - Modular Text-to-Speech Synthesis Evaluation for Mandarin Chinese
Author(s): Jilei Tian, Jani Nurminen, Imre Kiss

P1.12 - State-Correlated Duration Model for HMM-Based Speech Synthesis System
Author(s): Xiaocui Li, Heng Kang, Wenju Liu

P1.13 - A Diphone Sharing Method Towards Scalable Unit-training-based TTS
Author(s): Jian Li, Xiaoyan Lou, Jie Hao, Lifu Yi

P1.14 - A Unified Totally-Data-Driven Framework for Duration and Intonation Modeling
Author(s): Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao

P1.15 - Pitch Prediction for Mandarin TTS with Mutual Prosodic Constraint
Author(s): Jian Yu, Jianhua Tao, Xia Wang

P1.16 - Prosodic Word Grouping in Mandarin TTS System
Author(s): Qing Guo, Endong Xun, Nobuyuki Katae

P1.17 - An Initial System for Integrated Synthesis of Mandarin, Min-nan, and Hakka Speech
Author(s): Hung-Yan Gu, Yan-Zuo Zhou, Huang-Liang Liau

P1.18 - Spectral Continuity Measures at Mandarin Syllable Boundaries
Author(s): Jun Xu, Lianhong Cai

P1.19 - A Multi-stage Method for Text-To-Pronunciation Conversion
Author(s): Ching-Hsien Lee, Ren-Jr Wang, Chung-Jen Chiu

P1.20 - Decision Tree Classification Approach for Model Selection in Segmenting Mandarin TTS Corpus
Author(s): Xiaoliang Yuan, Yuan Dong, Dezhi Huang, Jun Guo, Haila Wang

* P2 TOPICS IN SPOKEN LANGUAGE PROCESSING
Time: 16:00-18:00 Dec 14

P2.1 - Evaluation of Aspiration Sounds of Chinese Labial and Alveolar Diphthong Uttered by Japanese Students Using Voice Onset Time and Breathing Power
Author(s): Akemi Hoshino, Akio Yasuda

P2.2 - Contrastive Study on Tonal Patterns Between Accented and Standard Chinese
Author(s): Aijun Li, Ziyu Xiong, Xia Wang

P2.3 - Mismatch Negativity Elicited by Non-cluster and Cluster Consonants Changes in Thai Words in Humans
Author(s): Wichian Sittiprapaporn, Usanee Sotthiwat, Chittin Chindaduangratn, Naiphinich Kotchabhakdi

P2.4 - F0 Analysis of Chinese Accented German Speech
Author(s): Hongwei Ding, Oliver Jokisch, Rüdiger Hoffmann

P2.5 - Automatic Scoring of Flat Tongue and Raised Tongue in Computer-assisted Mandarin Learning
Author(s): Bin Dong, Qingwei Zhao, Yonghong Yan

P2.6 - Improvements in Tone Pronunciation Scoring for Strongly Accented Mandarin Speech
Author(s): Fuping Pan, Qingwei Zhao, Yonghong Yan

P2.7 - The Application of Phone Weight in Putonghua Pronunciation Quality Assessment
Author(s): Qingsheng Liu, Si Wei, Yu Hu, Wu Guo, Renhua Wang

P2.8 - SpeechQoogle: An Open-Domain Question Answering System with Speech Interface
Author(s): Guoping Hu, Dan Liu, Qingfeng Liu, Renhua Wang

P2.9 - Research and Analysis of Fast Training in SVM-based Audio Classification
Author(s): Shilei Zhang, Hongchen Jiang, Shuwu Zhang, Bo Xu

P2.10 - An Efficient and Robust Approach to Audio ID Identification
Author(s): Ming Li, Jian Liu, Yonghong Yan

P2.11 - Robust Speech-Annotated Photo Retrieval Using Syllable-Transformed Patterns
Author(s): Chien-Lin Huang, Wei-Chuan Lee, Chung-Hsien Wu

P2.12 - Two-layer Distance Scheme in Matching Engine for Query by Humming System
Author(s): Feng Zhang, Yan Song, Lirong Dai, Renhua Wang

P2.13 - A Top-down Approach to Melody Match in Pitch Contour for Query by Humming
Author(s): Xiao Wu, Ming Li, Jian Liu, Jun Yang, Yonghong Yan

P2.14 - Multi-accented Mandarin Database Construction and Benchmark Evaluations
Author(s): Xiang Yan, Lei He, Pei Ding, Rui Zhao, Jie Hao

* P3 SPEECH RECOGNITION
Time: 10:00-12:00 Dec 15

P3.1 - EM Algorithm with Split and Merge in Trajectory Clustering for Automatic Speech Recognition
Author(s): Yan Han, Lou Boves

P3.2 - Sausage-net-based Minimum Phone Error Training for Continuous Phone Recognition
Author(s): Jiang-Chun Chen, Chun-Jen Lee, Shuo-Pin Hsu, J.-S. Roger Jang

P3.3 - Training Discriminative HMM by Optimal Allocation of Gaussian Kernels
Author(s): Zhijie Yan, Peng Liu, Jun Du, Frank Soong, Renhua Wang

P3.4 - Recognition of Emotional Speech and Speech Emotion in Farsi
Author(s): Davood Gharavian, S.M. Ahadi

P3.5 - Monte Carlo Noisy HMM Estimation and Segmental Differential Features on the Aurora2 Clean Training Evaluation
Author(s): Jing-Teng Zeng, Cheng-Chang Lee, Jeng-Shien Lin, Yuan-Fu Liao, Sen-Chia Chang

P3.6 - Optimizing the Implementation of MMSE Enhancement for Robust Speech Recognition
Author(s): Pei Ding, Lei He, Xiang Yan, Rui Zhao, Jie Hao

P3.7 - Speech Endpoint Detection Based on Sub-band Energy and Harmonic Structure of Voice
Author(s): Yanmeng Guo, Qiang Fu, Yonghong Yan

P3.8 - Improving the Robustness of LPCC Feature Against Impulsive Noise by Applying the FOP Method
Author(s): Pei Ding

P3.9 - A Low-Cost Robust Front-end for Embedded ASR System
Author(s): Lihui Guo, Xin He, Yue Lu, Yaxin Zhang

P3.10 - A Comparative Study on Confidence Measure in Mandarin Command Word Recognition
Author(s): Cong Liu, Zhijie Yan, Yu Hu, Renhua Wang

P3.11 - Speaker Adaptation Using Projection to Latent Structure Algorithm
Author(s): Jingying Wang, Zuoying Wang

P3.12 - Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition
Author(s): Hua Zhang, Yun Tang, Wenju Liu, Bo Xu

P3.13 - Experimental Investigation into Alignment-based Acoustic Confidence Measures in Keyword Verification for Mandarin Speech
Author(s): Yiyan Liu, Yingchun Yang, Zhenyu Shan

P3.14 - Speaker, Vocabulary and Context Independent Word Spotting System for Continuous Speech
Author(s): Radu Timofte, Ville Hautamaki, Pasi Franti

P3.15 - Keyword Spotting Based on Phoneme Confusion Matrix
Author(s): Pengyuan Zhang, Jian Shao, Jiang Han, Zhaojie Liu, Yonghong Yan

P3.16 - Performance Evaluation of Non-Keyword Modeling for Vocabulary-Independent Keyword Spotting
Author(s): Young Kuk Kim, Hwa Jeon Song, Hyung Soon Kim

P3.17 - DOE and ANOVA based Performance Influencing Factor Analysis for Evaluation of Speech Recognition Systems
Author(s): Xiangdong Wang, Feng Xie, Shouxun Lin, Yuelian Qian, Qun Liu

P3.18 - Full Utilization of Closed-captions in Broadcast News Recognition
Author(s): Meng Meng, Shijin Wang, Jiaen Liang, Peng Ding, Bo Xu

P3.19 - Integrating Hypotheses of Multiple Recognizers for Improving Mandarin LVCSR Performance
Author(s): Yu Shi, Frank Soong, Jian-Lai Zhou

P3.20 - English Alphabet Recognition Based on Chinese Acoustic Modeling
Author(s): Linquan Liu, Thomas Fang Zheng, Wenhu Wu

* P4 RECOGNITION OF SPEAKERS AND LANGUAGES
Time: 13:30-15:30 Dec 15

P4.1 - Minimum Classification Error Based Optimal Linear Combination for Spoken Language Identification
Author(s): Donglai Zhu, Rong Tong, Bin Ma, Haizhou Li

P4.2 - Automatic Tonal and Non-Tonal Language Classification and Language Identification Using Prosodic Information
Author(s): Liang Wang, Eliathamby Ambikairajah, Eric H.C. Choi

P4.3 - Incorporating Prosodic with Acoustic information for ISCSLP'2006 Speaker Recognition Evaluation - Robust Cross-Channel Speaker Verification
Author(s): Wen-Chieh Chang, Ding-Yun Chen, Zi-He Chen, Zhi-Ren Zeng, Yuan-Fu Liao, Yau-Tarng Juang

P4.4 - Compensations for SVM in Text-Independent Speaker Verification
Author(s): Xiang-Feng Lu, Jia Liu

P4.5 - Exploiting GMM-based Quality Measure for SVM Speaker Verification
Author(s): Rong Zheng, Hongchen Jiang, Shuwu Zhang, Bo Xu

P4.6 - Feature Extraction and Test Algorithm for Speaker Verification
Author(s): Wu Guo, Renhua Wang, Lirong Dai

P4.7 - Frame-level Nonlinearity for Robust DTW-based Speaker Verification
Author(s): Jian Luan, Jie Hao, Tomonari Kakino, Tomonori Ikumi

P4.8 - Temporal Discrete Cosine Transform: Towards Longer Term Temporal Features for Speaker Verification
Author(s): Tomi Kinnunen, Chin Wei Eugene Koh, Lei Wang, Haizhou Li, Eng-Siong Chng

P4.9 - On the Use of Long-Term Average Spectrum in Automatic Speaker Recognition
Author(s): Tomi Kinnunen, Ville Hautamaki, Pasi Franti

P4.10 - A New Data Fusion Technique and Performance Measure for Identification of Twins in Marathi
Author(s): Hemant A. Patil, T. K. Basu

Note: Papers preceded by * are included in the companion volume of the proceedings. If a section is preceded by *, all papers in that section are included in the companion volume. The remaining papers are included in Springer LNAI 4274.

©2005-2006 Chinese and Oriental Languages Information Processing Society, Singapore | Last updated on December 21, 2006.