Haizhou Li, IEEE Fellow, Presidential Chair Professor

- School of Artificial Intelligence
The Chinese University of Hong Kong, Shenzhen, China

- Centre of Language, Intelligence, and Machines (LIMA)
Shenzhen Loop Area Institute, Shenzhen, China

- U Bremen Excellence Chair
University of Bremen, Germany

- Department of Electrical and Computer Engineering
National University of Singapore, Singapore

CUHK Email: haizhouli at cuhk dot edu dot cn
NUS Email: haizhou.li at u dot nus dot edu

Personal: http://www.colips.org/~eleliha/
School: https://sai.cuhk.edu.cn/en/teacher/102
Lab: https://cde.nus.edu.sg/ece/hlt/
https://orcid.org/0000-0001-9158-9401

updated on 1 October 2025

Biography

Haizhou Li is the Dean and X. Q. Deng Presidential Chair Professor at the School of Artificial Intelligence, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China. He is also with the Department of Electrical and Computer Engineering, National University of Singapore (NUS), Singapore. He is the Director of Shenzhen Key Laboratory of Cross-Modal Cognitive Computing (C3 Lab) at CUHK-Shenzhen, and Machine Listening Lab (MLL) at University of Bremen, Germany.

Haizhou Li received the B.Sc., M.Sc., and Ph.D. degrees in electrical and electronic engineering from South China University of Technology, Guangzhou, China, in 1984, 1987, and 1990, respectively. He has worked on speech and language technology in academia and industry since 1988. As an educator, he taught at The University of Hong Kong (1988-1989), South China University of Technology in Guangzhou, China (1990-1994), Nanyang Technological University in Singapore (2006-2016), University of Eastern Finland (2009), and University of New South Wales (2011). He was a researcher at CRIN/INRIA in France (1994-1995), a Research Manager at Apple-ISS Research Centre (1996-1998), Research Director of Lernout & Hauspie Asia Pacific (1999-2001), Vice President of InfoTalk Corp. Ltd and General Manager of InfoTalk Technology (Singapore) Pte Ltd (2001-2003), the Principal Scientist and Department Head of Human Language Technology at the Institute for Infocomm Research (2003-2016), and the Research Director of the Institute for Infocomm Research (2014-2016), the Agency for Science, Technology and Research, Singapore. He co-founded Baidu-I2R Research Centre in Singapore (2012). Dr. Li was known for his technical contributions to several award-winning speech products, such as Apple's Chinese Dictation Kits for Macintosh (1996) and Lernout & Hauspie's Speech-Pen-Keyboard Text Entry Solution for Asian languages (1999). He was the architect of a series of major technology deployments that include the TELEFIQS voice-automated call centre service at Singapore Changi International Airport (2001), the voiceprint engine for the Lenovo A586 Smartphone (2012), and Baidu Music Search (2013).

Dr. Li's research interests include automatic speech recognition, natural language processing and neuromorphic computing. He has served as the Editor-in-Chief of IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING (2015-2018), Associate Editor (2008-2012) and Senior Area Editor (2014-2016) of IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, Associate Editor (2012-2013) of ACM TRANSACTIONS ON SPEECH AND LANGUAGE PROCESSING, Computer Speech and Language (2012-2024), Springer International Journal of Social Robotics (2008-2024), and a Member of IEEE Speech and Language Processing Technical Committee (2013-2015), Awards Board (2021-2023), Publications Board (2015-2018), and Conference Board (2023-2024) of IEEE Signal Processing Society. He has served as the President of the International Speech Communication Association (ISCA, 2015-2017), the President of Asia Pacific Signal and Information Processing Association (APSIPA, 2015-2016), the President of the Chinese and Oriental Language Information Processing Society (COLIPS, 2015-2024), the President of the Asian Federation of Natural Language Processing (AFNLP, 2017-2018), and the Vice President (Conferences) of IEEE Signal Processing Society (2024-2026). He was the General Chair of ACL 2012, INTERSPEECH 2014, IWSDS 2019, ASRU 2019, IEEE ICASSP 2022, APSIPA ASC 2025, the Local Arrangement Chair of SIGIR 2008, ACL-IJCNLP 2009, EMNLP 2023, and the Technical Program Chair of ISCSLP 1998, APSIPA Annual Summit and Conference 2010, IEEE Spoken Language Technology Workshop 2014, and IEEE ChinaSIP 2015.

Dr. Li was the recipient of National Infocomm Awards 2002, Institution of Engineers Singapore (IES) Prestigious Engineering Achievement Award 2013 and 2015, President's Technology Award 2013, and MTI Innovation Activist Gold Award 2015 in Singapore. He was named one of the two Nokia Visiting Professors in 2009 by Nokia Foundation, IEEE Fellow in 2014 for leadership in multilingual, speaker and language recognition, ISCA Fellow in 2018 for contributions to multilingual speech information processing, U Bremen Excellence Chair Professor in 2019, Fellow of the Academy of Engineering Singapore in 2022, Fellow of Asia Pacific Artificial Intelligence Association in 2022, and DFG Mercator Fellow in 2022. Dr. Li is a member of ACL, ACM, and APSIPA.

Distinctions

1. DFG Mercator Fellow, 2022

2. Fellow, Asia-Pacific Artificial Intelligence Association, 2022

3. Fellow, Academy of Engineering Singapore, 2022

4. Bremen Excellence Chair Professor, Germany, 2019

5. Fellow of the International Speech Communication Association 2018 (citation: for contributions to multilingual speech information processing)

6. First Prize at 2nd International Collegiate Competition for Brain-Inspired Computing, Beijing, China, 2018

7. A*STAR Awards 2016 (A*STAR Borderless Awards: Autonomous Vehicle Programme)

8. PS21 ExCEL Awards 2015 Innovation Champion (Bronze), Prime Minister's Office, Singapore (Citation: for efforts in practicing application research to pursue fundamental understandings of speech recognition and machine translation technologies)

9. ASEAN Outstanding Engineering Achievement Award 2015, ASEAN Federation of Engineering Organizations (Citation: in recognition of an outstanding engineering project which has made significant contributions to the country's development - Speak to Me in My Language)

10. MTI Innovation Activist Gold Award 2015, Ministry of Trade and Industry, Singapore

11. Best Technology Show and Tell Award, INTERSPEECH 2014

12. IEEE Fellow 2014 (Citation: for leadership in multilingual, speaker and language recognition.)

13. President's Technology Award 2013, Singapore (Citation: for the outstanding contributions to human language technology that have empowered the industry and benefited the Asian society. see also photo and speech by Minister S. Iswaran at National Archives of Singapore)

14. IES Prestigious Engineering Achievement Award 2013, Singapore (voiceprint technology)

15. The Most Cited Article, Speech Communication, 2007-2013

16. Distinguished Alumni Awards, South China University of Technology, 2012 (SCUT 60th Anniversary)

17. Nokia Visiting Professor 2009, Nokia Foundation

18. Achiever of the Year 2007/08, Institute for Infocomm Research, A*STAR

19. The Enterprise Challenge Awards 2004, Prime Minister's Office, Singapore

20. National Infocomm Awards 2002, Infocomm Development Authority, Singapore

Best Papers

1. Best Paper Award, Qibing Bai, Shuai Wang, Zhijun Liu, Mingyang Zhang, Wei Rao, Yannan Wang, Haizhou Li, Diffusion-Based Method with TTS Guidance for Foreign Accent Conversion, The 14th ISCA International Symposium on Chinese Spoken Language Processing, Beijing, 7-10 November, 2024

2. CVPR 2022 Best Paper Finalist, Egocentric Vision (EgoVis) 2022/2023 Distinguished Paper Award, Ego4D: Around the World in 3,000 Hours of Egocentric Video

3. Best Paper Award 2022, Peiwen Li, Enze Su, Jia Li, Siqi Cai, Longhan Xie, and Haizhou Li, ESAA: An EEG-Speech Auditory Attention Detection Database, The 25th Conference of the Oriental COCOSDA (O-COCOSDA 2022), Hanoi, Vietnam, November 24 to 26, 2022

4. Best Paper Award 2021, Chen Zhang, Luis Fernando D'Haro, Thomas Friedrichs, Haizhou Li and Yiming Chen, Investigating the Impact of Pre-trained Language Models on Dialog Evaluation, The 12th International Workshop on Spoken Dialog System Technology, 15-17 November 2021, Singapore.

5. Best Paper Award 2021, Qian Xinyuan, Bidisha Sharma, Amine El Abridi and Haizhou Li, SLoClas: A Database for Joint Sound Localization and Classification, The 24th Conference of the Oriental COCOSDA, 18-20 November 2021, Singapore.

6. IEEE Computational Intelligence Magazine Outstanding Paper Award 2019, How the Brain Formulates Memory: A Spatio-Temporal Model, Jun Hu, Huajin Tang, Kay Chen Tan and Haizhou Li, IEEE Computational Intelligence Magazine, vol. 11, no. 4, pp. 56-68, May 2016

7. Featured as a productive and innovative author in speech and language processing (1965-2015) by the NLP4NLP Corpus, 2019

8. AI 2000 Speech Recognition Most Influential Scholars Honorable Mention (2009-2019)

9. Poster Presentation Award, A Dual Alignment Scheme for Improved Speech-to-Singing Voice Conversion, The 9th APSIPA Annual Summit and Conference, 12-15 December, 2017, Kuala Lumpur, Malaysia

10. Best Paper Award, Computer-Assisted Pronunciation Training: From Pronunciation Scoring Towards Spoken Language Learning, Nancy F. Chen, Haizhou Li, The 8th APSIPA Annual Summit and Conference, 13-16 December, 2016, Jeju, Korea

11. IEEE Computational Intelligence Society Outstanding TNNLS Paper Award 2016, Rapid Feedforward Computation by Temporal Encoding and Learning with Spiking Neurons, Qiang Yu, Huajin Tang, Kay Chen Tan and Haizhou Li, IEEE Transactions on Neural Networks and Learning Systems, Vol. 24, No. 10, pp. 1539-1552, 2013

12. Best Paper Award, Spoken Keyword Spotting Based on DTW, Jinyong Hou, Lei Xie, Peng Yang, Xiong Xiao, Zhixiang Liang, Haihua Xu, Lei Wang, Bin Ma, Hang Lu, Eng Siong Chng, Haizhou Li, China National Conference on Man-Machine Speech Communication (NCMMSC) 2015, October 2015, Tianjin, China

13. Best Paper Award, Parallel Inference of Dirichlet Process Gaussian Mixture Models for Unsupervised Acoustic Modeling: A Feasibility Study, Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, ZeroSpeech 2015 Challenge at INTERSPEECH 2015, 6-10 September 2015, Dresden, Germany

14. Best Paper M. Anandakrishnan Award, A Cloud-based Large Vocabulary Speech Recognition System for Tamil, Sunil Sivadas, Boon Pang Lim, Thai Ngoc Thuy Hoang Helen, Muthalagu Meyyappan, Bin Ma, and Haizhou Li, 14th Tamil Internet Conference 2015, 30 May - 1 June 2015, Singapore

15. Best Paper Award, The 4th Asia Pacific Signal and Information Processing Association Annual Summit and Conference, A Study on Spoofing Attack in State-of-the-Art Speaker Verification: the Telephone Speech Case, Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, and Eliathamby Ambikairajah, in Proc. APSIPA ASC 2012, 3-6 December, 2012, Hollywood, California, USA

Best Student Papers

1. Best Student Paper Award, The Impact of Synchronized Visual and Auditory Attention on Human Perception, Lichuan Jiang, Jiani Zhong, Muqing Jian, Xuanzhuo Liu, Siqi Cai, Haizhou Li, 16th International Conference on Social Robotics, 25-28 September 2024, Shenzhen, China

2. Best Student Paper Award, Use of Claimed Speaker Models for Replay Detection, Gajan Suthokumar, Kaavya Sriskandaraja, Vidhyasaharan Sethu, Chamith Wijenayake, Eliathamby Ambikairajah, Haizhou Li, The 10th APSIPA Annual Summit and Conference, 12-15 November 2018, Honolulu, USA

3. Best Student Paper Award, Perceptual Evaluation of Singing Quality, Chitralekha Gupta, Haizhou Li, Ye Wang, The 9th APSIPA Annual Summit and Conference, 12-15 December, 2017, Kuala Lumpur, Malaysia

4. IEEE Ganesh N. Ramaswamy Memorial Student Grant 2015, Source-Specific Informative Prior for i-Vector Extraction, Sven Shepstone, Kong Aik Lee, Haizhou Li, Zheng-Hua Tan, and Soren Holdt Jensen, ICASSP 2015, 19-24 April 2015, Brisbane, Australia

5. IEEE Ganesh N. Ramaswamy Memorial Student Grant 2014, Minimum Divergence Estimation of Speaker Prior in Multi-Session PLDA Scoring, Liping Chen, Kong Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li Rong Dai, ICASSP 2014, 4-9 May 2014, Florence, Italy

6. ISCA International Symposium on Chinese Spoken Language Processing, Best Student Paper Award 2010, Factor Analysis based Spatial Correlation Modeling for Speaker Verification, Eryu Wang, Kong Aik Lee, Bin Ma, Haizhou Li, Wu Guo, and Lirong Dai, in Proc. ISCSLP, pp. 166-170, 29 November - 3 December 2010, Sun Moon Lake, Taiwan

Professional Leadership

1. Vice President, IEEE Signal Processing Society 2024-2026

2. Member, Awards Board, IEEE Signal Processing Society 2021-2023

3. Member, Fellow Evaluation Committee, IEEE Signal Processing Society 2019

4. President, Asian Federation of Natural Language Processing (AFNLP), 2017-2018

5. President, International Speech Communication Association (ISCA), 2015-2017

6. Vice President, Asian Federation of Natural Language Processing (AFNLP), 2015-2016

7. Member, Publications Board, IEEE Signal Processing Society, 2015-2017

8. Vice President, International Speech Communication Association (ISCA), 2013-2015

9. Board Member, International Speech Communication Association (ISCA), 2009-2017

10. Board Member, Asian Federation of Natural Language Processing (AFNLP), 2006-2012

11. Committee Member, IEEE Speech and Language Processing Technical Committee, 2013-2015

12. Committee Member, IEEE Singapore Computer Chapter, 2010-2011

13. Committee Member, IEEE Singapore, Systems, Man, & Cybernetics Chapter, 2011-2014

14. President, Teochew Doctorate Society, Singapore 2018-2022

15. President, Chinese and Oriental Languages Information Processing Society, 2011-2022

16. President, Asia Pacific Signal and Information Processing Association, 2015-2016

17. President-Elect, Asia Pacific Signal and Information Processing Association, 2013-2014

18. President (2006-2014), Honorary President (2015-), South China University of Technology Alumni Association (Singapore)

19. Chair, ISCA Special Interest Group on Chinese Spoken Language Processing, ISCA, 2011-2014

20. Member of Standing Committee, National Conference on Man-Machine Speech Communications, China, 2006-

Editorial Services

1. Editor-in-Chief, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2015-2018

2. Guest Associate Editor, Frontiers in Neuroscience, 2018

3. Editor, Signal Processing Repository, IEEE Signal Processing Society, 2013-2014

4. Editor, IEEE Speech and Language Processing Technical Committee Newsletter, 2013-2015

5. Senior Area Editor, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2014

6. Associate Editor, IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2009-2012

7. Associate Editor, Springer International Journal of Social Robotics, 2007-present

8. Associate Editor, ACM TRANSACTIONS ON SPEECH AND LANGUAGE PROCESSING, 2011-2013

9. Associate Editor, Journal of Multimedia, 2013-2014

10. Associate Editor, Computer Speech and Language, 2012-present

11. Guest Editor, PROCEEDINGS OF THE IEEE, 2013 (Special Issue on Speech Information Processing)

12. Guest Editor, IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2014 (Special Issue on Continuous Space Language Modeling)

13. Guest Editor, IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2013 (Special Issue on Large Scale Optimization)

14. Guest Editor, The Institute of the Electronics, Information and Communication Engineers (IEICE), 2012 (Special Issue on Recent Advances in Multimedia Signal Processing Techniques and Applications)

15. Guest Editor, Computational Linguistics and Chinese Language Processing (CLCLP), 2007

16. Guest Editor, International Journal of Computer Processing of Oriental Languages (IJCPOL), 2007

17. Guest Editor, ACM TRANSACTIONS ON ASIAN LANGUAGES INFORMATION PROCESSING, 2007

Conference Services

1. General Chair, The 17th Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2025, Singapore

2. Honorary Chair, 2024 IEEE Spoken Language Technology Workshop, 2-5 December 2024, Macau, China

3. General Chair, International Conference on Social Robotics (ICSR-InnoBiz) 2024, Shenzhen, China

4. Local Chair, The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), 6-10 December, 2023, Singapore

5. General Chair, The 47th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 22-27 May 2022, Singapore

6. General Chair, The 26th International Conference on Asian Language Processing (IALP), 27-28 Oct 2022, in Singapore and Shenzhen

7. General Chair, The 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 29-31 July 2021, Singapore

8. General Chair, The 12th International Workshop on Spoken Dialog System Technology, 15-17 November 2021, Singapore

9. General Chair, The 24th Oriental COCOSDA, 18-20 November 2021, Singapore

10. Senior Area Chair, ACL-IJCNLP 2021, Thailand

11. Area Chair, The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH), 2020, Shanghai (online)

12. Member of Best Paper Committee, EMNLP 2020, Online conference

13. Publicity Chair, AACL-IJCNLP 2020, Online conference

14. Senior Area Chair, EMNLP-IJCNLP 2019, Hong Kong

15. General Chair, IEEE Workshop on Automatic Speech Recognition and Understanding 2019, December 2019, Singapore

16. Area Chair, The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2019, Graz, Austria

17. Area Chair, The 19th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2018, Hyderabad, India

18. Associate Editor, the 24th International Conference on Pattern Recognition (ICPR), 20-24 August 2018, Beijing, China

19. General Chair, The 5th International Conference on Orange Technologies (ICOT), 8-10 December 2017, Singapore

20. Area Chair, The 18th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2017, Stockholm, Sweden

21. Area Chair, The 17th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2016, San Francisco, USA

22. Technical Program co-Chair, The Third IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP), 2015, Chengdu, China

23. Area Chair, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015, Brisbane, Australia

24. Area Chair, The 8th IAPR International Conference on Biometrics, 2015, Phuket, Thailand

25. General Chair, The 15th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2014, Singapore

26. General Chair, The 9th International Symposium on Chinese Spoken Language Processing, (ISCA SIG CSLP) 2014, Singapore

27. Technical Program Chair, IEEE Workshop on Spoken Language Technology (SLT) 2014, South Lake Tahoe

28. Area Chair, Conference on Empirical Methods in Natural Language Processing (EMNLP), 2013, Seattle, USA

29. Publicity Chair, Automatic Speech Recognition and Understanding Workshop (ASRU), 2013, Olomouc, Czech Republic

30. Publicity Chair, 15th ACM International Conference on Multimodal Interaction (ICMI), 2013, Sydney, Australia

31. Area Chair, IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP), 2013, Beijing, China

32. Area Chair, International Conference on Pattern Recognition, 2012 (Tsukuba, Japan), 2014 (Stockholm, Sweden)

33. General Chair, The 50th Annual Meeting of Association for Computational Linguistics (ACL), 2012, Jeju, Korea

34. Organizing Chair, The Speaker and Language Recognition Workshop (Odyssey), 2012

35. Posters Chair, 5th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia, 2012, Singapore

36. Area Coordinator, INTERSPEECH 2010, 26-30 September 2010, Makuhari, Japan

37. Workshop Chair, The 2nd Named Entities Workshop with Shared Task on Machine Transliteration, ACL 2010, Sweden

38. Area Chair, The 23rd International Conference on Computational Linguistics, COLING 2010, Beijing, China

39. Area Chair, 2010 Conference on Empirical Methods on Natural Language Processing, EMNLP 2010, Cambridge, Massachusetts, USA

40. Program Chair, The 2nd Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2010, Singapore

41. Conference Chair, International Conference on Social Robotics 2010, Singapore

42. General Chair, International Conference on Asian Language Processing, IALP 2009, Singapore

43. Local Organizing Chair, The 47th Annual Meeting of ACL - 4th International Conference on Natural Language Processing, ACL-IJCNLP 2009, Singapore

44. Workshop Chair, The 1st Named Entities Workshop with Shared Task on Machine Transliteration, ACL-IJCNLP 2009 Workshop, Singapore

45. Local Arrangements Chair, The 31st SIGIR (The 31st Annual International ACM SIGIR Conference) 2008, Singapore

46. Workshop Chair, The 3rd IJCNLP (International Joint Conf. on Natural Language Processing) 2008, Hyderabad

47. Chair, The 6th SIGHAN Workshop on Chinese Language Processing, 2008, Hyderabad

48. Technical Track Chair, 5th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and its Applications in Industry (VRCAI 2008), 8-9 December 2008, Singapore

49. Program Chair, Infocomm Horizons 2007

50. Senior Researcher, Johns Hopkins University 2007 Summer Workshop on Human Language Technology

51. Member of Standing Committee, National Conference on Man-Machine Speech Communications, China, 2006-

52. General Chair, The 5th ISCSLP (International Symposium on Chinese Spoken Language Processing), 2006

Scientific Committees

1. Member, Academic Board, Master of Arts in Translation and Interpretation (MTI) programme, Nanyang Technological University, Singapore (2016-2020)

2. Member, The Electrical and Computer Engineering Panel (2017-18), FCT, Portugal

3. Member, The RGC Engineering Panel, University Grants Committee, Hong Kong (2017-2020)

4. Co-Chair, A*STAR Lead User of RI Technologies Taskforce (2015)

5. Member, National Robotics Taskforce (2014)

6. External Reviewer, Research Grants Council of Hong Kong Government

7. Member of Evaluation Panel, Singapore-Israel Industrial R&D Foundation

8. Member of Organizing Committee, The Agency for Science, Technology & Research (A*STAR) and Singapore National Academy of Science (SNAS) Young Scientist Awards 2013, 2014

9. Chair, Infocomms, Media & Computing Cluster Thematic Oversight Committee, Science and Engineering Research Council, Singapore 2013-2014

10. Member of Program Committee: INTERSPEECH, ICASSP, ASRU, ACL, EMNLP, IJCNLP, APSIPA ASC, PACLIC, IWSLT, IWSDS, NEWS, Oriental COCOSDA, AIRS, SIGHAN, ODYSSEY, SLTU, ICPR, Speech Prosody

Keynotes and Invited Talks

1. A Computational Perspective to Language and Intelligence, 2024 International Conference on Translation Education, Shenzhen, China, 12-14 April 2024

2. Seeing to Hear Better, The 25th Conference of the Oriental COCOSDA, Hanoi, Vietnam, 24-26 November, 2022

3. Recent Advances in Selective Auditory Attention, The 2020 IEEE Symposium Series on Computational Intelligence (IEEE SSCI), Canberra, Australia, 1-4 December, 2020.

4. Speech Processing at Cocktail Party, The 15th IEEE Conference on Industrial Electronics and Applications (ICIEA 2020), Kristiansand, Norway, 9-13 November 2020

5. The Story of Artificial Intelligence, The 2nd International Conference on Intelligent Autonomous Systems, 28 February - 2 March, 2019

6. Audio-visual speaker extraction, The 7th IEEE Global Conference on Signal and Information Processing (GlobalSIP), Shaw Centre, Ottawa, Ontario, Canada, 11-14 November 2019

7. Exemplar-based Sparse Representation for Voice Conversion, the 119th audio, speech information processing symposium, 21 December 2017, Tokyo

8. Whither Speech Recognition? Alibaba Technology Forum, Learning from the Deep World, 18 September 2017, Singapore

9. Recent Advances in Singing Synthesis, 5th International Conference on Statistical Language and Speech Processing, 23-25 October 2017, Le Mans, France

10. Speech Synthesis Perfects Everyone's Singing, International Conference on Orange Technologies, 17-20 December 2016, Melbourne, Australia

11. Mandarin Chinese spoken by speakers of European origin, The Fifth Conference on Natural Language Processing and Chinese Computing & The Twenty Fourth International Conference on Computer Processing of Oriental Languages (NLPCC-ICCPOL 2016), December 2-6, 2016, Kunming, China

12. iCALL Mandarin Corpus, Oriental COCOSDA (International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques), 26-28 October 2016, Bali, Indonesia

13. Voice conversion and spoofing countermeasures for speaker verification, Odyssey 2016, The Speaker and Language Recognition Workshop, June 21-24, Bilbao, Spain

Professional Membership

1. Member, Association for Computing Machinery (ACM)

2. Fellow, Institute of Electrical and Electronics Engineers (IEEE)

3. Fellow, International Speech Communication Association (ISCA)

4. Member, Association for Computational Linguistics (ACL)

5. Member, Asia Pacific Signal and Information Processing Association (APSIPA)

6. President, Chinese and Oriental Languages Information Processing Society (COLIPS)

Journal Articles

1. Rui Liu, Zhenqi Jia, Feilong Bao, Haizhou Li, Retrieval-Augmented Dialogue Knowledge Aggregation for expressive conversational speech synthesis. Inf. Fusion 118: 102948 (2025)

2. Rui Liu, Hongyu Yuan, Guanglai Gao, Haizhou Li, Listening and seeing again: Generative error correction for audio-visual speech recognition. Inf. Fusion 120: 103077 (2025)

3. Rui Liu, Jinhua Zhang, Haizhou Li, Hierarchical multi-source cues fusion for mono-to-binaural based Audio Deepfake Detection. Inf. Fusion 120: 103097 (2025)

4. Xinyuan Qian, Xianghu Yue, Jiadong Wang, Huiping Zhuang, Haizhou Li, Analytic Class Incremental Learning for Sound Source Localization With Privacy Protection. IEEE Signal Process. Lett. 32: 726-730 (2025)

5. Yi Ma, Shuai Wang, Tianchi Liu, Haizhou Li, ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification. IEEE Signal Process. Lett. 32: 731-735 (2025)

6. Jiqing Zhang, Malu Zhang, Yuanchen Wang, Qianhui Liu, Baocai Yin, Haizhou Li, Xin Yang, Spiking Neural Networks with Adaptive Membrane Time Constant for Event-Based Tracking. IEEE Trans. Image Process. 34: 1009-1021 (2025)

7. Ruijie Tao, Xinyuan Qian, Rohan Kumar Das, Xiaoxue Gao, Jiadong Wang, Haizhou Li, Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training. IEEE Trans. Multim. 27: 2362-2373 (2025)

8. Ruihang Ji, Dongyu Li, Shuzhi Sam Ge, Haizhou Li, Tunnel Prescribed Control of Nonlinear Systems with Unknown Control Directions. IEEE Trans. Neural Networks Learn. Syst. 36(1): 1383-1395 (2025)

9. Qianhui Liu, Meng Ge, Haizhou Li, Intelligent event-based lip-reading word classification with spiking neural networks using spatio-temporal attention features and triplet loss. Inf. Sci. 675: 120660 (2024)

10. Jiaqi Yan, Qianhui Liu, Malu Zhang, Lang Feng, De Ma, Haizhou Li, Gang Pan, Efficient spiking neural network design via neural architecture search. Neural Networks 173: 106172 (2024)

11. Xinyi Chen, Qu Yang, Jibin Wu, Haizhou Li, Kay Chen Tan, A Hybrid Neural Coding Approach for Pattern Recognition with Spiking Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 3064-3078 (2024)

12. Shuai Wang, Zhengyang Chen, Bing Han, Hongji Wang, Chengdong Liang, Binbin Zhang, Xu Xiang, Wen Ding, Johan Rohdin, Anna Silnova, Yanmin Qian, Haizhou Li, Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Commun. 162: 103104 (2024)

13. Jingru Lin, Meng Ge, Wupeng Wang, Haizhou Li, Mengling Feng, Selective HuBERT: Self-Supervised Pre-Training for Target Speaker in Clean and Mixture Speech. IEEE Signal Process. Lett. 31: 1014-1018 (2024)

14. Duo Ma, Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li, Text-Guided HuBERT: Self-Supervised Speech Pre-Training via Generative Adversarial Networks. IEEE Signal Process. Lett. 31: 2055-2059 (2024)

15. Xiaoxue Gao, Zexin Li, Yiming Chen, Cong Liu, Haizhou Li, Transferable Adversarial Attacks Against ASR. IEEE Signal Process. Lett. 31: 2200-2204 (2024)

16. Rui Liu, Haolin Zuo, Zheng Lian, Björn W. Schuller, Haizhou Li, Contrastive Learning Based Modality-Invariant Feature Acquisition for Robust Multimodal Emotion Recognition with Missing Modalities. IEEE Trans. Affect. Comput. 15(4): 1856-1873 (2024)

17. Qu Yang, Malu Zhang, Jibin Wu, Kay Chen Tan, Haizhou Li, LC-TTFS: Toward Lossless Network Conversion for Spiking Neural Networks With

18. Siqi Cai, Ran Zhang, Malu Zhang, Jibin Wu, Haizhou Li, EEG-Based Auditory Attention Detection with Spiking Graph Convolutional Network. IEEE Trans. Cogn. Dev. Syst. 16(5): 1698-1706 (2024)

19. Koichiro Yoshino, Yun-Nung Chen, Paul A. Crook, Satwik Kottur, Jinchao Li, Behnam Hedayatnia, Seungwhan Moon, Zhengcong Fei, Zekang Li, Jinchao Zhang, Yang Feng, Jie Zhou, Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Dilek Hakkani-Tur, Babak Damavandi, Alborz Geramifard, Chiori Hori, Ankit Shah, Chen Zhang, Haizhou Li, João Sedoc, Luis F. D'Haro, Rafael E. Banchs, Alexander Rudnicky, Overview of the Tenth Dialog System Technology Challenge: DSTC10. IEEE ACM Trans. Audio Speech Lang. Process. 32: 765-778 (2024)

20. Lei Liu, Li Liu, Haizhou Li, Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1559-1572 (2024)

21. Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li, Accented Text-to-Speech Synthesis with Limited Data. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1699-1711 (2024)

22. Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li, Controllable Accented Text-to-Speech Synthesis with Fine and Coarse-Grained Intensity Rendering. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2188-2201 (2024)

23. Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li, Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2324-2337 (2024)

24. Congcong Sun, Hui Tian, Peng Tian, Haizhou Li, Zhenxing Qian, Multi-Agent Deep Learning for the Detection of Multiple Speech Steganography Methods. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2957-2972 (2024)

25. Mingyang Zhang, Yi Zhou, Yi Ren, Chen Zhang, Xiang Yin, Haizhou Li, RefXVC: Cross-Lingual Voice Conversion with Enhanced Reference Leveraging. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4146-4156 (2024)

26. Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li, Speech Separation with Pretrained Frontend to Minimize Domain Mismatch. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4184-4198 (2024)

27. Zexu Pan, Marvin Borsdorf, Siqi Cai, Tanja Schultz, Haizhou Li, NeuroHeed: Neuro-Steered Speaker Extraction Using EEG Signals. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4456-4470 (2024)

28. Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu, An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4569-4579 (2024)

29. Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li, Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4971-4998 (2024)

30. Siqi Cai, Tanja Schultz, Haizhou Li, Brain Topology Modeling With EEG-Graphs for Auditory Spatial Attention Detection. IEEE Trans. Biomed. Eng. 71(1): 171-182 (2024)

31. Miao Liu, Jing Wang, Xinyuan Qian, Haizhou Li, Audio-Visual Temporal Forgery Detection Using Embedding-Level Fusion and Multi-Dimensional Contrastive Loss. IEEE Trans. Circuits Syst. Video Technol. 34(8): 6937-6948 (2024)

32. Zhenyu Weng, Huiping Zhuang, Fulin Luo, Haizhou Li, Zhiping Lin, Few-Shot Contrastive Transfer Learning With Pretrained Model for Masked Face Verification. IEEE Trans. Multim. 26: 3871-3883 (2024)

33. Xinyuan Qian, Wei Xue, Qiquan Zhang, Ruijie Tao, Haizhou Li, Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech. IEEE Trans. Multim. 26: 4480-4489 (2024)

34. Siqi Cai, Peiwen Li, Haizhou Li, A Bio-Inspired Spiking Attentional Neural Network for Attentional Selection in the Listening Brain. IEEE Trans. Neural Networks Learn. Syst. 35(12): 17387-17397 (2024)

35. Ruihang Ji, Shuzhi Sam Ge, Kai Zhao, Haizhou Li, Event-Triggered Tracking Control for Nonlinear Systems With Prescribed Performance. IEEE Trans. Syst. Man Cybern. Syst. 54(6): 3547-3557 (2024)

36. Tao Luo, Weng-Fai Wong, Rick Siow Mong Goh, Anh Tuan Do, Zhixian Chen, Haizhou Li, Wenyu Jiang, Weiyun Yau, Achieving Green AI with Energy-Efficient Deep Learning Using Neuromorphic Computing. Commun. ACM 66(7): 52-57 (2023)

37. Tingting Wang, Zexu Pan, Meng Ge, Zhen Yang, Haizhou Li, Time-Domain Speech Separation Networks With Graph Encoding Auxiliary. IEEE Signal Process. Lett. 30: 110-114 (2023)

38. Yi Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li, TTS-Guided Training for Accent Conversion Without Parallel Data. IEEE Signal Process. Lett. 30: 533-537 (2023)

39. Mingyang Zhang, Xuehao Zhou, Zhizheng Wu, Haizhou Li, Towards Zero-Shot Multi-Speaker Multi-Accent Text-to-Speech Synthesis. IEEE Signal Process. Lett. 30: 947-951 (2023)

40. Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li, Emotion Intensity and its Control for Emotional Voice Conversion. IEEE Trans. Affect. Comput. 14(1): 31-48 (2023)

41. Hui Tian, Yiqin Qiu, Wojciech Mazurczyk, Haizhou Li, Zhenxing Qian, STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams. IEEE ACM Trans. Audio Speech Lang. Process. 31: 277-289 (2023)

42. Qiquan Zhang, Xinyuan Qian, Zhaoheng Ni, Aaron Nicolson, Eliathamby Ambikairajah, Haizhou Li, A Time-Frequency Attention Module for Neural Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 31: 462-475 (2023)

43. Xinyuan Qian, Zhengdong Wang, Jiadong Wang, Guohui Guan, Haizhou Li, Audio-Visual Cross-Attention Network for Robotic Speaker Tracking. IEEE ACM Trans. Audio Speech Lang. Process. 31: 550-562 (2023)

44. Chen Zhang, Luis Fernando D'Haro, Qiquan Zhang, Thomas Friedrichs, Haizhou Li, PoE: A Panel of Experts for Generalized Automatic Dialogue Assessment. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1234-1250 (2023)

45. Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li, Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1706-1719 (2023)

46. Yi Zhou, Zhizheng Wu, Xiaohai Tian, Haizhou Li, Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1916-1926 (2023)

47. Xiaoxue Gao, Chitralekha Gupta, Haizhou Li, PoLyScriber: Integrated Fine-Tuning of Extractor and Lyrics Transcriber for Polyphonic Music. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1968-1981 (2023)

48. Zhenyu Weng, Huiping Zhuang, Haizhou Li, Balakrishnan Ramalingam, Rajesh Elara Mohan, Zhiping Lin, Online Multi-Face Tracking With Multi-Modality Cascaded Matching. IEEE Trans. Circuits Syst. Video Technol. 33(6): 2738-2752 (2023)

49. Yiqin Qiu, Hui Tian, Haizhou Li, Chin-Chen Chang, Athanasios V. Vasilakos, Separable Convolution Network With Dual-Stream Pyramid Enhanced Strategy for Speech Steganalysis. IEEE Trans. Inf. Forensics Secur. 18: 2737-2750 (2023)

50. Jibin Wu, Yansong Chua, Malu Zhang, Guoqi Li, Haizhou Li, Kay Chen Tan, A Tandem Learning Rule for Effective Training and Rapid Inference of Deep Spiking Neural Networks. IEEE Trans. Neural Networks Learn. Syst. 34(1): 446-460 (2023)

51. Xianghu Yue, Jingru Lin, Fabian Ritter Gutierrez, Haizhou Li, Self-Supervised Learning With Segmental Masking for Speech Representation. IEEE J. Sel. Top. Signal Process. 16(6): 1367-1379 (2022)

52. Hongqiang Du, Lei Xie, Haizhou Li, Noise-robust voice conversion with domain adversarial training. Neural Networks 148: 74-84 (2022)

53. Jibin Wu, Chenglin Xu, Xiao Han, Daquan Zhou, Malu Zhang, Haizhou Li, Kay Chen Tan, Progressive Tandem Learning for Pattern Recognition With Deep Spiking Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 7824-7840 (2022)

54. Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li, Emotional voice conversion: Theory, databases and ESD. Speech Commun. 137: 1-18 (2022)

55. Hongning Zhu, Kong Aik Lee, Haizhou Li, Discriminative speaker embedding with serialized multi-layer multi-head attention. Speech Commun. 144: 89-100 (2022)

56. Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li, Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask. IEEE Signal Process. Lett. 29: 782-786 (2022)

57. Zexu Pan, Xinyuan Qian, Haizhou Li, Speaker Extraction With Co-Speech Gestures Cue. IEEE Signal Process. Lett. 29: 1467-1471 (2022)

58. Haizhou Li, A Unique ICASSP 2022: During an Unusual Time [Conference Highlights]. IEEE Signal Process. Mag. 39(2): 159-160 (2022)

59. Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li, Selective Listening by Synchronizing Speech With Lips. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1650-1664 (2022)

60. Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li, Decoding Knowledge Transfer for Neural Text-to-Speech Training. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1789-1802 (2022)

61. Xiaoxue Gao, Chitralekha Gupta, Haizhou Li, Automatic Lyrics Transcription of Polyphonic Music With Lyrics-Chord Multi-Task Learning. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2280-2294 (2022)

62. Chitralekha Gupta, Haizhou Li, Masataka Goto, Deep Learning Approaches in Topics of Singing Information Processing. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2422-2451 (2022)

63. Zexu Pan, Meng Ge, Haizhou Li, USEV: Universal Speaker Extraction With Visual Cue. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3032-3045 (2022)

64. Enze Su, Siqi Cai, Longhan Xie, Haizhou Li, Tanja Schultz, STAnet: A Spatiotemporal Attention Network for Decoding Auditory Spatial Attention From EEG. IEEE Trans. Biomed. Eng. 69(7): 2233-2242 (2022)

65. Siqi Cai, Enze Su, Longhan Xie, Haizhou Li, EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention. IEEE Trans. Hum. Mach. Syst. 52(2): 256-266 (2022)

66. Malu Zhang, Jiadong Wang, Jibin Wu, Ammar Belatreche, Burin Amornpaisannon, Zhixuan Zhang, Venkata Pavan Kumar Miriyala, Hong Qu, Yansong Chua, Trevor E. Carlson, Haizhou Li, Rectified Linear Postsynaptic Potential Function for Backpropagation in Deep Spiking Neural Networks. IEEE Trans. Neural Networks Learn. Syst. 33(5): 1947-1958 (2022)

67. Jibin Wu, Qi Liu, Malu Zhang, Zihan Pan, Haizhou Li, Kay Chen Tan, HuRAI: A brain-inspired computational model for human-robot auditory interface. Neurocomputing 465: 103-113 (2021)

68. Rui Liu, Berrak Sisman, Yixing Lin, Haizhou Li, FastTalker: A neural text-to-speech architecture with shallow and group autoregression. Neural Networks 141: 306-314 (2021)

69. Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li, Factorized WaveNet for voice conversion with limited data. Speech Commun. 130: 45-54 (2021)

70. Tharshini Gunendradasan, Eliathamby Ambikairajah, Julien Epps, Vidhyasaharan Sethu, Haizhou Li, An adaptive transmission line cochlear model based front-end for replay attack detection. Speech Commun. 132: 114-122 (2021)

71. Bidisha Sharma, Xiaoxue Gao, Karthika Vijayan, Xiaohai Tian, Haizhou Li, NHSS: A speech and singing parallel database. Speech Commun. 133: 9-22 (2021)

72. Xinyuan Qian, Qi Liu, Jiadong Wang, Haizhou Li, Three-Dimensional Speaker Localization: Audio-Refined Visual Scaling Factor Estimation. IEEE Signal Process. Lett. 28: 1405-1409 (2021)

73. Rui Liu, Berrak Sisman, Feilong Bao, Jichen Yang, Guanglai Gao, Haizhou Li, Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 274-285 (2021)

74. Mingyang Zhang, Yi Zhou, Li Zhao, Haizhou Li, Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1290-1302 (2021)

75. Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li, Expressive TTS Training with Frame and Style Reconstruction Loss. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1806-1818 (2021)

76. Yi Zhou, Xiaohai Tian, Haizhou Li, Language Agnostic Speaker Embedding for Cross-Lingual Personalized Speech Generation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3427-3439 (2021)

77. Chen Zhang, Grandee Lee, Luis Fernando D'Haro, Haizhou Li, D-Score: Holistic Dialogue Evaluation Without Reference. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2502-2516 (2021)

78. Zihan Pan, Malu Zhang, Jibin Wu, Jiadong Wang, Haizhou Li, Multi-Tone Phase Coding of Interaural Time Difference for Sound Source Localization with Spiking Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2656-2670 (2021)

79. Chenglin Xu, Wei Rao, Jibin Wu, Haizhou Li, Target Speaker Verification with Selective Auditory Attention for Single and Multi-Talker Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2696-2709 (2021)

80. Berrak Sisman, Junichi Yamagishi, Simon King, and Haizhou Li, An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 132-157, 2021, doi: 10.1109/TASLP.2020.3038524

81. Rui Liu, Berrak Sisman, Feilong Bao, Jichen Yang, Guanglai Gao and Haizhou Li, Exploiting morphological and phonological features to improve prosodic phrasing for Mongolian speech synthesis, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020, doi: 10.1109/TASLP.2020.3040523

82. Rui Liu, Berrak Sisman, Feilong Bao, Guanglai Gao and Haizhou Li, Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS, IEEE Signal Processing Letters, vol. 27, pp. 1470-1474, 2020

83. Yi Zhou, Xiaohai Tian and Haizhou Li, Multi-Task WaveRNN with an Integrated Architecture for Cross-lingual Voice Conversion, IEEE Signal Processing Letters, vol. 27, pp. 1310-1314, 2020

84. Changhuai You and Jichen Yang, Device Feature Extraction Based on Parallel Neural Network Training for Replay Spoofing Detection, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 28, pp. 2308-2318, 2020

85. Mingyang Zhang, Berrak Sisman, Li Zhao and Haizhou Li, DeepConversion: Voice conversion with limited parallel training data, Speech Communication, vol. 122, pp. 31-43, 2020

86. Chenglin Xu, Wei Rao, Eng Siong Chng and Haizhou Li, SpEx: Multi-Scale Time Domain Speaker Extraction Network, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 1370-1384, 2020

87. Malu Zhang, Xiaoling Luo, Jibin Wu, Yi Chen, Ammar Belatreche, Zihan Pan, Hong Qu, and Haizhou Li, An Efficient Threshold-Driven Aggregate-Label Learning Algorithm for Multimodal Information Processing, IEEE Journal of Selected Topics in Signal Processing, 14(3), pp. 592-602, March 2020, doi: 10.1109/JSTSP.2020.2983547

88. Malu Zhang, Jibin Wu, Ammar Belatreche, Zihan Pan, Xiurui Xie, Yansong Chua, Guoqi Li, Hong Qu and Haizhou Li, Supervised Learning in Spiking Neural Networks with Synaptic Delay-Weight Plasticity, Neurocomputing, vol. 409, pp. 103-118, October 2020

89. Jibin Wu, Emre Yılmaz, Malu Zhang, Haizhou Li and Kay Chen Tan, Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition, Frontiers in Neuroscience, 14(199), March 2020

90. Zihan Pan, Yansong Chua, Jibin Wu, Malu Zhang, Haizhou Li and Eliathamby Ambikairajah, An Efficient and Perceptually Motivated Auditory Neural Encoding and Decoding Algorithm for Spiking Neural Networks, Frontiers in Neuroscience, 13(1420), January 2020

91. Jichen Yang, Rohan Kumar Das and Haizhou Li, Significance of Subband Features for Synthetic Speech Detection, IEEE Transactions on Information Forensics and Security, vol. 15, pp. 2160-2170, 2020, doi: 10.1109/TIFS.2019.2956589

92. Chitralekha Gupta, Haizhou Li and Ye Wang, Automatic Leaderboard: Evaluation of Singing Quality Without a Standard Reference, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 13-26, 2020, doi: 10.1109/TASLP.2019.2947737

93. Qiang Yu, Haizhou Li, Kay Chen Tan, Spike Timing or Rate? Neurons Learn to Make Decisions for Both Through Threshold-Driven Plasticity, IEEE Trans. Cybernetics 49(6): 2178-2189, 2019

94. Berrak Sisman, Mingyang Zhang, Haizhou Li, Group Sparse Representation with WaveNet Vocoder Adaptation for Spectrum and Prosody Conversion, IEEE/ACM Trans. Audio, Speech & Language Processing 27(6): 1085-1097, 2019

95. Karthika Vijayan, Haizhou Li, Tomoki Toda, Speech-to-Singing Voice Conversion: The Challenges and Strategies for Improving Vocal Conversion Processes, IEEE Signal Processing Magazine. 36(1): 95-102, 2019

96. Luis Fernando D'Haro, Rafael E. Banchs, Chiori Hori, Haizhou Li: Automatic evaluation of end-to-end dialog systems with adequacy-fluency metrics, Computer Speech & Language 55: 200-215, 2019

97. Chong Zhang, Kay Chen Tan, Haizhou Li, Geok Soon Hong, A Cost-Sensitive Deep Belief Network for Imbalanced Classification, IEEE Transactions on Neural Networks and Learning Systems. 30(1): 109-122, 2019

98. Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li, Re-ranking spoken term detection with acoustic exemplars of keywords. Speech Communication 104: 12-23, 2018

99. Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang, Generalizing I-Vector Estimation for Rapid Speaker Recognition. IEEE/ACM Trans. Audio, Speech & Language Processing 26(4): 749-759, 2018

100. Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li, Using language cluster models in hierarchical language identification. Speech Communication 100: 30-40, 2018

101. Kaavya Sriskandaraja, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li, Front-End for Antispoofing Countermeasures in Speaker Verification: Scattering Spectral Decomposition, IEEE Journal of Selected Topics in Signal Processing 11(4): 632-643, 2017

102. Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection, IEEE Journal of Selected Topics in Signal Processing 11(8): 1329-1339, 2017

103. Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li, An Exemplar-Based Approach to Frequency Warping for Voice Conversion, IEEE/ACM Trans. Audio, Speech & Language Processing 25(10): 1863-1876, 2017

104. Hongjie Chen, Lei Xie, Cheung-Chi Leung, Xiaoming Lu, Bin Ma, Haizhou Li, Modeling Latent Topics and Temporal Distance for Story Segmentation of Broadcast News, IEEE/ACM Trans. Audio, Speech & Language Processing 25(1): 108-119, 2017

105. Jun Hu, Huajin Tang, Kay Chen Tan, Haizhou Li, How the Brain Formulates Memory: A Spatio-Temporal Model, IEEE Computational Intelligence Magazine, 11(2): 56-68, 2016

106. Xiong Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng, Haizhou Li, Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation, EURASIP Journal Adv. Sig. Proc. 2016: 4, 2016

107. Zhizheng Wu, Haizhou Li, On the study of replay and voice conversion attacks to text-dependent speaker verification, Multimedia Tools Appl. 75(9): 5311-5327, 2016

108. Nancy F. Chen, Darren Wee, Rong Tong, Bin Ma, Haizhou Li, Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL. Speech Communication 84: 46-56, 2016

109. Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Soren Holdt Jensen, Total Variability Modeling Using Source-Specific Priors. IEEE/ACM Trans. Audio, Speech & Language Processing 24(3): 504-517, 2016

110. Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, Haizhou Li, Feature Adaptation Using Linear Spectro-Temporal Transform for Robust Speech Recognition. IEEE/ACM Trans. Audio, Speech & Language Processing 24(6): 1006-1019, 2016

111. Qiang Yu, Rui Yan, Huajin Tang, Kay Chen Tan, Haizhou Li, A Spiking Neural Network System for Robust Sequence Recognition, IEEE Transactions on Neural Networks and Learning Systems, 27(3): 621-635, 2016, doi: 10.1109/TNNLS.2015.2416771

112. Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Eng Siong Chng, Haizhou Li, Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization. Signal Processing Systems 82(2): 151-161, 2016

113. Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai, Exploration of Local Variability in Text-Independent Speaker Verification, Signal Processing Systems 82(2): 217-228, 2016

114. Dau-Cheng Lyu, Tien Ping Tan, Eng Siong Chng, Haizhou Li: Mandarin-English code-switching speech corpus in South-East Asia: SEAME. Language Resources and Evaluation 49(3): 581-600, 2015

115. Chang Huai You, Haizhou Li, and Kong-Aik Lee, Relevance factor of maximum a posteriori adaptation for GMM-NAP-SVM in speaker and language recognition, Computer Speech and Language, vol.30, no.1, pp.116-134, 2015

116. Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, Context-dependent Phone Mapping for Acoustic Modeling of Under-resourced Languages, International Journal of Asian Language Processing, vol.23, no.1, pp.21-33, 2015

117. Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, Acoustic Segment Modeling with Spectral Clustering Methods, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol.23, no.2, pp.264-277, 2015

118. Rafael E. Banchs, Luis F. D'Haro, and Haizhou Li, Adequacy-Fluency Metrics: Evaluating MT in the Continuous Space Model Framework, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol.23, no.3, pp.472-482, 2015

119. Tze Yuang Chong, Rafael E. Banchs, Eng Siong Chng, Haizhou Li, Decoupling Word-Pair Distance and Co-occurrence Information for Effective Long History Context Language Modeling, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(7): 1221-1232, 2015

120. Haizhou Li, Inaugural editorial: Embracing Opportunities for Growth, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(1): 5-6, 2015

121. Jonathan William Dennis, Tran Huy Dat, Haizhou Li: Generalized Hough Transform for Speech Pattern Classification. IEEE/ACM Transactions on Audio, Speech & Language Processing 23(11): 1963-1972, 2015

122. Zhizheng Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou Li, Spoofing and countermeasures for speaker verification: a survey, Speech Communication, vol. 66, pp. 130-153, 2015

123. Van Hai Do, Xiong Xiao, Eng Siong Chng, Haizhou Li, Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages, IEICE Transactions 97-D(2): 285-295, 2014

124. Miaolong Yuan, Huajin Tang, Haizhou Li, Real-Time Keypoint Recognition Using Restricted Boltzmann Machine, IEEE Trans. Neural Netw. Learning Syst. 25(11): 2119-2126, 2014

125. Zhizheng Wu, Haizhou Li, Voice conversion versus speaker verification: an overview, APSIPA Transactions on Signal and Information Processing, vol. 3, e17, doi: 10.1017/ATSIP.2014.17, 2014

126. Zhizheng Wu, Eng Siong Chng, Haizhou Li, Exemplar-based voice conversion using joint nonnegative matrix factorization, Multimedia Tools and Applications, Springer, 2014

127. Zhizheng Wu, Tuomas Virtanen, Eng Siong Chng, Haizhou Li, Exemplar-based sparse representation with residual compensation for voice conversion, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 22, No. 10, pp. 1506-1521, 2014

128. Anthony Larcher, Kong Aik Lee, Bin Ma, Haizhou Li, Text-dependent speaker verification: Classifiers, databases and RSR2015, Speech Communication, vol. 60, May 2014, pp. 56-77

129. Qiang Yu, Huajin Tang, Kay Chen Tan, and Haizhou Li, Precise-Spike-Driven Synaptic Plasticity: Learning Hetero-Association of Spatiotemporal Spike Patterns, PLoS ONE, 8(11): e78318, 2013, doi: 10.1371/journal.pone.0078318

130. Qiang Yu, Huajin Tang, Kay Chen Tan, Haizhou Li: Rapid Feedforward Computation by Temporal Encoding and Learning With Spiking Neurons. IEEE Trans. Neural Networks Learning Systems, 24(10): 1539-1552, 2013

131. S. J. Wright, D. Kanevsky, L. Deng, X. He, G. Heigold, and H. Li, Optimization Algorithm and Applications for Speech and Language Processing, IEEE Transactions on Audio, Speech and Language Processing, 21(11):2231-2243, 2013

132. Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, Haizhou Li, Shifted-Delta MLP Features for Spoken Language Recognition. IEEE Signal Process. Lett. 20(1): 15-18, 2013

133. Jun Hu, Huajin Tang, Kay Chen Tan, Haizhou Li and Luping Shi, A Spike-Timing Based Integrated Model for Pattern Recognition. Neural Computation, vol. 25, no. 2, pp. 450-472, 2013

134. Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, Haizhou Li: Spoken Language Recognition with Prosodic Features. IEEE Transactions on Audio, Speech & Language Processing, 21(9): 1841-1853, 2013

135. V. Hautamaki, T. Kinnunen, F. Sedlak, Kong Aik Lee, Bin Ma, and Haizhou Li, Sparse Classifier Fusion for Speaker Verification, IEEE Transactions on Audio, Speech and Language Processing, 21(8): 1622-1631, August 2013

136. Douglas D. O'Shaughnessy, Li Deng, Haizhou Li: Speech Information Processing: Theory and Applications [Scanning the Issue]. Proceedings of the IEEE vol. 101, No. 5 pp. 1034-1037, May 2013

137. Haizhou Li, Kong Aik Lee, and Bin Ma, Spoken Language Recognition: From Fundamentals to Practice, Proceedings of the IEEE, vol. 101, No. 5, pp. 1136-1159, May 2013

138. Jiali Yu, Huajin Tang, Haizhou Li, Dynamics Analysis of a Population Decoding Model, IEEE Transactions on Neural Networks and Learning Systems, vol. 24, No. 3, 2013

139. Jiali Yu, Huajin Tang, Haizhou Li, Luping Shi, Dynamical properties of continuous attractor neural network with background tuning, Neurocomputing, vol. 99, pp. 439-447, 2013

140. Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, Mixture of factor analyzers using priors from non-parallel speech for voice conversion, IEEE Signal Processing Letters, 19(12), pp. 914-917, 2012

141. Omid Dehzangi, Bin Ma, Eng-Siong Chng and Haizhou Li, Discriminative Feature Extraction for Speech Recognition Using Continuous Output Codes, Pattern Recognition Letters, 33 (2012), pp. 1703-1709.

142. Liyuan Li, Shuicheng Yan, Xinguo Yu, Yeow Kee Tan, and Haizhou Li, Robust Multiperson Detection and Tracking for Mobile Service and Social Robots, IEEE Transactions on Systems, Man, and Cybernetics - PART B: CYBERNETICS, vol. 42, No. 5, 2012

143. T. Kinnunen, R. Saeidi, F. Sedlak, Kong Aik Lee, J. Sandberg, M. Hansson-Sandsten, Haizhou Li, Low-Variance Multitaper MFCC Features: a Case Study in Robust Speaker Verification, IEEE Transactions on Audio, Speech and Language Processing, 20(7): 1990-2001, September 2012

144. Andreea Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li, See Swee Lan: Making Social Robots More Attractive: The Effects of Voice Pitch, Humor and Empathy. International Journal of Social Robotics 5(2): 171-191 (2013)

145. Wenliang Chen, Jun'ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, Haizhou Li, Bitext Dependency Parsing With Auto-Generated Bilingual Treebank, IEEE Transactions on Audio, Speech and Language Processing, 20(5): 1461-1472 (2012)

146. Xiaoxuan Wang, Lei Xie, Mimi Lu, Bin Ma, Eng Siong Chng, Haizhou Li, Broadcast News Story Segmentation Using Conditional Random Fields and Multimodal Features. IEICE Transactions on Information and Systems, vol. E95-D, No. 5, pp. 1206-1215, 2012

147. Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, Haizhou Li: Selective Gammatone Envelope Feature for Robust Sound Event Recognition. IEICE Transactions 95-D (5): 1229-1237, 2012

148. Rui Yan, Keng Peng Tee, Yuanwei Chua, Haizhou Li, Huajin Tang: Gesture Recognition Based on Localist Attractor Networks with Application to Robot Control, IEEE Computational Intelligence Magazine, vol. 7, No. 1, pp. 64-74, 2012

149. Jin-Shea Kuo, Haizhou Li: Learning regional transliteration variants, Information Processing and Management, 48(1): 154-169, 2012

150. Tin Lay Nwe, Hanwu Sun, Bin Ma, Haizhou Li, Speaker Clustering and Cluster Purification Methods for RT07 and RT09 Evaluation Meeting Data, IEEE Transactions on Audio, Speech and Language Processing, vol. 20, No. 2, pp. 461-473, 2012

151. Haizhou Li, John-John Cabibihan, Yeow Kee Tan: Towards an Effective Design of Social Robots, International Journal of Social Robotics, 3(4), pp. 333-335, November 2011

152. Sakriani Sakti, Michael Paul, Andrew Finch, Shinsuke Sakai, Thang Tat Vu, Noriyuki Kimura, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou Li, A-STAR: Toward Translating Asian Spoken Languages, Computer Speech and Language, vol. 27, No. 2, pp. 509 - 527, 2013

153. Huajin Tang, Haizhou Li, Book Review: Information Theoretic Learning: Renyi's Entropy and Kernel Perspectives, IEEE Computational Intelligence Magazine, vol. 6, No. 3, August 2011

154. Eliathamby Ambikairajah, Haizhou Li, Liang Wang, Bo Yin, and Vidhyasaharan Sethu, Language Identification: A Tutorial, IEEE Circuits and Systems Magazine, vol. 11, No. 2, pp.82 - 108, 2011

155. Huajin Tang, Haizhou Li, and Zhang Yi, Online learning and stimulus-driven responses of neurons in visual cortex, Cognitive Neurodynamics, vol. 5, no. 1, pp. 77-85, 2011

156. Omid Dehzangi, Bin Ma, Eng-Siong Chng and Haizhou Li, Error Corrective Fusion of Classifier Scores for Spoken Language, IEICE Transactions on Information and Systems, Vol. E94-D, No.12, pp.2503-2512, 2011

157. Deyi Xiong, Min Zhang, Haizhou Li, A Maximum Entropy Segmentation Model for Statistical Machine Translation, IEEE Transactions on Audio, Speech and Language Processing, 19 (8), November 2011

158. Huy Dat Tran, Haizhou Li, Sound Event Recognition with Probabilistic Distance SVMs, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, No. 6, pp 1556 - 1568, 2011

159. Jonathan Dennis, Huy Dat Tran, Haizhou Li, Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions, IEEE Signal Processing Letters, vol. 18, No. 2, pp. 130-133, February 2011

160. Haizhou Li, Bin Ma, TechWare: Speaker and Spoken Language Recognition Resources, IEEE Signal Processing Magazine, vol. 27, No. 6, pp. 139-142, November 2010

161. Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, and Khe Chai Sim, Using Discrete Probabilities with Bhattacharyya Measure for SVM-based Speaker Verification, IEEE Transactions on Audio, Speech and Language Processing, 19(4), pp.861 - 870, May 2011

162. Deyi Xiong, Min Zhang, Aiti Aw, Haizhou Li, Linguistically Annotated Reordering Evaluation and Analysis, Computational Linguistics, vol. 36, No. 3, pp 535-568, 2010

163. Donglai Zhu, Bin Ma, Haizhou Li, Speaker Verification with Feature-Space MAPLR Parameters, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, No. 3, pp 505-515, March 2011

164. Huajin Tang, Haizhou Li, Zhang Yi, A Discrete-Time Neural Network for Optimization Problems with Hybrid Constraints, IEEE Transactions on Neural Networks, vol. 21, no. 7, pp. 1184-1189, 2010

165. Namunu C. Maddage, Haizhou Li, Beat Space Segmentation and Octave Scale Cepstral Feature for Sung Language Recognition in Pop Music, ACM Transactions on Multimedia Computing, Communications and Applications (TOMCCAP), vol. 7 Issue 4, November 2011, Article No. 37

166. Lei Wang, Eng Siong Chng, Haizhou Li, A Tree-Construction Search Approach for Multivariate Time Series Motifs Discovery, Pattern Recognition Letters, vol. 31, No. 9, pp 869-875, 2010

167. Huajin Tang, Haizhou Li, and Rui Yan, Memory Dynamics in Attractor Networks with Saliency Weights, Neural Computation, 22(7), pp. 1899-1926, July 2010

168. Chang Huai You, Kong Aik Lee, Haizhou Li, GMM-SVM Kernel with a Bhattacharyya-Based Distance for Speaker Recognition, IEEE Transactions on Audio, Speech and Language Processing, vol. 18, No. 6, pp. 1300-1312, 2010

169. Tomi Kinnunen, Haizhou Li, An Overview of Text-Independent Speaker Recognition: from Features to Supervectors, Speech Communication 52 (1), 2010, pp. 12-40 (Speech Communication Most Cited Article 2007-2013)

170. Xiong Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li, Chin-Hui Lee, A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition, IEEE Transactions on Audio, Speech and Language Processing, vol. 18, No. 6, pp. 1158-1169, 2010

171. Namunu C. Maddage, Khe Chai Sim, Haizhou Li, Word Level Automatic Alignment of Music and Lyrics using Vocal Synthesis, ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP), vol. 6, No. 3, 2010

172. Huy Dat Tran, Haizhou Li, Jump Function Kolmogorov for Audio Classification in Noise-mismatch Conditions, IEEE Transactions on Signal Processing, vol. 57, No. 8, pp. 2908-2918, 2009

173. Tee Kiah Chia, Khe Chai Sim, Haizhou Li and Hwee Tou Ng, Statistical Lattice-Based Spoken Document Retrieval, ACM Transactions on Information Systems, vol. 28, No. 1, 2010

174. Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, A Target-Oriented Phonotactic Front-end for Spoken Language Recognition, IEEE Transactions on Audio, Speech and Language Processing, vol. 17, No. 7, pp. 1335-1347, 2009

175. Donglai Zhu, Haizhou Li, Bin Ma, and Chin-Hui Lee, Optimizing the Performance of Spoken Language Recognition with Discriminative Training, IEEE Transactions on Audio, Speech and Language Processing, vol. 16, No. 8, pp.1642-165, 2008

176. Chang Huai You, Kong-Aik Lee, and Haizhou Li, An SVM Kernel with GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition, IEEE Signal Processing Letters, vol. 16, No. 1, pp. 49-52, 2009

177. Xiong Xiao, Eng Siong Chng, Haizhou Li, Normalization of the Speech Modulation Spectra for Robust Speech Recognition, IEEE Transactions on Audio, Speech and Language Processing, vol. 16, No. 8, pp.1662-1674, 2008

178. Haizhou Li, Jin-Shea Kuo, Jian Su, Chih-Lung Lin, Mining Live Transliterations using Incremental Learning Algorithms, International Journal of Computer Processing Of Languages, vol. 21, No. 2, pp. 183-203, 2008

179. Khe Chai Sim and Haizhou Li, On Acoustic Diversification Front-end for Spoken Language Identification, IEEE Transactions on Audio, Speech and Language Processing, vol. 16, No. 5, pp. 1029-1037, 2008

180. Bin Ma, Haizhou Li, and Rong Tong, Spoken Language Recognition with Ensemble Classifiers, IEEE Transactions on Audio, Speech and Language Processing, vol. 15, No. 7, 2007

181. Jin-Shea Kuo, Haizhou Li, and Ying-Kuei Yang, Active Learning for Constructing Transliteration Lexicons from the Web, Journal of the American Society for Information Science and Technology, vol. 59, No. 1, 2008

182. Xiong Xiao, Eng Siong Chng, and Haizhou Li, Temporal structure normalization of speech feature for robust speech recognition, IEEE Signal Processing Letters, vol. 14, No. 7, 2007

183. Jin-Shea Kuo, Haizhou Li, Ying-Kuei Yang, A Phonetic Similarity Model for Automatic Extraction of Transliteration Pairs, ACM Transactions on Asian Language Information Processing, vol. 6, Issue 2, September, 2007

184. Tin Lay Nwe and Haizhou Li, Exploring Vibrato-Motivated Acoustic Features for Singer Identification, IEEE Transactions on Audio, Speech and Language Processing, vol. 15, No. 2, 2007

185. Haizhou Li, Bin Ma, and Chin-Hui Lee, A Vector Space Modeling Approach to Spoken Language Identification, IEEE Transactions on Audio, Speech and Language Processing, vol. 15, No. 1, 2007

Books and Book Chapters

1. Haizhou Li, Kar-Ann Toh, Liyuan Li, Advanced Topics in Biometrics, World Scientific, 2011

2. Haizhou Li, Bin Ma, and Chin-Hui Lee, Vector-based Spoken Language Classification, in Springer Handbook of Speech Processing, Jacob Benesty, M. Mohan Sondhi, Arden Huang (editors), Springer, 2007

3. Chin-Hui Lee, Haizhou Li, Lin-shan Lee, Renhua Wang, and Qiang Huo (editors), Advances in Chinese Spoken Language Processing, World Scientific, 2007

4. Shuzhi Sam Ge, Haizhou Li, John-John Cabibihan and Yeow Kee Tan (editors), Social Robotics, Springer Lecture Notes in Artificial Intelligence 6414, 2010

5. Qiang Huo, Bin Ma, Eng Siong Chng, and Haizhou Li (editors), Chinese Spoken Language Processing, Springer Lecture Notes in Artificial Intelligence 4274, 2006

6. Yinglin Yu, Haizhou Li, Neural Networks and Signal Analysis, South China University of Technology Press, 1996

Teaching

1. CSC3020 Machine Learning (CUHK-SZ)

2. EE2211 Introduction to Machine Learning (NUS)

3. EE2012 Analytical Methods in Electrical and Computer Engineering (NUS)

4. EE6733 Advanced Topics on Vision and Machine Learning (NUS)

Ph.D. Students

1. Jiacheng ZHANG (CUHKSZ-SLAI), 09/2025 -

2. Shaochen ZHANG (CUHKSZ-SLAI), 09/2025 -

3. Shuhan ZHANG (CUHKSZ-SLAI), 09/2025 -

4. Sirui LI (CUHKSZ), 09/2025 -

5. Youcun ZHENG (CUHKSZ), 09/2025 -

6. Fan BU (CUHKSZ-SLAI), 09/2025 -

7. Chenyu YANG, 09/2024 -

8. Ruicong WANG, 09/2024 -

9. Kuang WANG, 09/2024 -

10. Qibing BAI, 09/2023 -

11. Zhijun LIU, 09/2023 -

12. Zheyuan LIN, 09/2023 -

13. Dedimuni Dashanka Nadeeshan De Silva (U Bremen), 2023 -

14. Saurav Pahuja (U Bremen), 2022 -

15. Wenxuan WU (CUHK), 09/2022 -

16. Junyi AO, 09/2022 -

17. Sho INOUE, 09/2022 -

18. Mehmet Sinan YILDIRIM (NUS), 01/2022 -

19. Jingru LIN (NUS), 08/2022 -

20. Yidi JIANG (NUS), 08/2021 -

21. Zeyang SONG (NUS), 08/2021 -

22. Yi MA (NUS), 08/2020 -

23. Junchen LU (NUS), 08/2020 -

24. Marvin Borsdorf (U Bremen), Speech Separation for Monolingual and Multilingual Cocktail Party Scenarios, 2025.10, web, thesis

25. Wupeng WANG (NUS), Domain-Invariant Speech Separation in Real Scenarios, 2025.06, web, thesis

26. Yiming CHEN (NUS), Semi-Supervised and Adversarial Data Synthesis for Language Modeling, 2025, web, thesis

27. Victor Li Chuang (NUS), Towards Holistic and Proactive Conversational Recommender Systems, 2025, web, thesis

28. Tianchi LIU (NUS), Advances in Robust and Practical Speaker Verification, 2024, web, thesis

29. Qu YANG (NUS), Speech Processing Using Spiking Neural Networks, 2024, web, thesis

30. Xuehao ZHOU (NUS), Cross-Regional Text-to-Speech Synthesis with Language and Accent Diversity, 2024 (Huawei, Singapore) web, thesis

31. Jiadong WANG (NUS), Cross-Modality Complementarity for Audio-Visual Speech Recognition, 2024 (TUM Germany) web, thesis

32. Xianghu YUE (NUS), Self-Supervised Modeling for Multimodal Understanding, 2024 (NUS Singapore) web, thesis

33. Zexu PAN (NUS), Look Attentively to Hear: Audio-Visual Speaker Extraction, 2023 (Alibaba DAMO Academy, Singapore) web, thesis

34. Qinyi WANG (NUS), Code-Switch Detection Techniques and Language Modeling Strategies for Automatic Speech Recognition, 2023 (Huawei, Singapore) web, thesis

35. Kun ZHOU (NUS), Emotion Modeling for Speech Generation, 2023 (Alibaba DAMO Academy, Singapore) web, thesis

36. Chen ZHANG (NUS), Self-Supervised Modeling for Open-Domain Dialogue Evaluation, 2023 web, thesis

37. Ruijie TAO (NUS), Audio-Visual Active Speaker Detection and Recognition, 2023 web, thesis

38. Nana HOU (NTU), Mismatch Problem in Deep-learning based Speech Enhancement, 2023 (Zoom, Singapore) web, thesis

39. Xiaoxue GAO (NUS), Automatic Lyrics Transcription of Polyphonic Music, 2022 (A*STAR, Singapore) web, thesis

40. Zihan CHEN (SUTD), Adaptive Communication-efficient Federated Learning on Real-world Data, 2022 web, thesis

41. Yi ZHOU (NUS), Cross-Lingual Voice Conversion, 2021 (Tomato.ai, US) web, thesis

42. Grandee LEE (NUS), Cross-Lingual Language Modeling, Methods and Applications, 2021 (Singapore University of Social Sciences, Singapore) web, thesis

43. Zihan PAN (NUS), Neural Encoding of Auditory Signals in Spiking Neural Networks, 2020 (A*STAR, Singapore) web

44. Jibin WU (NUS), Auditory information processing using spiking neural networks, 2020 (The Hong Kong Polytechnic University, Hong Kong SAR) web, thesis

45. Chenglin XU (NTU), Single channel multi-talker speech separation with deep learning, 2020 (Kuaishou, China) web, thesis

46. Paul Yaozhu CHAN (NUS), The psychoacoustics and synthesis of singing harmony, 2020 (A*STAR, Singapore) web, thesis

47. Berrak SISMAN (NUS), Machine learning for limited data voice conversion, 2020 (University of Texas at Dallas, US) web, thesis

48. Malu ZHANG (UESTC), On the study of spiking machine learning algorithms, 2019 (University of Electronic Science and Technology of China, China)

49. Chitralekha GUPTA (NUS), Comprehensive evaluation of singing quality, 2019 (NUS Singapore) web, thesis

50. Nicole MIRNIG (Salzburg), Essentials of robot feedback: On developing a taxonomy for human-robot interaction, 2019 (University of Salzburg, Austria), thesis

51. Wenda CHEN (UIUC), Modeling phones, keywords, topics and intents in spoken languages, 2019 (A*STAR, Singapore) web

52. Van Tung PHAM (NTU), Robust spoken term detection using partial search and re-scoring hypothesized detections techniques, 2018, thesis

53. Tze Yuang CHONG (NTU), Exploiting long context using joint distance and occurrence information for language modeling, 2018, thesis

54. Duc Hoang Ha NGUYEN (NTU), Feature-based robust techniques for speech recognition, 2017, thesis

55. Chong ZHANG (NUS), Computational intelligence in diagnostic and prognostic applications, 2017, thesis

56. Van Hai DO (NTU), Acoustic modeling for speech recognition under limited training data conditions, 2015 (Thuyloi University, Vietnam), thesis

57. Zhizheng WU (NTU), Spectral mapping for voice conversion, 2015 (The Chinese University of Hong Kong, Shenzhen), thesis

58. Trung Hieu NGUYEN (NTU), Speaker diarization in meetings domain, 2014, thesis

59. Lei WANG (NTU), Audio pattern discovery and retrieval, 2012, thesis

60. Rong TONG (NTU), Towards a high performance phonotactic features for spoken language recognition, 2012 (Singapore Institute of Technology, Singapore), thesis

61. Omid DEHZANGHI (NTU), Discriminative feature extraction for speech recognition using continuous output codes, 2012, thesis

62. Xiong XIAO (NTU), Robust speech features and acoustic models for speech recognition, 2009, thesis

63. Tee Kiah CHIA (NUS), Lattice-based statistical spoken document retrieval, 2009, thesis

64. Hendra SETIAWAN (NUS), Reordering in statistical machine translation: a function word, syntax-based approach, 2008, thesis

MPhil Students

1. Rui KE (2025-)

2. Yihang LIN (2024-)

Project Acknowledgement

1. 2023.01.01 - 2026.12.31: National Natural Science Foundation of China (Grant No. 62271432)

2. 2024.02.06 - 2026.02.05: Shenzhen Science and Technology Program (Shenzhen Key Laboratory, Grant No. ZDSYS20230626091302006)

3. 2022.10.28 - 2025.10.31: Shenzhen Science and Technology Research Fund (Fundamental Research Key Project, Grant No. JCYJ20220818103001002)

4. 2024.09.01 - 2029.08.31: Program for Guangdong Introducing Innovative and Entrepreneurial Teams (Grant No. 2023ZT10X044)

5. 2019 - Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy (University Allowance, EXC 2077, University of Bremen).

6. 2024 - Hearable-centered assistance: From sensor to participation - Hearaz (GRK 2969) funded by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)

Postal Address

1. School of Artificial Intelligence, Shenzhen Research Institute of Big Data, The Chinese University of Hong Kong, Shenzhen, Guangdong 518172, China

2. Department of Electrical and Computer Engineering, National University of Singapore, Singapore 117583

3. Machine Listening Lab, University of Bremen, 28359 Bremen, Germany