Conference Program

Lecture rooms:

December 5 2017


Time

Activity

7:30 – 8:45

Registration

8:45 – 9:00

Opening

9:00-10:00

Keynote Speech I
Title: Making speech tangible for a better understanding of human speech communication
Speaker: Prof Hideki Kawahara, Emeritus Professor, Center for Innovative Research and Liaison, Wakayama University
Chair: Prof Haizhou Li, National University of Singapore

10:00 – 10:30

Coffee Break

10:30 – 12:00

S1: Machine learning for natural language I (6)

Chair: Prof. Qi Su

S2: Speech recognition and synthesis (6)

Chair: Dr. Yanfeng Lu

S3: Language learning (6)

Chair:Prof. Yanping Li

 12:00 – 13:30

Lunch break

13:30 – 15:15

S4: Question answering (7)

Chair: Dr. Shuangyong Song

S5: Word Embedding (7)

Chair: Prof. Sanath Jayasena

S6: Linguistics (6)

Chair: Prof. Shili Ge

15:15 – 15:45

Coffee break

15:45 – 17:30

S7: Neural machine translation (7)

Chair: Dr. Jun Lang

S8: Speech processing (7)

Chair: Dr. Lei Wang

S9: Under-resourced language studies (6)

Chair: Prof. Yan Li

18:00 - 20:00

Dinner and night tour

December 6 2017

Time

Activity

    8:00 – 9:00

Registration

9:00– 10:00

Keynote Speech II
Title: Empirical Adventures of a Retrieval-based Chatbot
Speaker: Dr Rafael E. Banchs, Scientist, Institute for Infocomm Research (I2R), A-STAR, Singapore. 
Chair: Prof Yue Zhang, Singapore University of Technology and Design

10:00 – 10:30

Coffee break

10:30 – 12:00

S10: Text mining I (6)

Chair: Prof. Xia Li

S11: Machine learning for natural language II (6)

Chair: Prof. Kazuhide Yamamoto

 12:00 – 13:30

Lunch break

13:30 – 15:15

S12: Text mining II (7)

Chair: Dr. Kui Wu

S13: Word sense disambiguation  (7)

Chair: Dr. Ridong Jiang

15:15 – 15:45

Coffee break

15:45 – 17:15

S14: NLP applications (6)

Chair: Dr. Zhengchen Zhang

S15: Sentiment Analysis (6)

Chair: Dr. Xuancong Wang

18:00 - 21:00

Banquet

December 7 2017


Time

Activity

9:00- 17:00

Networking and City tour

Detailed program:


Time

Session

ID

Paper Title

Authors

 

Dec 5

10:30 – 12:00

S1:
Machine learning for natural language I

Chair: Prof. Qi Su

Peking University, China

23

Joint Extraction of Argument Components and Relations

Yang Du, Minglan Li and Mengxue Li

41

Joint Learning of Contextal and Global Features for Named Entity Disambiguation

Bo Ma, Tonghai Jiang, Yating Yang, Xi Zhou and Lei Wang

58

Mining Tibetan-Chinese Bilingual Entities from Wikipedia

Tao Jiang, Hongzhi Yu, Xiangzhen He and Xianghe Meng

94

Dynamic topic mining for Microblog fused with user’s behavior and time window

Fei Wu, Zhuo Wang, Zhengtao Yu, Liren Wang and Feng Zhou

113

Corpus for the Legal Information Processing System (CLIPS): a Chinese legal corpus annotated with discourse information

Hong Wang and Yunfeng Ge

148

Towards Improving the Performance of Chat Oriented Dialogue System

Ridong Jiang and Rafael E. Banchs

 

Dec 5

10:30 – 12:00

 

S2: Speech recognition and synthesis

 

Chair: Dr. Yanfeng Lu

Institute for Infocomm Research, Singapore

102

Investigating Multi-task Learning for Automatic Speech Recognition with Code-switching between Mandarin and English

Xiao Song, Yuexian Zou, Shilei Huang, Shaobin Chen and Yi Liu

24

Isolated Digit Filipino Speech Recognition through Spectrogram Image Classification: Towards Application in a Disaster Preparedness Participatory Toolkit

Julie Ann Salido, Nathaniel Oco, Rachel Roxas, Emmanuel Malaay, Michael Simora and Ronald John Cabatic

52

Transfer learning for children’s speech recognition

Rong Tong, Lei Wang and Bin Ma

71

Multimodal Learning using 3D Audio-Visual Data for Audio-Visual Speech Recognition

Rongfeng Su, Wang Lan and Xunying Liu

90

On the analysis and evaluation of prosody conversion techniques

Berrak Sisman, Grandee Lee, Haizhou Li and Kay Chen Tan

118

On Some Problems about the Text in Mongolian Speech Synthesis

Qi Bailing

 

 

Dec 5

10:30 – 12:00

S3:
Language learning

Chair:Prof. Li Yanping

Nanjing University of Posts and Telecommunications

22

Improving Pronunciation Erroneous Tendency Detection with Convolutional Long Short-Term Memory

Longfei Yang, Yingming Gao, Yanlu Xie and Jinsong Zhang

57

Correcting Misuse of Japanese Visually Similar Characters

Youichiro Ogawa and Kazuhide Yamamoto

59

Experimental Research of Mandarin Diphthongs Produced by Uyghur Learners

Yultuz Rapkat, Gulnur Arkin and Askar Hamdulla

79

Chinese teaching material readability assessment with contextual information

Hao Liu, Si Li, Jianbo Zhao, Zuyi Bao and Xiaopeng Bai

119

A New Exploration of Diagrammatic Treebank in International Chinese Teaching

Yinbing Zhang, Jihua Song, Weiming Peng, Dongdong Guo and Canran Sun

125

Evaluating the Intended Learning Outcome of Ancient Chinese Teaching Materials with Teaching-oriented Corpus

Bing Qiu

 

Dec 5

13:30 – 15:15

S4: Question answering

Chair: Dr. Shuangyong Song

Alibaba Group

12

Information Entropy-Informed Sentence Representation for Question Classification

Jin Gao, Miao Li, Lei Chen, Jinhua Du and Rongqiang Ma

14

Intension Classification of User Queries in Intelligent Customer Service System

Shuangyong Song, Haiqing Chen and Zhiwei Shi

48

Extracting Disease-Symptom Relationships from Health Question and Answer Forum

Christian Halim, Alfan F. Wicaksono and Mirna Adriani

76

Combine Multi-features with Deep Learning for Answer Selection

Yuqing Zheng, Chenghe Zhang, Dequan Zheng and Feng Yu

85

Domain Independent Keyword Identification for Question Answering

Prathyusha Jwalapuram and Radhika Mamidi

107

Towards a Deep Learning Powered Query Engine for Urban Planning

Yon Shin Teo, Zihong Yuan, Yangfan Zhang, Valerie Phang and Wee Siong Ng

130

Single Turn Chinese Emotional Conversation Generation based on Information Retrieval and Question Answering

Zhiheng Zhou, Man Lan, Yuanbin Wu and Jun Lang

 

Dec 5

13:30 – 15:15

S5:
Word Embedding

Chair: Prof. Sanath Jayasena



University of Moratuwa, Sri Lanka

 

20

Analysis of Japanese WSD with HiraganaKanji conversion and Context Word Embeddings

Yuki Gumizawa and Kazuhide Yamamoto

21

CBOS: Continuos Bag of Sentences for Learning Sentence Embeddings

Ye Yuan and Yue Zhang

31

Analyzing word embeddings and improving POS tagger of Tigrinya

Yemane Tedla and Kazuhide Yamamoto

33

Bilingual Word Embedding with Sentence Similarity Constraint for Machine Translation

Kui Wu, Xuancong Wang and AiTi Aw

54

Domain Adaption Based on LDA and Word Embedding in SMT

Shaolin Zhu, Yating Yang, Xiao Li, Tonghai Jiang, Lei Wang, Xi Zhou and Chenggang Mi

143

Constructing an Enriched Domain Taxonomy for Hindi using Word Embeddings

Vaishakh K, Pravalika A, Sowmya Kamath S and Geetha V

 

 

6

Neural Domain Adaptation for Chinese Word Segmentation

Zuyi Bao, Si Li, Weiran Xu and Sheng Gao

 

Dec 5

13:30 – 15:00

S6: Linguistics

Chair: Prof. Shili Ge

Guangdong University of Foreign Studies

66

A Quantitative Study of the Weakening of Emotional Words' Emotional Intensity in "X-yixia" Construction in the Perspective of Construction Coercion

Chenghao Zhu and Pengyuan Liu

7

An Optimality Theory Solution to Disyllabic Tone Sandhi and Neutral Tone in Yichang Dialect

Yan Li

132

Embedding Wikipedia Title Based on Its Wikipedia Text and Categories

Chi-Yen Chen and Wei-Yun Ma

69

A Study on the Reduplication of Chinese Classifiers

Fengcun An and Lei Zhao

129

The Automatic Extraction of common-used Adverbs for Teaching Chinese as Second Language

Zhimin Wang and Meiyu Wang

149

A Multi-dimensional Analysis of Deception

Qi Su

 

Dec 5

15:45 – 17:30

S7:
Neural machine translation

Chair: Dr. Jun Lang

Lazada Group

68

Recursive Annotations for Attention-Based Neural Machine Translation

Shaolin Ye and Wu Guo

77

Controlling Byte Pair Encoding for Neural Machine Translation

Alfred John Tacorda, Marvin John Ignacio, Nathaniel Oco and Rachel Edita Roxas

88

Improving Character-level Japanese-Chinese Neural Machine Translation with Radicals as an Additional Input Feature

Jinyi Zhang and Tadahiro Matsumoto

89

Exploration of Chinese-Uyghur Neural Machine Translation

Gulnigar Mahmut, Rehmutulla Memet, Mewlude Nijat and Askar Hamdulla

96

Language Post Positioned Characteristic Based Chinese-Vietnamese Statistical Machine Translation Method

Jianyalin He, Zhengtao Yu, changtao lv, Hua Lai, Shengxiang Gao and Yang Zhang

114

Error Analysis of Chinese-English Machine Translation on the Clause-Complex Level

Xiaoping Lin, Shili Ge and Rou Song

81

Neural Machine Translation for Sinhala and Tamil Languages

Pasindu Tennage, Prabath Sandaruwan, Malith Thilakarathne, Achini Herath, Surangika Ranathunga, Sanath Jayasena and Gihan Dias

 

Dec 5

15:45 – 17:30

S8: Speech processing

 

Chair: Dr. Lei Wang

Institute for Infocomm Research, Singapore

32

Study on the Aspirated Characteristics of Chinese Mandarin Consonant

Shiliang Lyu and Luxin Zhou

60

Improving Air Traffic Control Speech Intelligibility by Reducing Speaking Rate Effectively

Nana Hou, Xiaohai Tian, Eng Siong Chng, Bin Ma and Haizhou Li

153

A Light-weight Method of Building an LSTM-RNN-based Bilingual TTS System

Huaiping Ming, Yanfeng Lu, Zhengchen Zhang and Minghui Dong

109

Word Polarity Detection using Syllable Features for Manipuri Language

Loitongbam Gyanendro Singh and Sanasam Ranbir Singh

117

A Review of the Mandarin-English Code-switching Corpus: SEAME

Grandee Lee, Thi-Nga Ho, Eng Siong Chng and Haizhou Li

139

Effect of Language Independent Transcribers on Spoken Language Identification for Different Indian Languages

Rajlakshmi Saikia, Sanasam Ranbir Singh and Priyankoo Sarmah

 

 

142

Adapting monolingual resources for code-mixed Hindi-English speech recognition

Ayushi Pandey, Brij Mohan Lal Srivastava and Suryakanth V Gangashetty

 

Dec 5

15:45 – 17:15

S9: Under-resourced language studies

Chair: Prof. Yan Li

Shaanxi Normal University

30

Variational Grid Setting Network

Yu-Neng Chuang, Zi-yu Huang and Yen-Lung Tsai

35

Compiling a Text Re-Use Detection Corpus from Scientific Papers with Semi-Real Cases of Plagiarism

salar mohtaj, Habibollah Asghari and Vahid Zarrabi

45

A Rule and Statistical Modeling based Stem Extraction Method for Kazakh Words

Rehmutulla Memet, Gulnigar Mahmut, Mewlude Nijat and Askar Hamdulla

51

HiEnCor: on Mining of a Hi-En General Purpose Parallel Corpus from the Web

Arjun Das, Utpal Garain, Ravindra Kumar and Apurbalal Senapati

86

Chinese-Thai Cross-Language Topic Extraction and Alignment

Xia Li, Zihang Zeng, Jianshu Zhang and Shengyi Jiang

128

“Nee Intention enti?” Towards Dialog Act Recognition in Code-Mixed Conversations

Divya Sai Jitta, Khyathi Raghavi Chandu, Harsha Pamidipalli and Radhika Mamidi

 

Dec 6

10:30 – 12:00

S10:
Text mining I

Chair: Prof. Xia Li

Guangdong university of foreign studies

44

Understanding Explicit Arithmetic Word Problems and Explicit Plane Geometry Problems Using Syntax-Semantics Models

Xinguo Yu, Wenbin Gan and Mingshu Wang

55

Simple and sophisticated inning summary generation based on encoder-decoder model and transfer learning

Yuuki Tagawa and Kazutaka Shimada

84

A Joint Framework for Entity Discovery and Linking in Chinese Questions

Ziqi Lin, Wancheng Ni, Haidong Zhang, Yu Liu and Yiping Yang

101

Multi-Document News Summarization via Paragraph Embedding and Density Peak Clustering

Baoyan Wang, Jian Zhang, Fanggui Ding and Yuexian Zou

120

Semantic-Frame Representation for Event Detection on Twitter

Yanxia Qin, Yue Zhang, Min Zhang and Dequan zheng

131

Exploring Semantic Content to User Profiling for User Cluster-based Collaborative Point-of-Interest Recommander System

Yuhuan Xiu, Man Lan, Yuanbin Wu and Jun Lang

 

 

 

 

           

 

Dec 6

10:30 – 12:00

S11: Machine learning for natural language II

Chair: Prof. Kazuhide Yamamoto

Nagaoka University of Technology, Japan

 

34

On the Use of Machine Translation-Based Approaches for Vietnamese Diacritic Restoration

Thai-Hoang Pham, Xuan-Khoai Pham and Phuong Le-Hong

78

Filipino and English Clickbait Detection Using a Long Short Term Memory Recurrent Neural Network

Philogene Kyle Dimpas, Royce Vincent Po and Mary Jane Sabellano

106

Transition-based Dependency Parser with Postponed Determinations for Japanese Sentences

Xiaobo Xi and Akihiro Inokuchi

127

Chinese Hedge Scope Detection Based on Phrase Semantic Representation

Huiwei Zhou, Shixian Ning, Yunlong Yang, Zhuang Liu and Junli Xu

150

The Singular Value Decomposition-based Anchor Word Selection Method for Separable Nonnegative Matrix Factorization

Delano Novrilianto, Hendri Murfi and Ari Wibowo

103

Qualitative data analysis of disaster risk reduction suggestions assisted by topic modeling and word2vec

Ken Gorro, Jeffrey Rosario Ancheta, Nathaniel Oco, Rachel Edita Roxas, Mary Jane Sabellano, Brandie Nonnecke, Shrestha Mohanty, Camille Crittenden and Ken Goldberg

 

             

Dec 6

13:30 – 15:15

S12:
Text mining II

Chair: Dr. Kui Wu

Institute for Infocomm Research, Singapore

40

Extraction of Indonesian and English Parallel Sentences from Movie Subtitles

Boon Hong Yeo, Ai Ti Aw and Xuancong Wang

72

Supervised Learning for Robust Term Extraction

Yu Yuan, Jie Gao and Yue Zhang

91

Non-Entity Event Argument Extraction on Structural Representation

Yiting Liu and Peifeng Li

10

A Simple Yet Effective Method for Summarizing Microblogging Users with their Representative Tweets

Shuangyong Song, Yao Meng, Zhiwei Shi, Zhongguang Zheng and Haiqing Chen

105

Using Topic Analysis Techniques to Support Comprehensive Research Paper Searches

Satoshi Fukuda and Yoichi Tomiura

138

Extractive Text Summarisation in Hindi

Sakshee Vijay, Vartika Rai, Sorabh Gupta, Anshuman Vijayvargia and Dipti Misra Sharma

152

Mining Features for Web NER Model Construction based on Distant Learning

Chien-Lung Chou and Chia-Hui Chang

 

 

 

Dec 6

13:30 – 15:15

S13:
Word sense disambiguation

Chair: Dr. Ridong Jiang

Institute for Infocomm Research, Singapore

15

Co-Occurrence Semantic Knowledge Base Construction for Abbreviation Disambiguation

Shuangyong Song, Qingliang Miao, Zhiwei Shi, Yao Meng and Haiqing Chen

111

Japanese-Chinese Machine Translation for the Japanese Case Particle "de"

Jinyi Zhang and Tadahiro Matsumoto

27

Implicit Discourse Relation Identification based on Tree Structure Neural Network

Ruiying Geng, Ping Jian, Yingxue Zhang and Heyan Huang

67

Joint Bi-Affine Parsing and Semantic Role Labeling

Peng Shi and Yue Zhang

70

Word Sense Disambiguation of Adjectives using Dependency Structure and Degree of Association Between Sentences

Kenichi Mishina, Seiji Tsuchiya and Hirokazu Watabe

110

Analysis of Literal and Metaphorical Senses Based on Diachronic Word Embeddings

Yuxiang Jia, Yi Zheng, Hongying Zan and Zhimin Wang

135

Improving Word and Sense Embedding with Hierarchical Semantic Relations

Yow-Ting Shiue and Wei-Yun Ma

 

Dec  6

15:45 – 17:15

S14:
NLP applications

Chair: Dr. Zhengchen Zhang

Institute for Infocomm Research, Singapore

37

The Chinese Vietnamese Bilingual News Event Ranking Method Based on Attribute Association Graph

Mingwei Zhu, Zhengtao YU, Guangshun Qin, Hua Lai and Shengxiang Gao

42

Complement of Incomplete Task Results for Real-time Crowdsourcing Interpretation

Takeaki Shionome, Hirotaka Hashimoto, Jianwei Zhang, Yuhki Shiraishi, Daisuke Wakatsuki, Yohei Seki and Atsuyuki Morishima

50

Sentence simplification with core vocabulary

Takumi Maruyama and Kazuhide Yamamoto

75

Neural Architecture for Tibetan Word Segmentation

Mengzhu Chen, Shengjie Zhao and Kai Yang

95

Hybrid Answer Selection Model for Non-Factoid Question Answering

Rongqiang Ma, Jian Zhang, Miao Li, Lei Chen and Jin Gao

112

Named Entity Transliteration with Sequence-to-Sequence Neural Network

Zhongwei Li, Eng Siong Chng and Haizhou Li

 

Dec 6

15:45 – 17:15

S15: Sentiment Analysis

Chair: Dr. Xuancong Wang

Institute for Infocomm Research, Singapore

28

A Hierarchical LSTM Model with Multiple Features for Sentiment Analysis of Sina Weibo Texts

Shumin Shi, Meng Zhao, Jun Guan, Yaxuan Li and Heyan Huang

49

Unsupervised Aspect-Based Sentiment Analysis on Indonesian Restaurant Reviews

Dhanang Hadhi Sasmita, Alfan F. Wicaksono, Samuel Louvan and Mirna Adriani

56

Fine-grained Sentiment Analysis with 32 Dimensions

Xianchao Wu, Hang Tong and Momo Klyen

122

InSet Lexicon: Evaluation of a Word List for Indonesian Sentiment Analysis in Microblogs

Fajri Koto and Gemala Y. Rahmaningtyas

126

A Microblog Dataset for Tibetan Sentiment Analysis

Yong Cuo, Xiaodong Shi, Nyima Trashi and Yidong Chen

146

Towards Question Identification from Online Healthcare Consultation Forum Post in Bahasa

Rahmad Mahendra, Abid Nurul Hakim and Mirna Adriani

Remarks: