Professor Shuichi ITAHASHI, University of Tsukuba, Japan
Abstract: This paper reviews the activities of Oriental COCOSDA and the importance of speech corpora for promoting speech research. It describes the process of creating speech corpora, including their design, recording, editing, labeling, validation and utilization. It also discusses methods for assessing speech input/output systems. It then surveys the organizations involved in speech corpora creation and utilization around the world, with special emphasis on recent Asian activities. Finally, it considers the future prospects for speech corpora creation and utilization.
Professor Thomas Fang Zheng, Tsinghua University
Abstract: It is well understood that speech databases play a very important role in speech recognition. It is a dream of speech recognition researchers to create a more useful database with less effort. To achieve this goal, the database should be well designed from the start, and tools and additional information should be provided so that it can be used to the full. This paper illustrates the criteria according to which a database should be created for different purposes. It also discusses transcription, which is the first task after the data have been created. Finally, it gives an example of how knowledge can be learned from the created database for other research purposes.