본문 바로가기
  • Home

An Automatic Data Construction Approach for Korean Speech Command Recognition

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2019, 24(12), pp.17-24
  • DOI : 10.9708/jksci.2019.24.12.017
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science
  • Received : November 26, 2019
  • Accepted : December 20, 2019
  • Published : December 31, 2019

Yeonsoo Lim 1 Deokjin Seo 1 Park, Jeong-Sik 2 JUNG YU CHUL 1

1금오공과대학교
2한국외국어대학교

Accredited

ABSTRACT

The biggest problem in the AI field, which has become a hot topic in recent years, is how to deal with the lack of training data. Since manual data construction takes a lot of time and efforts, it is non-trivial for an individual to easily build the necessary data. On the other hand, automatic data construction needs to handle data quality issue. In this paper, we introduce a method to automatically extract the data required to develop Korean speech command recognizer from the web and to automatically select the data that can be used for training data. In particular, we propose a modified ResNet model that shows modest performance for the automatically constructed Korean speech command data. We conducted an experiment to show the applicability of the command set of the health and daily life domain. In a series of experiments using only automatically constructed data, the accuracy of the health domain was 89.5% in ResNet15 and 82% in ResNet8 in the daily lives domain, respectively.

Citation status

* References for papers published after 2023 are currently being built.