본문 바로가기
  • Home

Performance Comparison of Korean Dialect Classification Models Based on Acoustic Features

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2021, 26(10), pp.37-43
  • DOI : 10.9708/jksci.2021.26.10.037
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science
  • Received : September 15, 2021
  • Accepted : October 25, 2021
  • Published : October 29, 2021

Young-Kook Kim 1 Myung-Ho Kim 2

1숭실대학교 융합소프트웨어학과
2숭실대학교

Accredited

ABSTRACT

Using the acoustic features of speech, important social and linguistic information about the speaker can be obtained, and one of the key features is the dialect. A speaker's use of a dialect is a major barrier to interaction with a computer. Dialects can be distinguished at various levels such as phonemes, syllables, words, phrases, and sentences, but it is difficult to distinguish dialects by identifying them one by one. Therefore, in this paper, we propose a lightweight Korean dialect classification model using only MFCC among the features of speech data. We study the optimal method to utilize MFCC features through Korean conversational voice data, and compare the classification performance of five Korean dialects in Gyeonggi/Seoul, Gangwon, Chungcheong, Jeolla, and Gyeongsang in eight machine learning and deep learning classification models. The performance of most classification models was improved by normalizing the MFCC, and the accuracy was improved by 1.07% and F1-score by 2.04% compared to the best performance of the classification model before normalizing the MFCC.

Citation status

* References for papers published after 2023 are currently being built.