본문 바로가기
  • Home

A Method for Compound Noun Extraction to Improve Accuracy of Keyword Analysis of Social Big Data

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2021, 26(8), pp.55-63
  • DOI : 10.9708/jksci.2021.26.08.055
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science
  • Received : June 30, 2021
  • Accepted : August 11, 2021
  • Published : August 31, 2021

Hyeon Gyu Kim 1

1삼육대학교

Accredited

ABSTRACT

Since social big data often includes new words or proper nouns, statistical morphological analysis methods have been widely used to process them properly which are based on the frequency of occurrence of each word. However, these methods do not properly recognize compound nouns, and thus have a problem in that the accuracy of keyword extraction is lowered. This paper presents a method to extract compound nouns in keyword analysis of social big data. The proposed method creates a candidate group of compound nouns by combining the words obtained through the morphological analysis step, and extracts compound nouns by examining their frequency of appearance in a given review. Two algorithms have been proposed according to the method of constructing the candidate group, and the performance of each algorithm is expressed and compared with formulas. The comparison result is verified through experiments on real data collected online, where the results also show that the proposed method is suitable for real-time processing.

Citation status

* References for papers published after 2023 are currently being built.