본문 바로가기
  • Home

Text summarization of dialogue based on BERT

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2022, 27(8), pp.41-47
  • DOI : 10.9708/jksci.2022.27.08.041
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science
  • Received : July 13, 2022
  • Accepted : August 8, 2022
  • Published : August 31, 2022

Wongyung Nam 1 Jisoo Lee 1 BEAKCHEOL JANG 1

1연세대학교

Accredited

ABSTRACT

In this paper, we propose how to implement text summaries for colloquial data that are not clearly organized. For this study, SAMSum data, which is colloquial data, was used, and the BERTSumExtAbs model proposed in the previous study of the automatic summary model was applied. More than 70% of the SAMSum dataset consists of conversations between two people, and the remaining 30% consists of conversations between three or more people. As a result, by applying the automatic text summarization model to colloquial data, a result of 42.43 or higher was derived in the ROUGE Score R-1. In addition, a high score of 45.81 was derived by fine-tuning the BERTSum model, which was previously proposed as a text summarization model. Through this study, the performance of colloquial generation summary has been proven, and it is hoped that the computer will understand human natural language as it is and be used as basic data to solve various tasks.

Citation status

* References for papers published after 2023 are currently being built.

This paper was written with support from the National Research Foundation of Korea.