본문 바로가기
  • Home

Investigating an Automatic Method for Summarizing and Presenting a Video Speech Using Acoustic Features

  • Journal of the Korean Society for Information Management
  • Abbr : JKOSIM
  • 2012, 29(4), pp.191~208
  • DOI : 10.3743/KOSIM.2012.29.4.191
  • Publisher : 한국정보관리학회
  • Research Area : Interdisciplinary Studies > Library and Information Science
  • Received : November 21, 2012
  • Accepted : December 13, 2012
  • Published : December 30, 2012

Kim, Hyun Hee 1

1명지대학교

Accredited

ABSTRACT

Two fundamental aspects of speech summary generation are the extraction of key speech content and the style of presentation of the extracted speech synopses. We first investigated whether acoustic features (speaking rate, pitch pattern, and intensity) are equally important and, if not, which one can be effectively modeled to compute the significance of segments for lecture summarization. As a result, we found that the intensity (that is, difference between max DB and min DB) is the most efficient factor for speech summarization. We evaluated the intensity-based method of using the difference between max-DB and min-DB by comparing it to the keyword-based method in terms of which method produces better speech summaries and of how similar weight values assigned to segments by two methods are. Then, we investigated the way to present speech summaries to the viewers. As such, for speech summarization, we suggested how to extract key segments from a speech video efficiently using acoustic features and then present the extracted segments to the viewers.

Citation status

* References for papers published after 2023 are currently being built.

This paper was written with support from the National Research Foundation of Korea.