본문 바로가기
  • Home

Investigation and Transcription of North Korean for building a Corpus

  • Korean Language & Literature
  • 2021, (117), pp.5-32
  • DOI : 10.21793/koreall.2021.117.5
  • Publisher : Korean Language & Literature
  • Research Area : Humanities > Korean Language and Literature
  • Received : May 17, 2021
  • Accepted : June 14, 2021
  • Published : June 30, 2021

Baek Euna 1

1전북대학교

Accredited

ABSTRACT

When investigating and transcribing narratives data in North Korean for the purpose of building a corpus, a number of issues arise. This paper discussed these issues. For discussion, narratives data of informants from Yanggang-do were used. When collecting North Korean data, it is recommended to record narratives data targeted to North Korean defectors. And when investigating and transcribing North Korean, we basically have to follow the methods of dialect survey. However, several problems arise in the investigation and transcription process because of the following: the environmental specificity of the informant, the characteristics of the building a corpus. This paper confirmed the following facts. First, when collecting North Korean dialects, it is effective to select North Korean defectors as interviewer. Because it can solve the interviewer’s problem, and it is easy to recruit informants. It is also good for forming a bond with the informant. However, in this case, it is necessary to prepare detailed investigation guidelines for North Korean defectors interviewer. Second, it is better to use the ELAN program when transcribing narratives. Third, the transcription proceeds to form morphophonological transcription, and the transcription must be divided into two stages. Listed Words of 『Urimalsaem』 follows dictionary notation, and Non-listed Words must attach morphological information and semantic information to the word form.

Citation status

* References for papers published after 2023 are currently being built.

This paper was written with support from the National Research Foundation of Korea.