본문 바로가기
  • Home

A Character Shape Encoding Method to Input Chinese Characters in Old Documents

  • The Journal Of Korean Medical Classics
  • Abbr : JKMC
  • 2019, 32(1), pp.105~116
  • DOI : 10.14369/jkmc.2019.32.1.105
  • Publisher : 대한한의학원전학회
  • Research Area : Medicine and Pharmacy > Korean Medicine
  • Received : February 7, 2019
  • Accepted : February 12, 2019
  • Published : February 25, 2019

Kiwang Kim 1

1부산대학교

Accredited

ABSTRACT

Objectives : There are many secluded Chinese characters – so called Byeokja (僻字) in ancient classic literature, and Chinese characters that are not registered in Unicode and Variant characters (heterogeneous characters) that cannot be found in the current font sets often appear. In order to register all possible Chinese characters including such characters as units of information exchange, this study attempts to propose a method to encode the morphological information of Chinese characters according to certain rules. Methods : This study suggests the methods to encode the connection between the nodules constituting the Chinese character and the coordinates of the nodules. In addition to that, rules for expressing information about curves, expressions of aspect ratios of characters, rules for minimizing coordinate lines, and rules for expressing aggregation status of character components are added. Results : Through the proposed method, it is possible to generate codes of a certain length by extracting only information expressing the morphological configuration of characters. Conclusions : The method of character encoding proposed in this study can be used to distinguish variant characters with small variations in Byeokja, new Chinese characters and character strokes and to store and search them.

Citation status

* References for papers published after 2023 are currently being built.