본문 바로가기
  • Home

Method for Detection and Correction of Pseudo-Semantic Errors Due to Typographical Errors

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2013, 18(10), pp.173-182
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science

김동주 1

1안양대학교

Accredited

ABSTRACT

Typographical mistakes made in the writing process of drafts of electronic documents are more common than any other type of errors. The majority of these errors caused by mistyping are regarded as consequently still typo-errors, but a considerable number of them are developed into the grammatical errors and the semantic errors. Pseudo semantic errors among these errors due to typographical errors have more noticeable peculiarities than pure semantic errors between senses of surrounding context words within a sentence. These semantic errors can be detected and corrected by simple algorithm based on the co-occurrence frequency because of their prominent contextual discrepancy. I propose a method for detection and correction based on the co-occurrence frequency in order to detect semantic errors due to typo-errors. The co-occurrence frequency in proposed method is counted for only words with immediate dependency relation, and the cosine similarity measure is used in order to detect pseudo semantic errors. From the presented experimental results, the proposed method is expected to help improve the detecting rate of overall proofreading system by about 2~3%.

Citation status

* References for papers published after 2023 are currently being built.