본문 바로가기
  • Home

Detecting Spelling Errors by Comparison of Words within a Document

  • Journal of The Korea Society of Computer and Information
  • Abbr : JKSCI
  • 2011, 16(12), pp.83-92
  • Publisher : The Korean Society Of Computer And Information
  • Research Area : Engineering > Computer Science

김동주 1

1안양대학교

Accredited

ABSTRACT

Typographical errors by the author's mistyping occur frequently in a document being prepared with word processors contrary to usual publications. Preparing this online document, the most common orthographical errors are spelling errors resulting from incorrectly typing intent keys to near keys on keyboard. Typical spelling checkers detect and correct these errors by using morphological analyzer. In other words, the morphological analysis module of a speller tries to check well-formedness of input words, and then all words rejected by the analyzer are regarded as misspelled words. However, if morphological analyzer accepts even mistyped words, it treats them as correctly spelled words. In this paper, I propose a simple method capable of detecting and correcting errors that the previous methods can not detect. Proposed method is based on the characteristics that typographical errors are generally not repeated and so tend to have very low frequency. If words generated by operations of deletion, exchange, and transposition for each phoneme of a low frequency word are in the list of high frequency words, some of them are considered as correctly spelled words. Some heuristic rules are also presented to reduce the number of candidates. Proposed method is able to detect not syntactic errors but some semantic errors, and useful to scoring candidates.

Citation status

* References for papers published after 2022 are currently being built.