본문 바로가기
  • Home

AN EFFICIENT METHOD FOR DOCUMENT IMAGE GEOMETRIC LAYOUT ANALYSIS

  • Journal of Software Assessment and Valuation
  • Abbr : JSAV
  • 2013, 9(1), pp.49-55
  • Publisher : Korea Software Assessment and Valuation Society
  • Research Area : Engineering > Computer Science
  • Received : June 7, 2013
  • Accepted : June 29, 2013
  • Published : June 30, 2013

CHUNG YUN KOO 1

1한국전자통신연구원

ABSTRACT

Document image analysis is necessary for optical character recognition (OCR) and also very useful for many other document image manipulations. In this paper, we propose a document image geometric layout analysis system which has less region segmentation and classification error than that of the commercial software and previous works. The proposed method segments the document image into small regions to the size of a character using fast connected components generation method, so that it prevents the different types of connected components from combining. We also propose new criterion for clustering the connected components and some new techniques to deal with noise and reduce computation time. Experiment shows classification error rate of text and picture regions is decreased.

Citation status

* References for papers published after 2023 are currently being built.