In this paper, we propose a method for extracting and recognizing research-relevant information from images of unstructured pulmonary function test papers using character detection and recognition techniques.
We also develop a post-processing method to reduce the character recognition error rate. The proposed structuring method first applies a character detection model to a pulmonary function test paper image to detect all characters in it, then passes each detected character image through a character recognition model to obtain a string. Each obtained string is checked for validity using string matching, which completes the structuring.
We confirm that the proposed structuring system is more efficient and stable than manual structuring by professionals: its error rate is within about 1%, and it processes each pulmonary function test paper within 2 seconds.
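As a rough illustration of the string-matching validation step, the following minimal sketch maps a recognized string to the closest entry in a list of expected field labels and rejects strings that match nothing. The label list, the function name `correct_label`, and the similarity cutoff are assumptions made for illustration only; they are not taken from the proposed system, and the fuzzy matcher from Python's standard `difflib` module stands in for whatever matching rule the system actually uses.

```python
from difflib import get_close_matches

# Hypothetical vocabulary of field labels expected on a test paper
# (illustrative only; the real system's label set is not given here).
KNOWN_LABELS = ["FVC", "FEV1", "FEV1/FVC", "PEF", "DLCO", "TLC"]


def correct_label(recognized, vocabulary=KNOWN_LABELS, cutoff=0.6):
    """Return the known label closest to the recognized string.

    Returns None when no label reaches the similarity cutoff, so that
    the string can be flagged as invalid instead of silently accepted.
    The cutoff value is an assumed default, not a tuned parameter.
    """
    matches = get_close_matches(recognized.upper(), vocabulary, n=1, cutoff=cutoff)
    return matches[0] if matches else None


if __name__ == "__main__":
    # A typical recognition confusion: the digit "1" read as the letter "I".
    print(correct_label("FEVI"))
    # A string unrelated to any known label is rejected.
    print(correct_label("XYZQ"))
```

In this sketch, a misread such as "FEVI" is corrected to "FEV1", while an unrelated string yields `None` and could be routed to manual review.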