The main task of fuzzy judgment is to determinate if the imported image is clear or blurry.
This is a 2 steps process:
The main task of border cutting is to find the position of image foreground (by detecting the borders using Line Segment Detector) and remove the background part.
The main task of the perspective transformation is to correct the distortion or tilt of the foreground part.
This is a 2 steps process:
The main task of type detection is matching the imported image with a template image, then return the most similar template type.
This is a 2 steps process:
Template matching is mainly to extract the features of the template and match the feature descriptors that correspond to the features.
This is a 4 steps process:
The main task of binary conversion is to convert the blocks of images to black-white.
This part of the process is critical. If the conversion is not correct, the impact in character recognition and matching is huge.
The main process is to find the information of each text region, then to convert each text region to black and white.
The main task of layout analysis is the segmentation of blocks from the image. These blocks are then categorized into text blocks, line blocks or graphic blocks.
This is a 3 steps process: