It-Jim
Computer vision solutions for your business
It-Jim is a group of experts with PhD degrees in various mathematical disciplines. We conduct the high-quality research in the fields of computer vision, pattern recognition, machine learning, artificial intelligence, augmented reality, signal and image processing.
Contact Information
Kharkiv , Eastern Ukraine 61057 Ukraine
DOCUMENT ANALYSIS AND RECOGNITION Visit Website
Social Finance Information Technology
Timeline: 0 weeks Amount: Not Disclosed
Automatic document analysis is a key part of the overall document recognition process. A common scenario is when the user takes a picture by mobile phone or tablet and the goal is to automatically parse and recognize content from the captured document. Such as pictures, tables, text data, links, etc. There are several challenges in this case: geometric distortions of the paper, varying illumination, occlusions. Nevertheless, we have built the page unwrapping engine, which successfully handles the above problems. The developed algorithm contains several key components: Preprocessor, Feature Extractor, Geometric Model Estimator and Refiner. The preprocessor performs image filtering, roughly locates the document boundaries and extracts vertical and horizontal spans (lines and text regions). Feature Extractor module performs parsing of document content. For robustness, we have fused several types of features including corners and edges from images preprocessing in different ways. The output nonlinear grid of points is used as an input for the next system component: Geometric Model Estimator. In order to properly account for the nonlinear geometric distortions of the document, we have constructed a specific 2D-3D model, where page shape is reconstructed as a 3D surface together with the 6 DoF camera position in 3D space.Finally, after the estimation of model parameters, the document is dewarped and most of the existing distortions are corrected. Developed dewarping engine works very accurately and performs distortion correction within a reasonable time frame. KEY POINTS: Handling illumination changes Correction of nonlinear geometric distortions Automatic page border detection 2-step procedure returns well-refined document layout
