It-Jim

It-Jim

Computer vision solutions for your business

50$-99$
Ukraine
10-49
2015

It-Jim is a group of experts with PhD degrees in various mathematical disciplines. We conduct the high-quality research in the fields of computer vision, pattern recognition, machine learning, artificial intelligence, augmented reality, signal and image processing.

Contact Information

Ukraine
It-Jim
Kharkiv , Eastern Ukraine 61057 Ukraine

DOCUMENT ANALYSIS AND RECOGNITION Visit Website

Social Finance Information Technology

Timeline: 0 weeks   Amount: Not Disclosed

Automatic document analysis is a key part of the overall document recognition process. A common scenario is when the user takes a picture by mobile phone or tablet and the goal is to automatically parse and recognize content from the captured document. Such as pictures, tables, text data, links, etc. There are several challenges in this case: geometric distortions of the paper, varying illumination, occlusions. Nevertheless, we have built the page unwrapping engine, which successfully handles the above problems. The developed algorithm contains several key components: Preprocessor, Feature Extractor, Geometric Model Estimator and Refiner. The preprocessor performs image filtering, roughly locates the document boundaries and extracts vertical and horizontal spans (lines and text regions). Feature Extractor module performs parsing of document content. For robustness, we have fused several types of features including corners and edges from images preprocessing in different ways. The output nonlinear grid of points is used as an input for the next system component: Geometric Model Estimator. In order to properly account for the nonlinear geometric distortions of the document, we have constructed a specific 2D-3D model, where page shape is reconstructed as a 3D surface together with the 6 DoF camera position in 3D space.Finally, after the estimation of model parameters, the document is dewarped and most of the existing distortions are corrected. Developed dewarping engine works very accurately and performs distortion correction within a reasonable time frame. KEY POINTS: Handling illumination changes Correction of nonlinear geometric distortions Automatic page border detection 2-step procedure returns well-refined document layout

Portfolio