Home   >   CSC-OpenAccess Library   >    Manuscript Information
A Novel Method for De-warping in Persian document images captured by cameras
hadi dehbovid, farbod razzazi, shapor alirezaee
Pages - 390 - 400     |    Revised - 30-08-2010     |    Published - 30-10-2010
Volume - 4   Issue - 4    |    Publication Date - October 2010  Table of Contents
MORE INFORMATION
KEYWORDS
Geometric Distortion, OCR, camera based OCR, Image Archives
ABSTRACT
In this Paper, We proposed a novel algorithm for de-warping of Persian document images captured by the cameras. The aim of de-warping is to remove page distortions and to straighten document images captured by the cameras, so that the documents are readable to the OCR system. Recently, the industrial implementation of the images captured by digital cameras has significantly expanded. Most of the studies carries out so far in this regard have focused on the documents written in Latin and few researches have been conducted regarding Persian documents. The original idea of the proposed algorithm is based on the segmentation of the components of texts. In this algorithm, an effective technique is offered for detection of the upper and lower baselines, which is used in estimation of the slope of the words. Moreover, vertical shift of the warped words is done through fitting a quadratic curve fitted to the centers of the words in a line in relation to the horizontal line. The suggested algorithm is examined by qualitative and quantitative measures and the results of its implementation on various documents indicate a 92% accuracy of the proposed technique in correction of the location and angle of the words.
CITED BY (3)  
1 Shayegan, M. A. (2015). Dataset size and dimensionality reduction approaches for handwritten farsi digits and characters recognition (Doctoral dissertation, University of Malaya).
2 Camera, S. C. B. Electric Institute funded master's degree thesis master's program.
3 Guo Wende. (2011). Identification and automatic music playing system to retrieve the camera.
1 Google Scholar 
2 CiteSeerX 
3 iSEEK 
4 Socol@r  
5 Scribd 
6 SlideShare 
7 PDFCAST 
8 PdfSR 
A. Masalovitch and L. Mestetskiy. "Usage of continuous skeletal image representation for document images de-warping". In 2nd Int. Workshop on Camera- Based Document Analysis and Recognition, Curitiba, Brazil, 2007
A. Ulges, C. Lampert, and T. M. Breuel. "Document capture using stereo vision". In Proceedings of the ACM Symposium on Document Engineering, Milwaukee, Wisconsin, USA, 2004
A. Ulges, C.H. Lampert and T.M. Breuel. "Document image dewarping using robust estimation of curled text lines". In Proc. Eighth Int. Conf. on Document Analysis and Recognition, Washington, DC, USA, 2005
A. Yamashita, A. Kawarago, T. Kaneko and K.T.Miura. "Shape reconstruction and image restoration for non-flat surfaces of documents with a stereo vision system". In Proceedings of 17th International Conference on Pattern Recognition (ICPR) Cambridge UK, 2004
B. Fu, M.Wu, R. Li,W. Li, and Z. Xu. "A model-based book de-warping method using text line detection". In 2nd Int. Workshop on Camera-Based Document Analysis and Recognition, Curitiba, Brazil, 2007
B.Gatos, I. Pratikakis, and K. Ntirogiannis. "Segmentation based recovery of arbitrarily warped document images". In Proc. Int. Conf. on Document Analysis and Recognition, Curitiba, Brazil, 2007
F. Shafait and T. M. Breuel. "Document Image Dewarping Contest". In proc CBDR, 2007
J. Liang, D. Doermann, H. Li. "Camera-based analysis of text and documents: a survey". Int. Jour. Of Document Analysis and Recognition, 7(2-3): 84104, 2005
J. Liang, D.F. DeMenthon, and D. Doermann. "Flattening curved documents in images". In Proc. Computer Vision and Pattern Recognition,San Diego, 2005
L. Zhang and C.L. Tan. "Warped image restoration with applications to digital libraries". In Proc. Eighth Int. Conf. on Document Analysis and Recognition, Washington, DC, USA, 2005
M. Pilu. "Deskewing perspectively distorted documents: An approach based on perceptual organization". In HP Technical Reports, 2001
M.S. Brown and W.B. Seales. "Document restoration using 3d shape: A general deskewing algorithm for arbitrarily warped documents". In International Conference on Computer Vision (ICCV), Vancouver, B.C., Canada, 2001
U.V. Marti, H. Bunke. "Using a statistical language model to improve the performance of an HMMbased cursive handwriting recognition system". Int. Jour. of Pattern Recognition and Artifical Intelligence, 15(1): 6590, 2001
Mr. hadi dehbovid
Islamic Azad University, Nour Branch - Iran
hadi.dehbovid@gmail.com
Dr. farbod razzazi
paya soft - Iran
Dr. shapor alirezaee
- Iran