ANALYZING DIFFERENT ALGORITHMS AND TECHNIQUES TO FIND OPTICAL CHARACTER RECOGNITION FOR TAMIL SCRIPTS

Authors:

Rajkumar N,A. B. Rajendra,Janhavi V,

DOI NO:

https://doi.org/10.26782/jmcms.2020.02.00029

Keywords:

Tamil,OCR,Manuscript,Script,Optical Character Recognition,Tamil Language,Tamil Script,

Abstract

Tamil is one of the world's ancient languages. This paper focuses mainly in particular on OCR for the digitalization and conservation of texts and inscriptions in the Tamil language. A system that does not include obtaining either Standard size and shape or the color difference between background and foreground to recognize Palm Leaf Manuscript and stone inscriptions and obtaining information. A variety of algorithms have been analyzed for OCR texts for Tamils, and ancient letter conversion still has a big challenge to convert ancient Tamils into today's digital text format for Tamils.

Refference:

I. A. V. S. Rao, N. V. Rao, L. P. Reddy, G. Sunil,T.S.K.Prabhu, and A. S. C. S. Sastry, “Adaptive binarization of ancient documents,” in Proceedings of the 2nd International Conference on Machine Vision (ICMV ’09), pp. 22–26, December 2009.
II. B. Gangamma, K. Srikanta Murthy, and A. V. Singh, “Restoration of degraded historical document image,” Journal of Emerging Trends in Computing and Information Sciences, vol. 3, no. 5,pp. 36–39, 2012.
III. C. L. Tan, R. Cao, and P. Shen, “Restoration of archival documents using a wavelet technique,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 10, pp. 1399–1404, 2002.
IV. E. Nadernejad, S. Sharifzadeh, and H. Hassanpour, “Edge detection techniques:evaluations and comparisons,” Applied Mathematical Sciences, vol. 2, no. 31, pp. 1507–1520, 2008.
V. G. R. M. Babu, P. Srimayee, and A. Srikrishna, “Heterogenous images using mathematical morphology,” Journal ofTheoretical and Applied Information Technology, vol. 15, no. 5, pp. 795–825, 2008.
VI. G. Raju and K. Revathy, “Wavepackets in the Recognition of Isolated Handwritten Characters”, Proceedings of the World Congress on Engineering 2007 Vol I WCE 2007, July 2 – 4, 2007, London, U.K.
VII. G. Rama mohanbabu,, P. Srimaiyee, A. Srikrishna , “Text extraction from hetrogenous images using mathematical morphology”, Journal of Theoretical and Applied Information Technology, vol. 16, no. 1, pp. 39 – 47 2010.
VIII. G.Y.Chen, T.D.Bui, A.Krzyzak. “Contour based handwritten numeral recognition using multiwavelets and neural networks”,Pattern Recognition, vol. 36, no. 7, pp. 1597 – 1604, 2003.
IX. H.K.Chethan, G.Hemantha Kumar, “ A Comparative Analysis of Different Edge based Algorithms for Mobile/Camera Captured Images”, International Journal of Computer Applications, vol. 7, no. 3, pp. 36-41, 2010.
X. H.K.Chethana, G.HemanthaKumar , “Image Dewarping and Text Extraction from Mobile Captured Distinct Documents”, Procedia Computer Science, vol. 2, pp. 330 – 337, 2010.
XI. Jian Yuan, Yi Zhang, KokKiong Tan, Tong Heng Lee, “Text Extraction from Images Captured via Mobile and Digital Devices”, 2009 IEEE/ASME International Conference on Advanced Intelligent Mechatronics, pp. 566 – 571,2009.
XII. K. C. Kim, H. R. Byun, Y. J. Song et al., “Scene text extraction in natural scene images using hierarchical feature combining and verification,” in Proceedings of the 17th International Conference on Pattern Recognition (ICPR ’04), vol. 2, pp. 679–682, gbr, August 2004.
XIII. L. Agnihotri and N. Dimitrova, “Text detection for video analysis,” in Proceedings of IEEE International Workshop onContent-Based Access of Image and Video Libraries, pp. 109–113, June 1999.
XIV. M. Seeger and C. Dance, Binarising Camera Images for OCR, Xerox Research Centre, Meylan, France, 2000.
XV. P Wunsch, A F Laine. “Wavelet descriptors for multiresolution recognition of handprinted characters”. Pattern Recognition, vol. 28, no. 8, pp. 1237–1249, 1995.
XVI. P. Chevalier, L. Albera, P. Comon, and Ferreol, “Comparative performance analysis of eight blind source separation methods on radio communications signals,” in Proceedings of the International Joint Conference on Neural Networks, vol. 8, pp. 251–276, July 2004.
XVII. P. Tichavsk´y, Z. Koldovsk´y, and E. Oja, “Performance analysis of the FastICA algorithm and Cram´er-Rao bounds for linear independent component analysis,” IEEE Transactions on Signal Processing, vol. 54, no. 4, pp. 1189–1203, 2006.
XVIII. S P Chowdhury S DharA K Das B Chanda K McMenemy, “Robust Extraction of Text from Camera Images “, 2009 10th International Conference on Document Analysis and Recognition, pp. 1280 – 1284, 2009.
XIX. S W Lee, C H Kim, H Ma, Y Y Tang. “Multiresolution recognition of unconstrained handwritten numerals with wavelet transform and multilayer cluster neural network”. Pattern Recognition, vol. 29, pp. 1953–1956, 1996.
XX. S. Buzykanov, “Enhancement of poor resolution text images in the weighted sobolev space,” in Proceedings of the 19th International Conference on Systems, Signals and Image Processing (IWSSIP ’12), pp. 536–539, Vienna, Austria, April 2012.
XXI. S. Cherala and P. Rege, “Palm leaf manuscript/color document image enhancement by using improved adaptive binarization method,” in Proceedings of the 6th Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP ’08), pp. 687–692, December 2008.
XXII. S. Choi, “Independent component analysis,” in Proceedings of the 12th WSEAS International Conference on Communications, pp. 159–162, July 2008.
XXIII. S. Choi, A. Cichocki, and S. I. Amari, “Flexible independent component analysis,” Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, vol. 26, no. 1, pp. 25–38, 2000.
XXIV. Sachin Grover, KushalArora,Suman K. Mitra , “Text Extraction from Document Images using Edge Information”, 2009 Annual IEEE India Conference, 2009.
XXV. Seethalakshmi R, Sreeranjani TR., Balachandar T, “Optical Character Recognition for printed Tamil text using Unicode”, Journal of Zhejiang University Science, vol. 6A, no. 11, pp. 1297-1305, 2005
XXVI. U. Garain, A. Jain, A. Maity, and B. Chanda, “Machine reading of camera-held low quality text images: an ICA-based image enhancement approach for improving OCR accuracy,” in Proceedings of the 19th International Conference on Pattern Recognition (ICPR ’08), pp. 1–4, December 2008.
XXVII. Vinoth R, Rajesh R, Yoganandhan P, “Intelligence System for Tamil Vattezhuttu Optical Character Recognition”, International Journal of Computer Science & Engineering Technology, vol. 8, no. 04, pp. 22 -27 Apr 2017
XXVIII. X. S. Hua, P. Yin, and H. J. Zhang, “Efficient video text recognition using multiple frame integration,” in Proceedings of the International Conference on Image Processing, vol. 2, pp. 22–25, September 2004.

View Download