Research by Jim Samuel et al. “Customized AI Readers: An Adaptive Framework for Flexible Human Handwriting Recognition of Numerical Digits with OCR Methods”

June 16, 2023

Abstract

Advanced artificial intelligence (AI) techniques have led to significant developments in optical character recognition (OCR) technologies. OCR applications, using AI techniques for transforming images of typed text, handwritten text, or other forms of text into machine-encoded text, provide a fair degree of accuracy for general text. However, even after decades of intensive research, creating OCR with human-like abilities has remained evasive. One of the challenges has been that OCR models trained on general text do not perform well on localized or personalized handwritten text due to differences in the writing style of alphabets and digits. This study aims to discuss the steps needed to create an adaptive framework for OCR models, with the intent of exploring a reasonable method to customize an OCR solution for a unique dataset of English language numerical digits were developed for this study. We develop a digit recognizer by training our model on the MNIST dataset with a convolutional neural network and contrast it with multiple models trained on combinations of the MNIST and custom digits. Using our methods, we observed results comparable with the baseline and provided recommendations for improving OCR accuracy for localized or personalized handwritten text. This study also provides an alternative perspective to generating data using conventional methods, which can serve as a gold standard for custom data augmentation to help address the challenges of scarce data and data imbalance.

Keywords

OCR; adaptive; custom; digits; MNIST; informatics; machine learning; deep learning

Citation

Jain, P.H.; Kumar, V.; Samuel, J.; Singh, S.; Mannepalli, A.; Anderson, R. Customized AI Readers: An Adaptive Framework for Flexible Human Handwriting Recognition of Numerical Digits with OCR Methods. Information202314, 305. https://doi.org/10.3390/info14060305

Recent Posts

Gov. Murphy Lectures in Roseman’s Class

Governor Phil Murphy joined former Chief Speechwriter Derek Roseman's Political Communications for Public Policy class at the Bloustein School on April 2nd. Photos via X.com and Julia Sass Rubin.

Molloy Discusses Criteria for Healthiest Cities

Location matters when it comes to health. Some places promote wellness by expanding access to nutritious food and recreational facilities. Others strive to keep healthcare costs affordable for everyone or keep parks clean and well-maintained. When a city doesn’t take...

McGlynn & Payne Explore the Relational Reprojection Platform

Counter-GIS Experiments in Distance Interpolation with the Relational Reprojection Platform Abstract In this paper, we discuss the cartographic genealogy and prospective uses of the Relational Reprojection Platform (RRP), an interactive tool that we built to create...

Clint Andrews–The Critical Role of University Research

The Critical Role of University Research: Funding, Challenges, and Impact This week on EJB Talks dean Stuart Shapiro and Associate Dean of Research Clint Andrews discuss the vital role federal-funded university research plays in complementing education, driving...

Payne Investigates City Digital Twins Concepts

Expanding the city digital twin in the context of crisis, cartography and computation Abstract This commentary responds to Gillian Rose's ‘Visualising human life in volumetric cities: city digital twins and other disasters’ as a framework for thinking about crisis and...