Research by Jim Samuel et al. “Customized AI Readers: An Adaptive Framework for Flexible Human Handwriting Recognition of Numerical Digits with OCR Methods”

June 16, 2023

Abstract

Advanced artificial intelligence (AI) techniques have led to significant developments in optical character recognition (OCR) technologies. OCR applications, using AI techniques for transforming images of typed text, handwritten text, or other forms of text into machine-encoded text, provide a fair degree of accuracy for general text. However, even after decades of intensive research, creating OCR with human-like abilities has remained evasive. One of the challenges has been that OCR models trained on general text do not perform well on localized or personalized handwritten text due to differences in the writing style of alphabets and digits. This study aims to discuss the steps needed to create an adaptive framework for OCR models, with the intent of exploring a reasonable method to customize an OCR solution for a unique dataset of English language numerical digits were developed for this study. We develop a digit recognizer by training our model on the MNIST dataset with a convolutional neural network and contrast it with multiple models trained on combinations of the MNIST and custom digits. Using our methods, we observed results comparable with the baseline and provided recommendations for improving OCR accuracy for localized or personalized handwritten text. This study also provides an alternative perspective to generating data using conventional methods, which can serve as a gold standard for custom data augmentation to help address the challenges of scarce data and data imbalance.

Keywords

OCR; adaptive; custom; digits; MNIST; informatics; machine learning; deep learning

Citation

Jain, P.H.; Kumar, V.; Samuel, J.; Singh, S.; Mannepalli, A.; Anderson, R. Customized AI Readers: An Adaptive Framework for Flexible Human Handwriting Recognition of Numerical Digits with OCR Methods. Information202314, 305. https://doi.org/10.3390/info14060305

Recent Posts

Zhang et al. Study Street-View Greenspace and Exercise

GPS-based street-view greenspace exposure and wearable assessed physical activity in a prospective cohort of US women Abstract Background Increasing evidence positively links greenspace and physical activity (PA). However, most studies use measures of greenspace, such...

NJSPL: Some College, No Credential Population in NJ

Overview of the Some College, No Credential Population and Educational Outcomes in New Jersey, 2023–2024 New Jersey State Policy Lab Supporting New Jersey residents in returning to college after leaving without a credential has been an increasing focus of the state’s...

Loh and Noland Explore Public Charging Station Disparities

Equal charging for all: Are there income-based disparities in public charging stations? Abstract We compare charging station accessibility for different income groups in the San Francisco Bay Area. Using a microsimulation model, we estimate charging station...

Heldrich Center Releases New Work Trends Brief and Website

The Heldrich Center for Workforce Development is pleased to announce the availability of two new research products resulting from its long-running public opinion polling series, Work Trends. To better understand the public’s attitudes about work, employers, and the...

NJSPL Report: Analyzing the Use and Equity of ARPA Funds

Report Release: Analyzing the Use and Equity of ARPA Funds in NJ Local Governments and Beyond New Jersey State Policy Lab The American Rescue Plan Act’s Coronavirus State and Local Fiscal Recovery Funds (ARPA-SLFRF) represent a historic $350 billion investment to...