An Enhanced OCR-Based Model for Accurate Text Extraction from Images

Author(s)	Mayank Deshmukh, Saloni Rabde, Priyanka Makode, Sourabh Jasuja, Prof. Bhavesh Khasdev
Country	India
Abstract	The growing need for digitization and intelligent document processing has led to significant advancements in text detection and extraction technologies. This paper reviews methodologies and tools employed for extracting textual information from images and Portable Document Format (PDF) files. Both traditional Optical Character Recognition (OCR) techniques and modern deep learning-based approaches are discussed. Five major research contributions in this area are analyzed in detail. The paper further explores challenges in handling complex document layouts, multilingual text, and low-quality images, and highlights research gaps and future directions that emphasize the potential of artificial intelligence and multimodal learning to enhance text extraction accuracy and efficiency.
Keywords	OCR, Text Extraction, Deep Learning, Layout LM, Scene Text Detection, Document Analysis.
Field	Engineering
Published In	Volume 17, Issue 4, April 2026
Published On	2026-04-06

About IJTAS Fees & Payment Current Issue Publication Archive	Submit Research Paper Track Submission Status Publication Guidelines Publication Ethics Peer Review & Plagiarism	Join as a Reviewer Editors & Reviewers Reviewer Referral Program Get Reviewer Membership Certi.	Website/Journal Policies Usage Policy Content Policies Privacy Policy

Contact Us	Message on WhatsApp	+91-9687-182-185	editor@ijtas.com

International Journal of Technology and Applied Science