CASE OVERVIEW.
Our client is a financial corporation that processes thousands of applications daily through their screening system – requiring the data to be manually digitized from physical documents. The digitization process was marred with inaccuracies and errors resulting in a significant increase in the time and cost of the whole operation.
With advanced optical character recognition techniques available to help, the task was automated to a large extent. However, existing OCR technology still had limitations, such as errors in recognition or image quality requirements. Current off-the-shelf solutions were not acceptable for the job due to the required efficiency and precision of the data. The client needed something better. Adding AI to the Character and Word recognition provided the required extra precision of the data while retaining efficiency.
Machine Learning | OCR
CHALLENGES.
The challenges included creating an Intelligent Optical Character Recognition solution that could digitize physical documents, no matter the source of the documents (computer printed, clean or uneven fonts, hand-written). Requirements:
- All data on the document had to be recognized and digitized. Any text on the document that could not be digitized must be resolved due to the sensitive nature of the data. No text could be skipped.
- System must be capable of recognizing handwritten documents, including cursive text, within acceptable error limits.
- Resolution issues should first be handled automatically by the AI. If not resolvable, human interaction would indicate the correct data, and the AI would learn from that instance.
APPROACH.
They hired few AI specialists from AI development company in California and collaborated with Pegasus One’s Team. These teams utilized the non-legacy Glyph-based document processing engine for OCR. This allowed our team to implement:
- An Information extraction engine, capable of reading the physical document with higher accuracy.
- Low-resolution document scanning support (with AI learning models).
- For handwriting recognition, our team devised a Machine Learning AI model that learns every time a document is scanned. It’s capable of reading and self-learning cursive, and other non-uniform styles of writing.
RESULTS.
HIGH PRECISION
Broadcasts only the required data. No wastage of precious bandwidth.
AI
Enhanced and accurate reporting of health stats
SELF-LEARNING
Intuitive UI for better user interaction and information consumption
COST REDUCTION
Intuitive UI for better user interaction and information consumption