Data extraction

Once timesheet scanning is complete, the images are submitted for image processing, intelligent classification, document recognition, and data extraction.

Image processing and quality control

Image quality is a critical factor in determining how well characters and documents are identified and processed.

Our timesheet processing system includes advanced image enhancement and correction capabilities, including de-skewing, auto-rotating and character enhancement.

Built-in quality control features identify and correct low-quality images early in the process to reduce human intervention.

Automated document recognition and classification

Each image received will be compared to one or more pre-defined templates to identify the form type. The timesheet software can recognise and distinguish between many different form types and process them accordingly. Unrecognised pages (such as cover pages or damaged pages) will be routed to an unclassified queue for manual handling.
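
As an illustration only, the template comparison and unclassified queue described above could be sketched as follows. This is not the product's actual implementation: the feature vectors, the cosine similarity measure, the `min_score` threshold and all function names are assumptions made for the example.

```python
from math import sqrt


def cosine_similarity(a, b):
    """Similarity between two equal-length feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def classify_page(page_features, templates, min_score=0.9):
    """Return the best-matching template name, or "unclassified".

    `templates` maps a hypothetical form-type name to its feature vector.
    `min_score` is an assumed cut-off below which a page goes to manual handling.
    """
    best_name, best_score = None, 0.0
    for name, template_features in templates.items():
        score = cosine_similarity(page_features, template_features)
        if score > best_score:
            best_name, best_score = name, score
    if best_name is None or best_score < min_score:
        return "unclassified"
    return best_name


# Hypothetical templates: each vector marks which layout anchors are present.
templates = {
    "weekly_timesheet": [1, 0, 1, 0],
    "monthly_timesheet": [0, 1, 0, 1],
}

print(classify_page([1, 0, 1, 0.1], templates))  # close to the weekly layout
print(classify_page([1, 1, 1, 1], templates))    # ambiguous, e.g. a cover page
```

In this sketch a page that matches no template closely enough is returned as "unclassified" rather than being forced into the nearest form type, which mirrors the manual-handling queue described above.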

Extract data from scanned timesheets using OCR, ICR, OMR and barcode recognition

After document processing and classification, the timesheet software uses handprint (ICR), machine print (OCR) and checkbox (OMR) recognition technology to automatically extract data from scanned timesheets. This includes constrained print fields, tick boxes, variable scales, comments and signatures.

Intelligent character recognition (ICR)

To read short hand-printed responses from constrained print fields (one character per box), such as names, dates and numbers.

Optical mark recognition (OMR)

OMR technology identifies if a checkbox has been filled, with automatic handling of crossed-out and amended responses.

Optical character recognition (OCR)

To capture machine-printed text.

Image zone capture

Predictive key-from-image entry, coding and image snippet capture can be used to handle drawings or cursive handwriting.

Barcode recognition

To recognise all standard barcode types, including 2D matrix barcodes.

Signature detection

To confirm if a paper form has been authorised with a signature by calculating the fill percentage of the field.
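
A minimal sketch of a fill-percentage check, assuming the signature field has already been cropped to a grid of greyscale pixel values (0 is black, 255 is white). The `dark_threshold` and `min_fill` values and the function names are illustrative assumptions, not settings taken from the product.

```python
def fill_percentage(field_pixels, dark_threshold=128):
    """Percentage of pixels in the field darker than `dark_threshold`."""
    total = sum(len(row) for row in field_pixels)
    if total == 0:
        return 0.0
    dark = sum(1 for row in field_pixels for pixel in row if pixel < dark_threshold)
    return 100.0 * dark / total


def is_signed(field_pixels, min_fill=2.0):
    """Treat the field as signed when enough of it contains ink.

    `min_fill` is an assumed threshold (in percent) that would need tuning
    against real scans to ignore specks and printed guide lines.
    """
    return fill_percentage(field_pixels) >= min_fill


# A blank 10 x 10 field, and the same field with five dark "ink" pixels.
blank_field = [[255] * 10 for _ in range(10)]
signed_field = [row[:] for row in blank_field]
for column in range(2, 7):
    signed_field[5][column] = 0

print(fill_percentage(blank_field), is_signed(blank_field))
print(fill_percentage(signed_field), is_signed(signed_field))
```

A check of this kind only confirms that something has been written in the field; it does not verify whose signature it is.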

Simple rules such as alphabetic and numeric checks, dictionaries, date ranges, look-ups and mandatory fields are checked at this stage, with any unrecognised fields or characters queued for human review.

These common-sense logic rules are applied to the extracted timesheet data to ensure that invalid responses are not exported (e.g. more than one response ticked for a single-choice question). In such situations, the exception is routed to the appropriate human operator to review and correct. The entire process takes seconds, meaning thousands of timesheets can be processed each day.
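
The rule checks described above could be sketched along these lines. The field names (`employee_name`, `hours`, `work_date`, `shift_type`), the 0 to 24 hour limit and the shape of the exception list are all hypothetical, chosen only to show one rule of each kind.

```python
from datetime import date


def validate_timesheet(fields, period_start, period_end):
    """Return a list of (field, reason) exceptions for human review."""
    exceptions = []

    # Mandatory and alphabetic rule.
    name = fields.get("employee_name", "")
    if not name:
        exceptions.append(("employee_name", "mandatory field is empty"))
    elif not name.replace(" ", "").isalpha():
        exceptions.append(("employee_name", "expected letters only"))

    # Numeric rule with an assumed plausible range for one day.
    try:
        hours_ok = 0 <= float(fields.get("hours", "")) <= 24
    except ValueError:
        hours_ok = False
    if not hours_ok:
        exceptions.append(("hours", "expected a number between 0 and 24"))

    # Date-range rule: the date must fall inside the pay period.
    work_date = fields.get("work_date")
    if work_date is None or not (period_start <= work_date <= period_end):
        exceptions.append(("work_date", "outside the pay period"))

    # Single-choice rule: exactly one box may be ticked.
    if len(fields.get("shift_type", [])) != 1:
        exceptions.append(("shift_type", "expected exactly one ticked option"))

    return exceptions


extracted = {
    "employee_name": "J0hn Smith",       # a misread character
    "hours": "7.5",
    "work_date": date(2024, 3, 4),
    "shift_type": ["day", "night"],      # two boxes ticked
}
for field, reason in validate_timesheet(extracted, date(2024, 3, 1), date(2024, 3, 31)):
    print(field, "->", reason)
```

Only records that return an empty list would be exported; anything else would be queued with the failing field and reason, so an operator can see what to correct.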

About ePC

Trusted by Rentokil Initial, Next plc, and the British Council, ePC support organisations to replace manual, paper-driven tasks with data capture, workflow, and document scanning solutions that reduce manual data entry and automate critical tasks. Visit our website.

Company registration number: 05192543. VAT number: GB842064740.
Address: PO Box 1578, Lightwater, GU20 5AR, United Kingdom

© ePartner Consulting Ltd 2004-2024