Historical typewritten document recognition using minimal user interaction

Retsinas, G, Gatos, B, Antonacopoulos, Apostolos, Louloudis, G and Stamatopoulos, N 2015, 'Historical typewritten document recognition using minimal user interaction', in: Proceedings of the 3rd Workshop on Historical Document Imaging and Processing (HIP2015), ACM, pp. 31-38.

Abstract

Recognition of low-quality historical typewritten documents remains a challenging task because of issues such as faint and degraded characters, stains, tears and punch holes. In this paper, we exploit the unique characteristics of historical typewritten documents to propose an efficient recognition methodology that requires minimal user interaction. It consists of a pre-processing stage that enhances image quality and extracts connected components, a semi-supervised clustering stage that detects the most representative character samples, and a segmentation-free recognition stage based on template matching and cross-correlation. Experimental results show that, even with minimal user interaction, the proposed method achieves promising accuracy.
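
As a rough illustration of the template matching and cross-correlation idea mentioned in the abstract, the following minimal sketch slides a character template over a small image and scores each position with zero-mean normalized cross-correlation. It is not the authors' implementation: the function names `ncc` and `best_match`, the list-of-lists image representation, and the toy data are all assumptions made for this example.

```python
from math import sqrt


def ncc(patch, template):
    """Zero-mean normalized cross-correlation of two equal-sized 2D lists.

    Hypothetical helper for illustration; returns a value in [-1, 1],
    or 0.0 when either input has no variation.
    """
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    da = [v - mean_a for v in a]
    db = [v - mean_b for v in b]
    denom = sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    if denom == 0:
        return 0.0  # a flat patch or template carries no shape information
    return sum(x * y for x, y in zip(da, db)) / denom


def best_match(image, template):
    """Return (score, (row, col)) of the best-scoring template position."""
    th, tw = len(template), len(template[0])
    ih, iw = len(image), len(image[0])
    best = (-2.0, (0, 0))  # any real score is >= -1, so this is replaced
    for r in range(ih - th + 1):
        for c in range(iw - tw + 1):
            patch = [row[c:c + tw] for row in image[r:r + th]]
            score = ncc(patch, template)
            if score > best[0]:
                best = (score, (r, c))
    return best


# Toy data (assumed): a plus-shaped "character" placed at row 1, column 2.
template = [
    [0, 1, 0],
    [1, 1, 1],
    [0, 1, 0],
]
image = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, 0],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]

score, position = best_match(image, template)
print(position)
```

Because the score is normalized, a faint but correctly shaped character can still score highly, which is one reason this family of measures is commonly used on degraded scans. In this sketch the template is chosen by hand; in a pipeline like the one the abstract describes, templates would presumably come from the representative character samples found by the clustering stage.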

Item Type: Book Section
Schools: Schools > School of Computing, Science and Engineering
Journal or Publication Title: Proceedings of the 3rd Workshop on Historical Document Imaging and Processing (HIP2015)
Publisher: ACM
ISBN: 9781450336024
Funders: European Commission
Depositing User: Professor Apostolos Antonacopoulos
Date Deposited: 22 Mar 2016 16:11
Last Modified: 29 Apr 2016 09:43
URI: http://usir.salford.ac.uk/id/eprint/38464
