Similarity-based Text Recognition by Deeply Supervised Siamese Network

Hosseini-Asl, Ehsan; Guha, Angshuman

Computer Science > Computer Vision and Pattern Recognition

arXiv:1511.04397 (cs)

[Submitted on 13 Nov 2015 (v1), last revised 5 Jul 2016 (this version, v5)]

Title:Similarity-based Text Recognition by Deeply Supervised Siamese Network

Authors:Ehsan Hosseini-Asl, Angshuman Guha

View PDF

Abstract:In this paper, we propose a new text recognition model based on measuring the visual similarity of text and predicting the content of unlabeled texts. First a Siamese convolutional network is trained with deep supervision on a labeled training dataset. This network projects texts into a similarity manifold. The Deeply Supervised Siamese network learns visual similarity of texts. Then a K-nearest neighbor classifier is used to predict unlabeled text based on similarity distance to labeled texts. The performance of the model is evaluated on three datasets of machine-print and hand-written text combined. We demonstrate that the model reduces the cost of human estimation by $50\%-85\%$. The error of the system is less than $0.5\%$. The proposed model outperform conventional Siamese network by finding visually-similar barely-readable and readable text, e.g. machine-printed, handwritten, due to deep supervision. The results also demonstrate that the predicted labels are sometimes better than human labels e.g. spelling correction.

Comments:	Accepted for presenting at Future Technologies Conference - (FTC 2016) San Francisco, December 6-7, 2016
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1511.04397 [cs.CV]
	(or arXiv:1511.04397v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1511.04397

Submission history

From: Ehsan Hosseini-Asl [view email]
[v1] Fri, 13 Nov 2015 18:46:01 UTC (2,444 KB)
[v2] Wed, 18 Nov 2015 20:59:10 UTC (2,508 KB)
[v3] Fri, 8 Jan 2016 00:37:29 UTC (1,899 KB)
[v4] Sun, 3 Jul 2016 16:38:35 UTC (1,896 KB)
[v5] Tue, 5 Jul 2016 01:21:08 UTC (1,897 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Similarity-based Text Recognition by Deeply Supervised Siamese Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Similarity-based Text Recognition by Deeply Supervised Siamese Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators