Semi-supervised Feature Learning For Improving Writer Identification

Shiming Chen; Yisong Wang; Chin-Teng Lin; Weiping Ding; Zehong Cao

by Shiming Chen, Yisong Wang, Chin-Teng Lin, Weiping Ding, Zehong Cao

Released as a article .

2018

Abstract

Data augmentation is usually used by supervised learning approaches for offline writer identification, but such approaches require extra training data and potentially lead to overfitting errors. In this study, a semi-supervised feature learning pipeline was proposed to improve the performance of writer identification by training with extra unlabeled data and the original labeled data simultaneously. Specifically, we proposed a weighted label smoothing regularization (WLSR) method for data augmentation, which assigned the weighted uniform label distribution to the extra unlabeled data. The WLSR method could regularize the convolutional neural network (CNN) baseline to allow more discriminative features to be learned to represent the properties of different writing styles. The experimental results on well-known benchmark datasets (ICDAR2013 and CVL) showed that our proposed semi-supervised feature learning approach could significantly improve the baseline measurement and perform competitively with existing writer identification approaches. Our findings provide new insights into offline write identification.
In text/plain format

Archived Files and Locations

application/pdf 4.0 MB
file_64si3xnb2fhibcifegnt2ypr6u arxiv.org (repository)
web.archive.org (webarchive)

Read Archived PDF

Preserved and Accessible

Type article
Stage

submitted

Date 2018-10-06
Version v3
Language en ^?

arXiv 1807.05490v3

Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)

Cite This

BibTeX
CSL-JSON
MLA
Harvard

Lookup Links

Worldcat
wikidata.org
CORE.ac.uk
Semantic Scholar
Google Scholar

Catalog Record
Revision: f2b66cd2-bbdd-4017-8fdc-5c01eee2978e
API URL: JSON

Edit Metadata View History

Semi-supervised Feature Learning For Improving Writer Identification release_dfxtdkumiffetbtpoye4jp35te

Abstract

Archived Files and Locations

Semi-supervised Feature Learning For Improving Writer Identification `release_dfxtdkumiffetbtpoye4jp35te`