Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Progressive Multitask Learning Network for Online Chinese Signature Segmentation and Recognition

  • Conference paper
  • First Online:
Frontiers in Handwriting Recognition (ICFHR 2022)

Abstract

Recently, the booming of electronic devices has revolutionized the way we sign in our daily life. The sudden surge of online Chinese signatures calls for need online Chinese signature segmentation as prerequisite for downstream tasks such as building up database of Chinese characters and verifying signatures based upon individual characters. However, common approaches deriving from over-segmentation do not apply well to Chinese signatures, which have little linguistic meanings but instead have flourish, artistic styles, composite structures or even word overlaps. To cope with those difficulties, this paper exploits the benefits of signature recognition to boost segmentation performance and proposes a progressive multitask learning network (PMLNet) for online Chinese signature segmentation and recognition. PMLNet consists of a dual channel stroke feature extraction block (DSF-Block), a stacked transformer encoder block (STE-Block) and a progressive multitask learning block (PML-Block). DSF-block is used to extract stroke-wise spatial features and semantic features through dual channels; STE-block is used to model long-range dependencies among different strokes and enhance their feature representations; PML-block is used to branch interactively and progressively the signature’s final segmentation and recognition. Specifically, we introduce a progressive learning strategy in PML-block to fine-tune segmentation and recognition results by fully leveraging their reciprocal relationship. The experiment results on our private database show that PMLNet achieves 8.77% higher accurate rate (AR), 9.15% higher correct rate (CR) and 9.93% higher sample-level segmentation accurate rate (\(\mathrm {ACC_{seg}}\)) than SOTAs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alemi, A.A., Fischer, I., Dillon, J.V., Murphy, K.: Deep variational information bottleneck. arXiv preprint arXiv:1612.00410 (2016)

  2. Bai, Z.L., Huo, Q.: A study on the use of 8-directional features for online handwritten Chinese character recognition. In: Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), pp. 262–266. IEEE (2005)

    Google Scholar 

  3. Chen, K., et al.: Hybrid task cascade for instance segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4974–4983 (2019)

    Google Scholar 

  4. Chen, K., et al.: A compact CNN-DBLSTM based character model for online handwritten Chinese text recognition. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 1068–1073. IEEE (2017)

    Google Scholar 

  5. Cheng, J., Dong, L., Lapata, M.: Long short-term memory-networks for machine reading. arXiv preprint arXiv:1601.06733 (2016)

  6. Collobert, R., Weston, J.: A unified architecture for natural language processing: deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine Learning, pp. 160–167 (2008)

    Google Scholar 

  7. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

  8. Ding, H., Chen, K., Hu, W., Cai, M., Huo, Q.: Building compact CNN-DBLSTM based character models for handwriting recognition and OCR by teacher-student learning. In: 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 139–144. IEEE (2018)

    Google Scholar 

  9. Graves, A., Fernández, S., Gomez, F., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 369–376 (2006)

    Google Scholar 

  10. Han, K., et al.: A survey on vision transformer. IEEE Trans. Pattern Anal. Mach. Intell. (2022)

    Google Scholar 

  11. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

    Google Scholar 

  12. Huang, J.T., Li, J., Yu, D., Deng, L., Gong, Y.: Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 7304–7308. IEEE (2013)

    Google Scholar 

  13. Ke, X., Zhang, X., Zhang, T.: GCBANET: a global context boundary-aware network for SAR ship instance segmentation. Remote Sens. 14(9), 2165 (2022)

    Article  Google Scholar 

  14. Keysers, D., Deselaers, T., Rowley, H.A., Wang, L.L., Carbune, V.: Multi-language online handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1180–1194 (2016)

    Article  Google Scholar 

  15. Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2016)

    Google Scholar 

  16. Li, N., Jin, L.: A Bayesian-based probabilistic model for unconstrained handwritten offline Chinese text line recognition. In: 2010 IEEE International Conference on Systems, Man and Cybernetics, pp. 3664–3668. IEEE (2010)

    Google Scholar 

  17. Liao, M., Wan, Z., Yao, C., Chen, K., Bai, X.: Real-time scene text detection with differentiable binarization. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 11474–11481 (2020)

    Google Scholar 

  18. Liebel, L., Körner, M.: Auxiliary tasks in multi-task learning. CoRR abs/1805.06334 (2018). http://arxiv.org/abs/1805.06334

  19. Liu, M., Xie, Z., Huang, Y., Jin, L., Zhou, W.: Distilling GRU with data augmentation for unconstrained handwritten text recognition. In: 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 56–61. IEEE (2018)

    Google Scholar 

  20. Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28(2), 129–137 (1982)

    Article  MathSciNet  MATH  Google Scholar 

  21. Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017)

  22. Peng, D., et al.: Recognition of handwritten Chinese text by segmentation: a segment-annotation-free approach. IEEE Trans. Multimed. (2022)

    Google Scholar 

  23. Peng, D., et al.: Towards fast, accurate and compact online handwritten Chinese text recognition. In: Lladós, Josep, Lopresti, Daniel, Uchida, Seiichi (eds.) ICDAR 2021. LNCS, vol. 12823, pp. 157–171. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-86334-0_11

    Chapter  Google Scholar 

  24. Su, T.H., Zhang, T.W., Guan, D.J., Huang, H.J.: Off-line recognition of realistic Chinese handwriting using segmentation-free strategy. Pattern Recogn. 42(1), 167–182 (2009)

    Article  MATH  Google Scholar 

  25. Sun, K., Zhang, R., Mensah, S., Mao, Y., Liu, X.: Progressive multi-task learning with controlled information flow for joint entity and relation extraction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 13851–13859 (2021)

    Google Scholar 

  26. Sun, L., Su, T., Liu, C., Wang, R.: Deep LSTM networks for online Chinese handwriting recognition. In: 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 271–276. IEEE (2016)

    Google Scholar 

  27. Vaswani, A., et al.: Attention is all you need. Adv. Neural Inf. Process. Syst. 30 (2017)

    Google Scholar 

  28. Wang, Z.X., Wang, Q.F., Yin, F., Liu, C.L.: Weakly supervised learning for over-segmentation based handwritten Chinese text recognition. In: 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 157–162. IEEE (2020)

    Google Scholar 

  29. Wang, Z.R., Du, J., Wang, J.M.: Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition. Pattern Recogn. 100, 107102 (2020)

    Article  Google Scholar 

  30. Wu, Y.C., Yin, F., Liu, C.L.: Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models. Pattern Recogn. 65, 251–264 (2017)

    Article  Google Scholar 

  31. Xie, Z., Sun, Z., Jin, L., Ni, H., Lyons, T.: Learning spatial-semantic context with fully convolutional recurrent network for online handwritten Chinese text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 40(8), 1903–1917 (2017)

    Article  Google Scholar 

  32. Yin, F., Wang, Q.F., Zhang, X.Y., Liu, C.L.: ICDAR 2013 Chinese handwriting recognition competition. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 1464–1470. IEEE (2013)

    Google Scholar 

  33. Zhang, X.Y., Yin, F., Zhang, Y.M., Liu, C.L., Bengio, Y.: Drawing and recognizing Chinese characters with recurrent neural network. IEEE Trans. Pattern Anal. Mach. Intell. 40(4), 849–862 (2017)

    Article  Google Scholar 

  34. Zhou, X.D., Wang, D.H., Tian, F., Liu, C.L., Nakagawa, M.: Handwritten Chinese/Japanese text recognition using semi-Markov conditional random fields. IEEE Trans. Pattern Anal. Mach. Intell. 35(10), 2413–2426 (2013)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhonghao Shen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Qin, X., Zhang, H., Ke, X., Shen, Z., Qi, S., Liu, K. (2022). Progressive Multitask Learning Network for Online Chinese Signature Segmentation and Recognition. In: Porwal, U., Fornés, A., Shafait, F. (eds) Frontiers in Handwriting Recognition. ICFHR 2022. Lecture Notes in Computer Science, vol 13639. Springer, Cham. https://doi.org/10.1007/978-3-031-21648-0_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-21648-0_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-21647-3

  • Online ISBN: 978-3-031-21648-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics