Towards building a Bangla text recognition solution with a Multi-Headed CNN architecture.

Handwritten character recognition (HCR) remains a challenging pattern recognition problem despite decades of research, and lacks research on script independent recognition techniques. ... HCR-Net is extensively evaluated on 40 publicly available datasets of Bangla, Punjabi, Hindi, English, Swedish, Urdu, Farsi, Tibetan, Kannada, Malayalam, Telugu, Marathi, Nepali and Arabic languages, and ... Moreover, it is observed that a multi-column multi-scale CNN architecture proposed by [8] performs exceptionally well for Bangla script. ...

arXiv:2108.06663v4 fatcat:a5dof25aojhxpazmfjprtx5wca

Further enriching the synthetic dataset with non-Unicode fonts and multiple augmentations helps us achieve a remarkable Word Recognition Rate gain of over 33% on the IIIT-ILST Hindi dataset. ... This work investigates the significant differences in Indian and Latin Scene Text Recognition (STR) systems. ... Data Availability Statement: Data available in a publicly accessible repository that does not issue DOIs. ...

doi:10.3390/jimaging8040086 pmid:35448213 pmcid:PMC9025185 fatcat:m7m6oppkevagblcx67yx3yku2u

This drove us towards finding a solution that would limit the gap instead of the communication and build a bridge instead of a boundary. ... ily communicate with them. ... Another paper that describes work on recognition and translation of sign languages uses AI-based solutions [6]. ...

doi:10.1109/csde50874.2020.9411523 fatcat:tfkrskin55hxpdzajiwtw2ooou

Nowadays, many researchers try to find solutions to problems in various fields in light of DL methods. ... voice and video recognition, medical image processing, and big data. ... [170] provided a head-to-head comparison between the latest technology in the mammography CAD system and the CNN to obtain a system that can read mammography independently. ...

doi:10.30855/gmbd.2019.03.01 fatcat:2sv7dg7elrfqppcjx5otzmb7pi

For the Bangla language, no dataset is available for reading comprehension (RC), nor has any prior work been done. In this research work, we develop a question-answering system from RC. ... To do this, we construct a dataset containing 3636 reading comprehensions along with questions and answers. ... Multi-Head Attention: The multi-head self-attention mechanism in the transformer has three different uses. ...

doi:10.1016/j.heliyon.2022.e11052 pmid:36254291 pmcid:PMC9568857 fatcat:bgexsoawhrbb5isoeoo4ux3j7q

We further build a Bangla WordNet, BNNet, using these resources and map it to the Princeton English WordNet. ... With a variation on the typical CNN model architecture, the genre of a particular artwork is predicted with an accuracy of 98.21%. ... Omar Faruqe, Maliha Elma, Nahid Hossain ... Our purpose is to build a platform with clustering algorithms that will jointly help to provide the quickest way to find a blood or plasma donor. ...

doi:10.1109/iccit51783.2020.9392749 fatcat:pz3hf7rsmzbjpe6hxjlu5tmrfq

The most used concept in planning is sampling-based planning; it provides a successful solution in wayfinding path planning and is therefore applied in different robotics fields. ... In a behavior-based system, the overall behavior tasks are divided into smaller independent behaviors that focus on the performance of specific tasks such as the behavior of a robot ... From a statistical analysis, we notice that the probability that a Bangla word will have at least one character with a head-line is 0.994. ...

doi:10.15864/ajec.1303 fatcat:sitz5xykknhahawc54yptjptqa

In addition, recent advances have allowed researchers to move from simple recognition of sign language characters and words towards the capacity to translate continuous sign language communication with ... People with hearing impairments are found worldwide; therefore, the development of effective local level sign language recognition (SLR) tools is essential. ... [122] used the CNN approach to train a dataset obtained from the Bangla Sign Language. ...

doi:10.1109/access.2021.3110912 fatcat:mcjehb6znjcijhk2wzgdxbmzqq

However, recent studies show that the two kinds of clues are not always well registered and therefore, feature and character might be misaligned in difficult text (e.g., with a rare shape). ... The Transformer-based encoder-decoder framework is becoming popular in scene text recognition, largely because it naturally integrates recognition clues from both visual and semantic domains. ... A recent study reported very fast yet efficient solutions by leveraging ViT-like architecture (Du et al., 2022). We plan to incorporate it to speed up the recognition. ...

arXiv:2111.11011v5 fatcat:hdk2xpebr5fvzf7hwbov626ot4

Intelligent Character Recognition System is an application that uses a Convolutional Neural Network (CNN) to accurately recognize characters from the Tamil character dataset developed by HP Labs India. ... The proposed approach is capable of recognizing characters in a variety of challenging conditions using the Convolutional Neural Network, where traditional character recognition systems fail, notably in ... A Convolutional Neural Network (CNN, or ConvNet) is a special kind of multi-layer neural network, designed to recognize visual patterns directly from pixel images with minimal preprocessing. ...

doi:10.4108/eai.16-10-2020.166659 fatcat:rrv3tyk2ezegdhcwsvuvvkgbrq

The author applies a recent state-of-the-art scaling-based 3D-CNN and different pretrained deep neural architectures, such as VGG19 and ResNet-50, to compare the performance with the proposed architecture. ... In this work, we have used multiple deep convolutional neural networks (CNN) with the same architecture of InceptionV3. ... Paper ID: 121 Bangla Broadcast Speech Recognition Using Support Vector Machine Authors: Refat Noor Swarna Abstract: Over the past few decades, incredible growth has been revealed in the recognition ...

doi:10.1109/etcce51779.2020.9350902 fatcat:2drvbgkrmjabfeawuw33hui6yy

We propose the Multi Task Deep Morphological analyzer (MT-DMA), a character-level neural morphological analyzer based on multitask learning of word-level tag markers for Hindi and Urdu. ... Exploiting character-level features in phonological space optimized for each tag using multi-objective genetic algorithm, our model establishes a new state-of-the-art accuracy score upon all seven of the ... This paper introduces a multi-task learning framework based upon two widely used architectures: (a) the aforementioned CNN-RNN model for predicting the POS, G, N, P, C, and TAM, and (b) an attention-based ...

arXiv:1811.08619v2 fatcat:qasqykysxjfm7oos4lkeg6mzze

APPENDIX A HIERARCHICAL DATA FORMAT (HDF) FOR THE UTHCD DATABASE Listing 1. Python Code To Extract uTHCD Dataset ... BASIC CNN ARCHITECTURE We aim to design a minimalist CNN architecture to establish baseline accuracy for the proposed database. ... As this study predominantly uses CNN architectures for reporting the use-case scenarios of the proposed dataset, the overall flow diagram of the model building using CNN is shown in Fig. 12. ...

doi:10.1109/access.2021.3096823 fatcat:rdrtnpzavzedtasnsc4ov4a23m

Various methods have been proposed to deal with such a problem. ... In this article, we first introduce several datasets in the community that deal with this task and take a closer look at them by providing some exploratory analysis. ... entity recognition, text summarization, and sentiment analysis. ...

doi:10.1145/3544557 fatcat:zrhucn4xbvguxgtfri7hd2ssdm

... a detection accuracy of 96.4% with a YOLOv5m model without using any human annotation. ... Transcription alignment is a simpler task that aims to find a correspondence between text in the scanned image and its existing Unicode counterpart, a correspondence which can then be used as training ... Furthermore, we thank Kha Cong Nguyen, Cuong Tuan Nguyen, and Masaki Nakagawa, the authors of [3], for providing us with their ground truth for the test images. ...

doi:10.3390/app11114894 fatcat:sfh27dqhgzawzbu7nndzgokfty

HCR-Net: A deep learning based script independent handwritten character recognition network [article]

Improving Scene Text Recognition for Indian Languages with Transfer Learning and Font Diversity

Bangla Sign Language Recognition and Sentence Building Using Deep Learning

Derin Öğrenme Araştırma Alanlarının Literatür Taraması (A Literature Review of Deep Learning Research Areas)

Reading comprehension based question answering system in Bangla language with transformer-based learning

ICCIT 2020 Conference Proceedings [Front matter]

Android Controlled Home Automation

Deep Learning for Sign Language Recognition: Current Techniques, Benchmarks, and Open Issues

CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition [article]

Intelligent Character Recognition System Using Convolutional Neural Network

International Conference on Emerging Technology in Computing, Communication and Electronics

Multi Task Deep Morphological Analyzer: Context Aware Joint Morphological Tagging and Lemma Prediction [article]

uTHCD: A New Benchmarking for Tamil Handwritten OCR

Survey on Aspect Category Detection

Transcription Alignment of Historical Vietnamese Manuscripts without Human-Annotated Learning Samples
