Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Filters








31 Hits in 5.6 sec

HCR-Net: A deep learning based script independent handwritten character recognition network [article]

Vinod Kumar Chauhan, Sukhdeep Singh, Anuj Sharma
2024 arXiv   pre-print
Handwritten character recognition (HCR) remains a challenging pattern recognition problem despite decades of research, and lacks research on script independent recognition techniques.  ...  HCR-Net is extensively evaluated on 40 publicly available datasets of Bangla, Punjabi, Hindi, English, Swedish, Urdu, Farsi, Tibetan, Kannada, Malayalam, Telugu, Marathi, Nepali and Arabic languages, and  ...  Moreover, it is observed that a multi-column multi-scale CNN architecture proposed by [8] performs exceptionally well for Bangla script.  ... 
arXiv:2108.06663v4 fatcat:a5dof25aojhxpazmfjprtx5wca

Improving Scene Text Recognition for Indian Languages with Transfer Learning and Font Diversity

Sanjana Gunna, Rohit Saluja, Cheerakkuzhi Veluthemana Jawahar
2022 Journal of Imaging  
Further enriching the synthetic dataset with non-Unicode fonts and multiple augmentations helps us achieve a remarkable Word Recognition Rate gain of over 33% on the IIIT-ILST Hindi dataset.  ...  This work investigates the significant differences in Indian and Latin Scene Text Recognition (STR) systems.  ...  Data Availability Statement: Data available in a publicly accessible repository that does not issue DOIs.  ... 
doi:10.3390/jimaging8040086 pmid:35448213 pmcid:PMC9025185 fatcat:m7m6oppkevagblcx67yx3yku2u

Bangla Sign Language Recognition and Sentence Building Using Deep Learning

Safayet Anowar Shurid, Khandaker Habibul Amin, Md. Shahnawaz Mirbahar, Dolan Karmaker, Mohammad Tanvir Mahtab, Farhan Tanvir Khan, Md. Golam Rabiul Alam, Md. Ashraful Alam
2020 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)  
This drove us towards finding a solution that would limit the gap instead of the communication and build a bridge instead of a boundary. ily communicate with them.  ...  Another paper that describes work on recognition and translation of sign languages, uses AI based solutions [6] .  ... 
doi:10.1109/csde50874.2020.9411523 fatcat:tfkrskin55hxpdzajiwtw2ooou

Derin Öğrenme Araştırma Alanlarının Literatür Taraması

M. Mutlu Yapıcı, Adem Tekerek, Nurettin Topaloğlu
2019 Gazi Mühendislik Bilimleri Dergisi  
Nowadays, many researchers try to find solutions to many problems in various fields under the light of DL methods.  ...  , voice and video recognition, medical image processing, and big data.  ...  [170] provided a head-to-head comparison between the latest technology in the mammography CAD system and the CNN to obtain a system that can read mammography independently.  ... 
doi:10.30855/gmbd.2019.03.01 fatcat:2sv7dg7elrfqppcjx5otzmb7pi

Reading comprehension based question answering system in Bangla language with transformer-based learning

Tanjim Taharat Aurpa, Richita Khandakar Rifat, Md Shoaib Ahmed, Md. Musfique Anwar, A. B. M. Shawkat Ali
2022 Heliyon  
In the Bangla language, neither any dataset available for RC nor any work has been done in the past. In this research work, we develop a question-answering system from RC.  ...  For doing this, we construct a dataset containing 3636 reading comprehensions along with questions and answers.  ...  Multi-Head Attention: The multi-head self-attention mechanism in the transformer has three different uses.  ... 
doi:10.1016/j.heliyon.2022.e11052 pmid:36254291 pmcid:PMC9568857 fatcat:bgexsoawhrbb5isoeoo4ux3j7q

ICCIT 2020 Conference Proceedings [Front matter]

2020 2020 23rd International Conference on Computer and Information Technology (ICCIT)  
We further build a Bangla WordNet, BNNet, using these resources and map it with the Princeton English WordNet.  ...  With a variation in typical CNN model architecture, a satisfactory result is found to predict the genre of a particular artwork with an accuracy of 98.21%. 2 6 Omar Faruqe, Maliha Elma, Nahid Hossain  ...  Our Purpose is to build a platform with clustering algorithms which will jointly help to provide the quickest solution to find blood or plasma donor.  ... 
doi:10.1109/iccit51783.2020.9392749 fatcat:pz3hf7rsmzbjpe6hxjlu5tmrfq

Android Controlled Home Automation

2021 American Journal of Electronics & Communication  
The most used concept in planning is sampling-based planning, it provides a successful solution in wayfinding path planning, and because of this, it is performed in different robotics fields.  ...  The basic strategy of the overall behavior tasks in a behavior-based system are divided into smaller independent behaviors that focus on the performance of specific tasks such as the behavior of a robot  ...  From a statistical analysis we notice that the probability that a Bangla word will have at least one character with head-line is 0.994.  ... 
doi:10.15864/ajec.1303 fatcat:sitz5xykknhahawc54yptjptqa

Deep Learning for Sign Language Recognition: Current Techniques, Benchmarks, and Open Issues

Muhammad Al-Qurishi, Thariq Khalid, Riad Souissi
2021 IEEE Access  
In addition, recent advances have allowed researchers to move from simple recognition of sign language characters and words towards the capacity to translate continuous sign language communication with  ...  People with hearing impairments are found worldwide; therefore, the development of effective local level sign language recognition (SLR) tools is essential.  ...  [122] used the CNN approach to train a dataset obtained from the Bangla Sign Language.  ... 
doi:10.1109/access.2021.3110912 fatcat:mcjehb6znjcijhk2wzgdxbmzqq

CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition [article]

Tianlun Zheng, Zhineng Chen, Shancheng Fang, Hongtao Xie, Yu-Gang Jiang
2023 arXiv   pre-print
However, recent studies show that the two kinds of clues are not always well registered and therefore, feature and character might be misaligned in difficult text (e.g., with a rare shape).  ...  The Transformer-based encoder-decoder framework is becoming popular in scene text recognition, largely because it naturally integrates recognition clues from both visual and semantic domains.  ...  A recent study reported very fast yet efficient solutions by leveraging ViT-like architecture (Du et al., 2022) . We plan to incorporate it to speed up the recognition.  ... 
arXiv:2111.11011v5 fatcat:hdk2xpebr5fvzf7hwbov626ot4

Intelligent Character Recognition System Using Convolutional Neural Network

S. Suriya, Dhivya S, Balaji M
2020 EAI Endorsed Transactions on Cloud Systems  
Intellectual Character Recognition System is an application that uses Convolutional Neural Network (CNN) to recognize the Tamil character dataset accurately developed by HP Labs India.  ...  The proposed approach is capable of recognizing characters in a variety of challenging conditions using the Convolutional Neural Network, where traditional character recognition systems fail, notably in  ...  A Convolutional Neural Network (CNN, or ConvNet) is a unique sort of multi-layer neural systems, intended to perceive visual examples legitimately from pixel pictures with negligible preprocessing.  ... 
doi:10.4108/eai.16-10-2020.166659 fatcat:rrv3tyk2ezegdhcwsvuvvkgbrq

International Conference on Emerging Technology in Computing, Communication and Electronics

2020 2020 Emerging Technology in Computing, Communication and Electronics (ETCCE)  
The author applies a most recent state-of-art scalingbased 3D-CNN different pretrained deep neural architectures, such as VGG19, and ResNet-50 to compare the performance with the proposed architecture.  ...  In this work, we have used multiple deep convolution neural networks (CNN) with the same architecture of InceptionV3.  ...  Paper ID: 121 Bangla Broadcast Speech Recognition Using Support Vector Machine Authors: Refat Noor Swarna Abstract: Over the past few decades, incredible growth has been revealed in the recognition  ... 
doi:10.1109/etcce51779.2020.9350902 fatcat:2drvbgkrmjabfeawuw33hui6yy

Multi Task Deep Morphological Analyzer: Context Aware Joint Morphological Tagging and Lemma Prediction [article]

Saurav Jha, Akhilesh Sudhakar, Anil Kumar Singh
2019 arXiv   pre-print
We propose the Multi Task Deep Morphological analyzer (MT-DMA), a character-level neural morphological analyzer based on multitask learning of word-level tag markers for Hindi and Urdu.  ...  Exploiting character-level features in phonological space optimized for each tag using multi-objective genetic algorithm, our model establishes a new state-of-the-art accuracy score upon all seven of the  ...  This paper introduces a multi-task learning framework based upon two widely used architectures: (a) the aforementioned CNN-RNN model for predicting the POS, G, N, P, C, and TAM, and (b) an attention-based  ... 
arXiv:1811.08619v2 fatcat:qasqykysxjfm7oos4lkeg6mzze

uTHCD: A New Benchmarking for Tamil Handwritten OCR

Noushath Shaffi, Faizal Hajamohideen
2021 IEEE Access  
APPENDIX A HIERARCHICAL DATA FORMAT (HDF) FOR THE UTHDC DATABASE Listing 1. Python Code To Extract uTHCD Dataset  ...  BASIC CNN ARCHITECTURE We aim to design a minimalist CNN architecture to establish baseline accuracy for the proposed database.  ...  As this study predominantly uses the CNN architectures for reporting the use-case scenarios of the proposed dataset, the overall flow diagram of the model building using CNN is shown in Fig.12 .  ... 
doi:10.1109/access.2021.3096823 fatcat:rdrtnpzavzedtasnsc4ov4a23m

Survey on Aspect Category Detection

Siva Uday Sampreeth Chebolu, Paolo Rosso, Sudipta Kar, Thamar Solorio
2022 ACM Computing Surveys  
Various methods have been proposed to deal with such a problem.  ...  In this article, we first introduce several datasets in the community that deal with this task and take a closer look at them by providing some exploratory analysis.  ...  entity recognition, text summarization, and sentiment analysis.  ... 
doi:10.1145/3544557 fatcat:zrhucn4xbvguxgtfri7hd2ssdm

Transcription Alignment of Historical Vietnamese Manuscripts without Human-Annotated Learning Samples

Anna Scius-Bertrand, Michael Jungo, Beat Wolf, Andreas Fischer, Marc Bui
2021 Applied Sciences  
a detection accuracy of 96.4% with a YOLOv5m model without using any human annotation.  ...  Transcription alignment is a simpler task that aims to find a correspondence between text in the scanned image and its existing Unicode counterpart, a correspondence which can then be used as training  ...  Furthermore, we thank Kha Cong Nguyen, Cuong Tuan Nguyen, and Masaki Nakagawa, the authors of [3] , for providing us with their ground truth for the test images.  ... 
doi:10.3390/app11114894 fatcat:sfh27dqhgzawzbu7nndzgokfty
« Previous Showing results 1 — 15 out of 31 results