Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Filters








70,119 Hits in 4.8 sec

Segmentation of complex document

Souad Oudjemia, Zohra Ameur, Abdeldjali Ouahabi
2014 Carpathian Journal of Electronic and Computer Engineering  
This technique based on GLCM (Grey Level Co-occurrence Matrix) used to segment this type of document in three regions namely, 'graphics', 'background' and 'text'.  ...  In this paper we present a method for segmentation of documents image with complex structure.  ...  Other mixed methods exist and most are based on the principle of division and fusion [5] [11] [16] [27] .  ... 
doaj:09063aecbf7a4101b97c4c717fb931e1 fatcat:mq3rv74nezbmlfy6uymckqfktq

Enhanced Techniques for PDF Image Segmentation and Text Extraction

Madhuri Patil, Monika Pune, Ajay Zaware, A.D. Kulkarni
2020 International journal of recent advances in engineering & technology  
This paper presents two strategies under square based portrayal. After a concise presentation of the arrangement strategies, two techniques were improved and results were assessed.  ...  The execution estimations for division and time usage are striven for both the models.  ...  BLOCK BASED SEGMENTATION To change and rearrange the representation of a picture into something which is more important and less demanding to break down is called as block based segmentation.  ... 
doi:10.46564/ijraet.2020.v08i05.006 fatcat:isoxks4cird4flnv4ndleiphbm

A COMPARATIVE ANALYSIS OF LINE AND WORD SEGMENTATION FOR HANDWRITTEN DOCUMENT IMAGE

Neerugatti Varipally Vishwanath
2018 International Journal of Advanced Research in Computer Science  
A recentapproach [8] uses block-based Hough transform to detect lines and merging methods to correct false alarms.  ...  One of themost accurate methods uses piece-wise projection profiles to obtain an initial set of candidate lines and bivariate Gaussiandensities to assign overlapping CCs into text lines [7] .  ... 
doi:10.26483/ijarcs.v9i1.5428 fatcat:2cwapuqydjg67kcxpiwmp2hpxq

OntoSeg: A Novel Approach to Text Segmentation Using Ontological Similarity

Mostafa Bayomi, Killian Levacher, M. Rami Ghorab, Seamus Lawless
2015 2015 IEEE International Conference on Data Mining Workshop (ICDMW)  
This paper proposes OntoSeg, a novel approach to text segmentation based on the ontological similarity between text blocks.  ...  Current approaches to text segmentation are similar in that they all use word-frequency metrics to measure the similarity between two regions of text, so that a document is segmented based on the lexical  ...  ACKNOWLEDGMENT This work is supported by Science Foundation Ireland (Grant 12/CE/I2267) as part of CNGL Centre for Global Intelligent Content (www.cngl.ie) at Trinity College Dublin.  ... 
doi:10.1109/icdmw.2015.6 dblp:conf/icdm/BayomiLGL15 fatcat:x6mguiynjnh3be74lpz7gfaloi

Extraction of Hidden Text from Images using DWT

V. Beslin Geo, K. Sakthidasan @ Sankaran, P. Archana, M. Umarani
2018 International Journal of Engineering & Technology  
The proposed method uses connected region and edge detection approach which provides a segmented text from digital video stills.  ...  Compression of digital images leads to poor visual quality of background and text images. Digital images are significantly considered and segmented using DWT into text and background blocks.  ...  Based on this a high clarification text block is obtained by the high-frequency wavelet coefficients. D. Algorithm of DWT block-based text extraction 1.  ... 
doi:10.14419/ijet.v7i4.36.23908 fatcat:axao4r55mbgblnwpzzgyphx374

Segmentation of Arabic Handwritten Text to Lines

Mokhtari Younes, Yousfi Abdellah
2015 Procedia Computer Science  
One of the most important operations in a handwriting recognition system is segmentation.  ...  Segmentation of handwritten text is a necessary step in the development of a system of automatic writing recognition.  ...  blocks. 2 Our method is based on the subdivision of the document into columns by using a sliding window of a fixed size.  ... 
doi:10.1016/j.procs.2015.12.056 fatcat:eawbncqtdvadfhcroccair5h3u

Gradient-Angular-Features for Word-wise Video Script Identification

Palaiahnakote Shivakumara, Nabin Sharma, Umapada Pal, Michael Blumenstein, Chew Lim Tan
2014 2014 22nd International Conference on Pattern Recognition  
Script identification at the word level is challenging because of complex backgrounds and low resolution of video. The presence of graphics and scene text in video makes the problem more challenging.  ...  [7] have proposed a method based on spatial-gradient-features to identify the script in a frame at block level.  ...  The dominant text pixel selection is performed based on the histograms of the horizontal gradient and vertical gradient division.  ... 
doi:10.1109/icpr.2014.534 dblp:conf/icpr/ShivakumaraSPBT14 fatcat:4y5hpfuz3jfgvcjackzf4ihhpu

Compression of CompoundImages using Fuzzy Clustering Technique

K. Uma, P. Radhakrishnan, R. Vinoth
2016 Indian Journal of Science and Technology  
Finally, pattern matching procedure has been made to match the encoded blocks based on different scale dimensions.  ...  Hence, a new compression method based on the newly announced coding paradigm called Fuzzy cluster based compression is proposed in this paper that gives high coding efficiency for an extensive variety  ...  The text blocks or the non-smooth blocks are segmented using the Binary Tree Based Segmentation process where the text blocks are separated recursively in the tree based approach.  ... 
doi:10.17485/ijst/2016/v9is1/112864 fatcat:ltm7mechk5gafbwv3eopkcq4ie

Application on Web Page Filtering Technology

Bo Shen, Lei Li, Ning-wei Wang
2014 International Journal of Multimedia and Ubiquitous Engineering  
The most common methods are: methods based on statistics, dictionary and understanding: In a text, the more times adjacent characters simultaneously appear, the more they are likely to constitute a word  ...  Based on DIV tags dividing the content block of the page, this paper proposes a new data filtering scheme, DVPS algorithm.  ...  block division, we proposes DVPS algorithm based on visual characteristics.  ... 
doi:10.14257/ijmue.2014.9.12.35 fatcat:tdapvuf5azgz7js2yifg42rxzi

An Image based Steganography Scheme Implying Pseudo-Random Mapping of Text Segments to Logical Region of Cover Image using a New Block Mapping Function and Randomization Technique

Shiladitya Pujari, Sripati Mukhopadhyay
2012 International Journal of Computer Applications  
Based on PVD method, various approaches have also been proposed. Among them Chang et al. proposes a new method using tri-way pixel-value differencing.  ...  divided into a random number of square blocks.  ...  Step VI: This algorithm is based on hiding of text into equal sized square blocks of cover image.  ... 
doi:10.5120/7746-0799 fatcat:2aiezmyl6bhezcc5yjcyojvuqu

Segmentation of Connected Components and Overlapping Lines in Gurumukhi Handwritten Documents

Snehdeep Snehdeep, Manoj Kumar
2014 International Journal of Computer Applications  
This paper also provides a review on major problems in line segmentation that decreases the accuracy of recognition system. The proposed method has achieved 93.05% accuracy in text line segmentation.  ...  The proposed algorithm is based on mid-point detection.  ...  [3] , has proposed a technique based on cut text method (CTM).The proposed technique uses the concept of difference between header and base lines for segmenting handwritten Hindi text documents.  ... 
doi:10.5120/17874-8850 fatcat:b6iw2u2i5rh5xdrnl7dnjxb4qe

USING H.264/AVC-INTRA FOR DCT BASED SEGMENTATION DRIVEN COMPOUND IMAGE COMPRESSION

Ebenezer Juliet S, Sadasivam V, Jemi Florinabel D
2011 ICTACT Journal on Image and Video Processing  
It segments computer screen images into text/graphics and picture/background classes based on DCT energy in each 4x4 block, and then compresses both text/graphics pixels and picture/background blocks by  ...  This paper presents a one pass block classification algorithm for efficient coding of compound images which consists of multimedia elements like text, graphics and natural images.  ...  The segmentation is based on hierarchical color clustering and a variety of filters. Block-based approaches are studied mainly due to its low complexity.  ... 
doi:10.21917/ijivp.2011.0041 fatcat:m6eeprkkhjfgzd6n36abkhewby

The role of logical and generic document structure in relational discourse analysis [chapter]

Maja Bärenfänger, Harald Lüngen, Mirco Hilbert, Henning Lobin
2010 Pragmatics & Beyond New Series  
The segments of type "block" partition the document, i.e. every piece of text is part of exactly one CDS type="block".  ...  of a text and the text as a whole, while an RST relation establishes a functional relation between two or more parts of a text (discourse segments Method, Answers) should be determined automatically for  ... 
doi:10.1075/pbns.194.05bar fatcat:co6bbbrpa5bn7k5vs7u7kgushy

Segmentation Methods: A Review

Nikita Mehta
2020 International Journal for Research in Applied Science and Engineering Technology  
A document is segmented on different levels to extract the smallest individual unit of the text -an individual character.  ...  There are two types of documents: machine printed and handwritten. Segmentation of a handwritten document is more difficult than printed one.  ...  based methods a) Docstrum The x-y cut or recursive x-y cut algorithm [9] [10] is a top down approach decomposes a document image into a set of rectangular blocks.  ... 
doi:10.22214/ijraset.2020.31939 fatcat:26ogikpwnbaqnmumobwss5s4te

Line segmentation of handwritten Gurmukhi manuscripts

Simpel Jindal, Gurpreet Singh Lehal
2012 Proceeding of the workshop on Document Analysis and Recognition - DAR '12  
In this paper, we have discussed a method for segmenting lines for Gurmukhi handwritten manuscripts.  ...  Segmentation is one of the important phase of an OCR, as accuracy of an OCR depends upon the accuracy of segmentation.  ...  We are still working to find optimal number of strips in which the whole document should be divided. 2. Division of larger text blocks into smaller text blocks is not 100% accurate. 3.  ... 
doi:10.1145/2432553.2432568 dblp:conf/icvgip/JindalL12 fatcat:efprzy4ypbglvemtt3zhoe5tme
« Previous Showing results 1 — 15 out of 70,119 results