SIFT Keypoint Removal and Injection via Convex Relaxation.

This paper addresses the structure-and-motion problem, which requires finding camera motion and 3D structure from point matches. ... This method has several advantages, like a provably lower computational complexity, which is necessary to achieve true scalability, and better error containment, leading to more stability and less drift ... della Ligura (MiBAC) and Soprintendenza per i Beni Archeologici della Liguria. ...

doi:10.1016/j.cviu.2015.05.011 fatcat:itw3ujs4nrgrxgoea3kode56zi

Both optimization-based and learning-based methods are covered. We also review benchmarks and datasets for evaluating non-rigid registration methods, and discuss potential future research directions. ... In particular, we review different approaches for representing the deformation field, and the methods for computing the desired deformation. ... It is therefore necessary to remove poor correspondences. ...

arXiv:2203.07858v2 fatcat:qono2kgr2bgjhio6rqecntsloe

... 2) longitudinal studies, where temporal structural or anatomical changes are investigated; and 3) population modeling and statistical atlases used to study normal anatomical variability. ... In order to study image registration methods in depth, their main components are identified and studied independently. The most recent techniques are presented in a systematic fashion. ... Nonetheless, a number of extensions of SIFT in higher dimensions have been proposed. Cheung and Hamarneh extended SIFT in the nD domain [262] and reported results for the 3D and 4D case. ...

doi:10.1109/tmi.2013.2265603 pmid:23739795 pmcid:PMC3745275 fatcat:svbac4wihzhylmlntlodzrmple

In both cases, we will formalize the approach, discuss its inherent ambiguities, and present the practical solutions that have been proposed to resolve them. ... two main classes of techniques that have proved most effective so far: The template-based methods that rely on establishing correspondences with a reference image in which the shape is already known, and ... We are especially indebted to Slobodan Ilic, Vincent Lepetit, Francesc Moreno-Noguer, Raquel Urtasun, and Aydin Varol. ...

doi:10.2200/s00319ed1v01y201012cov003 fatcat:r4s4vpjptvfjtgj3bpgrvav2xq

Outlier removal algorithms aim to detect and remove abnormal or negative data which sufficiently differ from training samples. ... In this thesis, we investigate the application of outlier removal algorithms in object recognition and pose estimation problems. ... A brief implementation walkthrough of the SIFT keypoint detector and descriptor is given below. Local interest points for SIFT description, or keypoints, are detected via scale-space extrema detection. ...

doi:10.25560/74175 fatcat:nl3q7ssjrvftpizs7vwglckhiu

Similarly, text and visual data (images and videos) are two distinct data domains with extensive research in the past. ... We broadly categorize text-guided visual output into three main divisions and meaningful subdivisions by critically examining an extensive body of literature from top-tier computer vision venues and closely ... sec video, multi-lingual descriptions MSR-VTT [375] Internet, AMT 6 6513 497 2990 10,000 original, 320 × 240 30 fps, 20 captions/video, 41.2 h video, 20 categories Text-to-video- Text-video embedding SIFT-keypoints ...

doi:10.3390/s22186816 pmid:36146161 pmcid:PMC9503702 fatcat:mqhcrujj5bbebgo2brdnad3p6m

Through your example, you have taught me how to see and learn and develop mathematics and its many applications, and I am forever thankful to have grown so much under your tutelage. ... I am also indebted to my many collaborators and peers whom I have had the privilege of working with and learning from over the years. ... In the SIFT and SURF algorithms [66, 70], keypoints and their local feature vectors are used for object recognition using a method called Hough transform voting. ...

fatcat:c6wrqinbdrenbeqlpxyc72mgb4

This manuscript describes the elements of a theory of information tailored to control and decision tasks and specifically to visual data. ... It has ramifications in vision-based control, navigation, 3-D reconstruction and rendering, as well as detection, localization, recognition and categorization of objects and scenes in live video. ... In this section we summarize the results of [8], where it is shown that occlusion detection can be formulated as a variational optimization problem and relaxed to a convex optimization, that yields a ...

arXiv:1110.2053v4 fatcat:utdycuug75drzkm2a4s74ozeg4

We then run detectors on scenes, and for each scene keypoint compute the associated descriptor and establish a match with the Nearest Neighbor model descriptor via the kd-tree. ... We then run detectors on scenes, and for each scene keypoint establish a match with the Nearest Neighbor model descriptor via the kd-tree associated with the same scale as that found for the scene keypoint ...

doi:10.6092/unibo/amsdottorato/9513 fatcat:sfx27ld3mvhfddazrc7xrxt57e

We survey the different paradigms of that field (centralized and distributed) and the existing solutions. ... Finally, we conclude by giving an overview of the various large-scale experiments that have been carried out until now and discuss the remaining challenges and future orientations. ... During the first phase (teach), a database is built using SURF keypoints and submapping techniques. ...

doi:10.1109/tiv.2017.2749181 fatcat:ohjoahw24zakrmrleg6vogzg3q

Classification of Plastic Waste DAY 3 - Jan 14, 2021 Zhang, Donglin; Wu, Xiaojun; Liu, Zhen; Yu, Jun; Kittler, Josef 1328 Fast Discrete Cross-Modal Hashing Based on Label Relaxation and Matrix ... video restoration and compression artifact removal DAY 2 - Jan 13, 2021 Gabdulkhakova, Aysylu; Kropatsch, Walter 3018 Generalized Conics: properties and applications DAY 2 - Jan 13, 2021 - DAY ...

doi:10.1109/icpr48806.2021.9412725 fatcat:3vge2tpd2zf7jcv5btcixnaikm

The framework combines the power of multiple visual cues and integrates the temporal continuity information of video. ... Experiments on a public and a real-world video sequence database show the gain brought by the different stages of the method. ... Acknowledgments This research has received funding from Agence Nationale de la Recherche under Reference ANR-09-BLAN-0165-02 (IMMED project) and the European Community's Seventh Framework Programme (FP7 ...

doi:10.1155/2013/175064 fatcat:4h72st6qd5hhjd24gusoxf6lqa

It is the first dataset of video sequences comprising complex multi-person scenes and fully annotated tracks with 2D keypoints. ... applications in robotics, virtual and augmented reality, gaming and healthcare among others. ... ACKNOWLEDGEMENTS First and foremost, I owe sincere gratitude to my supervisor Bernt Schiele. He took me on as a master's student and guided me through the entire Ph.D. journey. It is ...

doi:10.22028/d291-33512 fatcat:ioyobbty6rh7pifk5u5muj7msm

Many approaches from the document analysis and recognition research community have been proposed to alleviate the search process. ... Firstly, I would like to thank my supervisor Professor Christophoros Nikou for his seamless cooperation, his continuous support, his guidance, and most importantly, ... [116] represent document regions with a fixed-length descriptor based on the BoW representation of SIFT features extracted via a sliding window over the whole page. ...

doi:10.26268/heal.uoi.11485 fatcat:k453klfpmncivj33r2iggojm5y

Applications include image classification, 3D shape classification and retrieval, panoramic image classification and segmentation, shape alignment and pose estimation. ... What these models have in common is that they leverage symmetries in the data to reduce sample and model complexity and improve generalization performance. ... [31], we also use the convex hull for extra channels. ...

arXiv:2012.02771v1 fatcat:al6pkept5nhupgrvqwkhxycyne

Hierarchical structure-and-motion recovery from uncalibrated images

A Survey of Non-Rigid 3D Registration [article]

Deformable Medical Image Registration: A Survey

Deformable Surface 3D Reconstruction from Monocular Images

Outlier removal in real-time object recognition and pose estimation

A Review of Multi-Modal Learning from the Text-Guided Visual Processing Viewpoint

A comparison framework for interleaved persistence modules and applications of persistent homology to problems in fluid dynamics

Steps Towards a Theory of Visual Information: Active Perception, Signal-to-Symbol Conversion and the Interplay Between Sensing and Control [article]

Learning to understand the world in 3D

Simultaneous Localization and Mapping: A Survey of Current Trends in Autonomous Driving

Learning Neural Textual Representations for Citation Recommendation

Multiple Feature Fusion Based on Co-Training Approach and Time Regularization for Place Classification in Wearable Video

Towards accurate multi-person pose estimation in the wild [article]

Keyword spotting in handwritten document images using supervised and unsupervised representations [article]

Learning Equivariant Representations [article]
