27 Hits in 4.1 sec

Hierarchical structure-and-motion recovery from uncalibrated images

Roberto Toldo, Riccardo Gherardi, Michela Farenzena, Andrea Fusiello
2015 Computer Vision and Image Understanding  
This paper addresses the structure-and-motion problem, that requires to find camera motion and 3D structure from point matches.  ...  This method has several advantages, like a provably lower computational complexity, which is necessary to achieve true scalability, and better error containment, leading to more stability and less drift  ...  della Ligura (MiBAC) and Soprintendenza per i Beni Archeologici della Liguria.  ... 
doi:10.1016/j.cviu.2015.05.011 fatcat:itw3ujs4nrgrxgoea3kode56zi

A Survey of Non-Rigid 3D Registration [article]

Bailin Deng and Yuxin Yao and Roberto M. Dyke and Juyong Zhang
2022 arXiv   pre-print
Both optimization-based and learning-based methods are covered. We also review benchmarks and datasets for evaluating non-rigid registration methods, and discuss potential future research directions.  ...  In particular, we review different approaches for representing the deformation field, and the methods for computing the desired deformation.  ...  It is therefore necessary to remove poor correspondences.  ... 
arXiv:2203.07858v2 fatcat:qono2kgr2bgjhio6rqecntsloe

Deformable Medical Image Registration: A Survey

A. Sotiras, C. Davatzikos, N. Paragios
2013 IEEE Transactions on Medical Imaging  
; 2) longitudinal studies, where temporal structural or anatomical changes are investigated; and 3) population modeling and statistical atlases used to study normal anatomical variability.  ...  In order to study image registration methods in depth, their main components are identified and studied independently. The most recent techniques are presented in a systematic fashion.  ...  Nonetheless, a number of extensions of SIFT in higher dimensions have been proposed. Cheung and Hamarneh extended SIFT in the nD domain [262] and reported results for the 3D and 4D case.  ... 
doi:10.1109/tmi.2013.2265603 pmid:23739795 pmcid:PMC3745275 fatcat:svbac4wihzhylmlntlodzrmple

Deformable Surface 3D Reconstruction from Monocular Images

Mathieu Salzmann, Pascal Fua
2010 Synthesis Lectures on Computer Vision  
In both cases, we will formalize the approach, discuss its inherent ambiguities, and present the practical solutions that have been proposed to resolve them.  ...  two main classes of techniques that have proved most effective so far: The template-based methods that rely on establishing correspondences with a reference image in which the shape is already known, and  ...  We are especially indebted to Slobodan Ilic, Vincent Lepetit, Francesc Moreno-Noguer, Raquel Urtasun, and Aydin Varol.  ... 
doi:10.2200/s00319ed1v01y201012cov003 fatcat:r4s4vpjptvfjtgj3bpgrvav2xq

Outlier removal in real-time object recognition and pose estimation

Mang Shao, Tae-Kyun Kim
2019
Outlier removal algorithms aim to detect and remove abnormal or negative data which sufficiently differ from training samples.  ...  In this thesis, we investigate the application of outlier removal algorithm in object recognition and pose estimation problems.  ...  A brief implementation walkthrough of SIFT keypoint detector and descriptor is given below. Local interest points for SIFT description, or keypoints, are detected via scale-space extrema detection.  ... 
doi:10.25560/74175 fatcat:nl3q7ssjrvftpizs7vwglckhiu

A Review of Multi-Modal Learning from the Text-Guided Visual Processing Viewpoint

Ubaid Ullah, Jeong-Sik Lee, Chang-Hyeon An, Hyeonjin Lee, Su-Yeong Park, Rock-Hyun Baek, Hyun-Chul Choi
2022 Sensors  
Similarly, text and visual data (images and videos) are two distinct data domains with extensive research in the past.  ...  We broadly categorize text-guided visual output into three main divisions and meaningful subdivisions by critically examining an extensive body of literature from top-tier computer vision venues and closely  ...  sec video, multi-lingual descriptions MSR-VTT [375] Internet, AMT 6 6513 497 2990 10,000 original, 320 × 240 30 fps, 20 captions/video, 41.2 h video, 20 categories Text-to-video- Text-video embedding SIFT-keypoints  ... 
doi:10.3390/s22186816 pmid:36146161 pmcid:PMC9503702 fatcat:mqhcrujj5bbebgo2brdnad3p6m

A comparison framework for interleaved persistence modules and applications of persistent homology to problems in fluid dynamics

Rachel Levanger, Steve Ferry, Jerrold Tunnell, Zheng-Chao Han
2017 unpublished
Through your example, you have taught me how to see and learn and develop mathematics and its many applications, and I am forever thankful to have grown so much under your tutelage.  ...  I am also indebted to my many collaborators and peers whom I have had the privilege of working with and learning from over the years.  ...  In the SIFT and SURF algorithms [66, 70], keypoints and their local feature vectors are used for object recognition using a method called Hough transform voting.  ... 
fatcat:c6wrqinbdrenbeqlpxyc72mgb4

Steps Towards a Theory of Visual Information: Active Perception, Signal-to-Symbol Conversion and the Interplay Between Sensing and Control [article]

Stefano Soatto
2017 arXiv   pre-print
This manuscript describes the elements of a theory of information tailored to control and decision tasks and specifically to visual data.  ...  It has ramifications in vision-based control, navigation, 3-D reconstruction and rendering, as well as detection, localization, recognition and categorization of objects and scenes in live video.  ...  In this section we summarize the results of [8], where it is shown that occlusion detection can be formulated as a variational optimization problem and relaxed to a convex optimization, that yields a  ... 
arXiv:1110.2053v4 fatcat:utdycuug75drzkm2a4s74ozeg4

Learning to understand the world in 3D

Riccardo Spezialetti
2021
We then run detectors on scenes, and for each scene keypoint compute the associated descriptor and establish a match with the Nearest Neighbor model descriptor via the kd-tree.  ...  We then run detectors on scenes, and for each scene keypoint establish a match with the Nearest Neighbor model descriptor via the kd-tree associated with the same scale as that found for the scene keypoint  ... 
doi:10.6092/unibo/amsdottorato/9513 fatcat:sfx27ld3mvhfddazrc7xrxt57e

Simultaneous Localization and Mapping: A Survey of Current Trends in Autonomous Driving

Guillaume Bresson, Zayed Alsayed, Li Yu, Sebastien Glaser
2017 IEEE Transactions on Intelligent Vehicles  
We survey the different paradigms of that field (centralized and distributed) and the existing solutions.  ...  Finally, we conclude by giving an overview of the various largescale experiments that have been carried out until now and discuss the remaining challenges and future orientations.  ...  During the first phase (teach), a database is built using SURF keypoints and submapping techniques.  ... 
doi:10.1109/tiv.2017.2749181 fatcat:ohjoahw24zakrmrleg6vogzg3q

Learning Neural Textual Representations for Citation Recommendation

Binh Thanh Kieu, Inigo Jauregi Unanue, Son Bao Pham, Hieu Xuan Phan, Massimo Piccardi
2021 2020 25th International Conference on Pattern Recognition (ICPR)  
Classification of Plastic Waste DAY 3 -Jan 14, 2021 Zhang, Donglin; Wu, Xiaojun; Liu, Zhen; Yu, Jun; Kittler, Josef 1328 Fast Discrete Cross-Modal Hashing Based on Label Relaxation and Matrix  ...  video restoration and compression artifact removal DAY 2 -Jan 13, 2021 Gabdulkhakova, Aysylu; Kropatsch, Walter 3018 Generalized Conics: properties and applications DAY 2 -Jan 13, 2021 -DAY  ... 
doi:10.1109/icpr48806.2021.9412725 fatcat:3vge2tpd2zf7jcv5btcixnaikm

Multiple Feature Fusion Based on Co-Training Approach and Time Regularization for Place Classification in Wearable Video

Vladislavs Dovgalecs, Rémi Mégret, Yannick Berthoumieu
2013 Advances in Multimedia  
The framework combines the power of multiple visual cues and integrates the temporal continuity information of video.  ...  Experiments on a public and a real-world video sequence databases show the gain brought by the different stages of the method.  ...  Acknowledgments This research has received funding from Agence Nationale de la Recherche under Reference ANR-09-BLAN-0165-02 (IMMED project) and the European Community's Seventh Framework Programme (FP7  ... 
doi:10.1155/2013/175064 fatcat:4h72st6qd5hhjd24gusoxf6lqa

Towards accurate multi-person pose estimation in the wild [article]

Insafutdinov Eldar, Universität Des Saarlandes
2021
It is the first dataset of video sequences comprising complex multi-person scenes and fully annotated tracks with 2D keypoints.  ...  applications in robotics, virtual and augmented reality, gaming and healthcare among others.  ...  ACKNOWLEDGEMENTS First and foremost, I owe a sincere gratitude to my supervisor Bernt Schiele. He took me as a master student and guided me through the entire Ph.D. journey. It is  ... 
doi:10.22028/d291-33512 fatcat:ioyobbty6rh7pifk5u5muj7msm

Keyword spotting in handwritten document images using supervised and unsupervised representations [article]

Άγγελος Γιώτης, University Of Ioannina
2022
Many approaches from the document analysis and recognition research community have been proposed to alleviate the search process.  ...  Firstly, I would like to thank my supervisor Professor Christophoros Nikou for his seamless cooperation, his continuous support, his guidance, and most importantly,  ...  [116] represent document regions with a fixed-length descriptor based on the BoW representation of SIFT features extracted via a sliding window over the whole page.  ... 
doi:10.26268/heal.uoi.11485 fatcat:k453klfpmncivj33r2iggojm5y

Learning Equivariant Representations [article]

Carlos Esteves
2020 arXiv   pre-print
Applications include image classification, 3D shape classification and retrieval, panoramic image classification and segmentation, shape alignment and pose estimation.  ...  What these models have in common is that they leverage symmetries in the data to reduce sample and model complexity and improve generalization performance.  ...  [31], we also use the convex hull for extra channels.  ... 
arXiv:2012.02771v1 fatcat:al6pkept5nhupgrvqwkhxycyne
Showing results 1 to 15 out of 27 results