Learning accurate template matching with differentiable coarse-to-fine correspondence refinement.

To produce such embeddings, we propose a pixel-level contrastive learning framework. A coarse-to-fine strategy ensures both global and local anatomical information are encoded. ... On two X-ray datasets, SAM, with only one labeled template image, surpasses supervised methods trained on 50 labeled images. ... Meanwhile, it needs to encode local information to differentiate adjacent structures with similar appearance for accurate localization. ...

arXiv:2012.02383v3 fatcat:3v3h4q6kwnfcxe7j2vfqbtxfpe

Open Access Multiple Versions

For this purpose, we introduce an end-to-end CNN framework which derives the shape in a coarse-to-fine fashion. ... The proposed architecture is composed of two main blocks, a network that recovers the coarse facial geometry (CoarseNet), followed by a CNN that refines the facial features of that geometry (FineNet). ... Acknowledgments Research leading to these results was supported by European Community's FP7-ERC program, grant agreement no. 267414. ...

arXiv:1611.05053v2 fatcat:ifdct32ajfcx3brvubkvtjrzoi

Multiple Versions

The fine-grained object pose estimator is also aided by correspondence of learned spatial descriptor of the input image pair. ... Second, we demonstrate the ability of the learned semantic correspondence to perform unsupervised part-segmentation transfer using only a single part-annotated 3D template model per object class. ... [33] explored the idea of coarse to fine level view-point estimation. ...

arXiv:1808.01134v1 fatcat:csdq5aqnpbci7lfvupcvzhvfbq

Thanks to recently released collections of deformable objects with known intra-state correspondences, DispVoxNets learn a deformation model and further priors (e.g., weak point topology preservation) for ... All properties of DispVoxNets are ascertained numerically and qualitatively in extensive experiments and comparisons to several previous methods. ... −xi 2 √ D , with the template points y i and corresponding reference points x i . ...

arXiv:1907.10367v2 fatcat:7kfk7pjwyfaz5opqln22tjubxi

Multiple Versions

Thanks to recently released collections of deformable objects with known intra-state correspondences, DispVoxNets learn a deformation model and further priors (e.g., weak point topology preservation) for ... All properties of DispVoxNets are ascertained numerically and qualitatively in extensive experiments and comparisons to several previous methods. ... −xi 2 √ D , with the template points y i and corresponding reference points x i . ...

doi:10.1109/3dv.2019.00013 dblp:conf/3dim/ShimadaGTST19 fatcat:yyu77nlzlbhahay2cosi4y65bq

Then the depth images from multi-view stereo along with the rendered depth images of the coarse shape are used as a contrastive input whose features guide the refinement of the coarse shape through a series ... Deep learning based 3D shape generation methods generally utilize latent features extracted from color images to encode the semantics of objects and guide the shape generation process. ... Our system can struggle to roughly reconstruct shapes with very complex topology while some fine topology of the mesh is missing. ...

arXiv:2010.08682v3 fatcat:mszoiznjofcqllyc77ywdkqogu

Multiple Versions

compared to voxel representations. ... We also introduce a new algorithm in 3D reconstruction at an arbitrary resolution by aggregating vectors along three orthogonal axes, resolving inherent problems with regressing fixed dimension of vectors ... We then refine our coarse-level vectors along depth by attending to high-resolution visual cues. ...

arXiv:2401.14565v1 fatcat:j7zdd3kfyvhfbl5rxolrzdif7y

CC2D-SSL captures the consistent anatomical information in a coarse-to-fine fashion by comparing the cascade feature representations and generates predictions on the training set. ... CC2D consists of two stages: 1) Self-supervised learning (CC2D-SSL) and 2) Training with pseudo-labels (CC2D-TPL). ... Therefore, we propose to match the coarse-grained corresponding areas first, then gradually compare the finer-grained areas in the selected coarse area. ...

arXiv:2103.04527v1 fatcat:bahjye6iunayvksgj72pnivq3e

Open Access

Moreover, we propose a coarse to fine training strategy, which enables fine-grained correspondence prediction. ... Establishing correspondences from image to 3D has been a key task of 6DoF object pose estimation for a long time. To predict pose more accurately, deeply learned dense maps replaced sparse templates. ... We are thankful to Rene Schuster, Fangwen Shu, Yaxu Xie and Ghazal Ghazaei for proofreading the paper. ...

arXiv:2203.09418v2 fatcat:5rvexg3jqjgl7l5i4x6rsaamea

Multiple Versions

In this paper, we present an end-to-end single-view mesh reconstruction framework that is able to generate high-quality meshes with complex topologies from a single genus-0 template mesh. ... Moreover, a boundary refinement network is designed to refine the boundary conditions to further improve the visual quality of the reconstructed mesh. ... Such a design paradigm enables more accurate learning of fine geometric details with even less training time. ...

arXiv:1909.00321v1 fatcat:w5bvpfrcinbclfx5gtia5vb3sy

to refine the initial mesh. ... Thus, they are limited to the reconstruction of objects that have the same topology as the template. ... While Mesh R-CNN can reconstruct arbitrary topologies, the use of coarse voxel grids limits its ability to accurately reconstruct fine structures such as the chair legs [25] . ...

arXiv:2109.11844v1 fatcat:cmz742pllbf7hpy2qvys2wakfy

Owing to recent advancements in deep learning methods and relevant databases, it is becoming increasingly easier to recognize 3D objects using only RGB images from single viewpoints. ... For relatively low-cost data acquisition systems without depth sensors or cameras at multiple viewpoints, we first consider existing databases with 2D RGB photos and their relevant attributes. ... Coarse-to-fine KITTI 3D than Mono3D, due to the lower to-fine method, considerably re-resolution of images in the coarse- No steps for accurate 2D vehicle bounding ducing a search space. boxes. ...

doi:10.3390/electronics10040517 fatcat:rziqhrkefvelpg3vgb6qxfprte

DOAJ

First, RIZER, as a whole, adopts a detector-free coarse-to-fine registration framework, and the data-driven methods use a Transformer based on zero-shot learning. ... After testing, RIZER achieved a correct matching rate of 99.55% with an RMSE of 1.36 and had an advantage in the number of correct matches, as well as a good generalization ability for other multimodal ... of image registration for deep learning to efficiently and accurately obtain a wide range of correspondences. ...

doi:10.3390/rs16020214 fatcat:2qxooy776fertkoenz2tywactu

DOAJ Szczepanski

A non-linear matching scheme is introduced to estimate the location of the boundary points using only the intensity data within each image plane. ... We use the term sparse to describe volumetric images in which the sampling resolution within the imaging plane is far higher than that of the perpendicular direction. ... Our matching strategy is built around a coarse-to-fine approach. ...

doi:10.1016/s1361-8415(00)00020-7 pmid:11145314 fatcat:5mqh4hpucncdpij3m5ijtzuozi

However, there are two major limitations with these approaches. First, the computed regional features might not reflect the actual regional tensor distributions. ... Subsequently, regional integration and other operations such as edge detection are performed to extract more features to guide the registration. ... warrant the differentiation of different anatomical structures and hence can be utilized to assist correspondence matching in the course of registration. ...

doi:10.1109/cvprw.2009.5204350 dblp:conf/cvpr/YapWZLS09 fatcat:y72qo53befdzde27l5k5epgmtq

SAM: Self-supervised Learning of Pixel-wise Anatomical Embeddings in Radiological Images [article]

Preserved Fulltext

Other Versions

Learning Detailed Face Reconstruction from a Single Image [article]

Preserved Fulltext

Other Versions

iSPA-Net: Iterative Semantic Pose Alignment Network [article]

Preserved Fulltext

DispVoxNets: Non-Rigid Point Set Alignment with Supervised Learning Proxies [article]

Preserved Fulltext

Other Versions

DispVoxNets: Non-Rigid Point Set Alignment with Supervised Learning Proxies

Preserved Fulltext

MeshMVS: Multi-View Stereo Guided Mesh Reconstruction [article]

Preserved Fulltext

Other Versions

TIFu: Tri-directional Implicit Function for High-Fidelity 3D Character Reconstruction [article]

Preserved Fulltext

One-Shot Medical Landmark Detection [article]

Preserved Fulltext

ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation [article]

Preserved Fulltext

Other Versions

Deep Mesh Reconstruction from Single RGB Images via Topology Modification Networks [article]

Preserved Fulltext

Learnable Triangulation for Deep Learning-based 3D Reconstruction of Objects of Arbitrary Topology from Single RGB Images [article]

Preserved Fulltext

A Survey on Deep Learning Based Methods and Datasets for Monocular 3D Object Detection

Preserved Fulltext

Radiation-Variation Insensitive Coarse-to-Fine Image Registration for Infrared and Visible Remote Sensing Based on Zero-Shot Learning

Preserved Fulltext

Detecting the brain surface in sparse MRI using boundary models

Preserved Fulltext

TIMER: Tensor Image Morphing for Elastic Registration

Preserved Fulltext