A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2023; you can also visit the original URL.
The file type is application/pdf
.
Filters
SAM: Self-supervised Learning of Pixel-wise Anatomical Embeddings in Radiological Images
[article]
2023
arXiv
pre-print
To produce such embeddings, we propose a pixel-level contrastive learning framework. A coarse-to-fine strategy ensures both global and local anatomical information are encoded. ...
On two X-ray datasets, SAM, with only one labeled template image, surpasses supervised methods trained on 50 labeled images. ...
Meanwhile, it needs to encode local information to differentiate adjacent structures with similar appearance for accurate localization. ...
arXiv:2012.02383v3
fatcat:3v3h4q6kwnfcxe7j2vfqbtxfpe
Learning Detailed Face Reconstruction from a Single Image
[article]
2017
arXiv
pre-print
For this purpose, we introduce an end-to-end CNN framework which derives the shape in a coarse-to-fine fashion. ...
The proposed architecture is composed of two main blocks, a network that recovers the coarse facial geometry (CoarseNet), followed by a CNN that refines the facial features of that geometry (FineNet). ...
Acknowledgments Research leading to these results was supported by European Community's FP7-ERC program, grant agreement no. 267414. ...
arXiv:1611.05053v2
fatcat:ifdct32ajfcx3brvubkvtjrzoi
iSPA-Net: Iterative Semantic Pose Alignment Network
[article]
2018
arXiv
pre-print
The fine-grained object pose estimator is also aided by correspondence of learned spatial descriptor of the input image pair. ...
Second, we demonstrate the ability of the learned semantic correspondence to perform unsupervised part-segmentation transfer using only a single part-annotated 3D template model per object class. ...
[33] explored the idea of coarse to fine level view-point estimation. ...
arXiv:1808.01134v1
fatcat:csdq5aqnpbci7lfvupcvzhvfbq
DispVoxNets: Non-Rigid Point Set Alignment with Supervised Learning Proxies
[article]
2019
arXiv
pre-print
Thanks to recently released collections of deformable objects with known intra-state correspondences, DispVoxNets learn a deformation model and further priors (e.g., weak point topology preservation) for ...
All properties of DispVoxNets are ascertained numerically and qualitatively in extensive experiments and comparisons to several previous methods. ...
−xi 2 √ D , with the template points y i and corresponding reference points x i . ...
arXiv:1907.10367v2
fatcat:7kfk7pjwyfaz5opqln22tjubxi
DispVoxNets: Non-Rigid Point Set Alignment with Supervised Learning Proxies
2019
2019 International Conference on 3D Vision (3DV)
Thanks to recently released collections of deformable objects with known intra-state correspondences, DispVoxNets learn a deformation model and further priors (e.g., weak point topology preservation) for ...
All properties of DispVoxNets are ascertained numerically and qualitatively in extensive experiments and comparisons to several previous methods. ...
−xi 2 √ D , with the template points y i and corresponding reference points x i . ...
doi:10.1109/3dv.2019.00013
dblp:conf/3dim/ShimadaGTST19
fatcat:yyu77nlzlbhahay2cosi4y65bq
MeshMVS: Multi-View Stereo Guided Mesh Reconstruction
[article]
2021
arXiv
pre-print
Then the depth images from multi-view stereo along with the rendered depth images of the coarse shape are used as a contrastive input whose features guide the refinement of the coarse shape through a series ...
Deep learning based 3D shape generation methods generally utilize latent features extracted from color images to encode the semantics of objects and guide the shape generation process. ...
Our system can struggle to roughly reconstruct shapes with very complex topology while some fine topology of the mesh is missing. ...
arXiv:2010.08682v3
fatcat:mszoiznjofcqllyc77ywdkqogu
TIFu: Tri-directional Implicit Function for High-Fidelity 3D Character Reconstruction
[article]
2024
arXiv
pre-print
compared to voxel representations. ...
We also introduce a new algorithm in 3D reconstruction at an arbitrary resolution by aggregating vectors along three orthogonal axes, resolving inherent problems with regressing fixed dimension of vectors ...
We then refine our coarse-level vectors along depth by attending to high-resolution visual cues. ...
arXiv:2401.14565v1
fatcat:j7zdd3kfyvhfbl5rxolrzdif7y
One-Shot Medical Landmark Detection
[article]
2021
arXiv
pre-print
CC2D-SSL captures the consistent anatomical information in a coarse-to-fine fashion by comparing the cascade feature representations and generates predictions on the training set. ...
CC2D consists of two stages: 1) Self-supervised learning (CC2D-SSL) and 2) Training with pseudo-labels (CC2D-TPL). ...
Therefore, we propose to match the coarse-grained corresponding areas first, then gradually compare the finer-grained areas in the selected coarse area. ...
arXiv:2103.04527v1
fatcat:bahjye6iunayvksgj72pnivq3e
ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation
[article]
2022
arXiv
pre-print
Moreover, we propose a coarse to fine training strategy, which enables fine-grained correspondence prediction. ...
Establishing correspondences from image to 3D has been a key task of 6DoF object pose estimation for a long time. To predict pose more accurately, deeply learned dense maps replaced sparse templates. ...
We are thankful to Rene Schuster, Fangwen Shu, Yaxu Xie and Ghazal Ghazaei for proofreading the paper. ...
arXiv:2203.09418v2
fatcat:5rvexg3jqjgl7l5i4x6rsaamea
Deep Mesh Reconstruction from Single RGB Images via Topology Modification Networks
[article]
2019
arXiv
pre-print
In this paper, we present an end-to-end single-view mesh reconstruction framework that is able to generate high-quality meshes with complex topologies from a single genus-0 template mesh. ...
Moreover, a boundary refinement network is designed to refine the boundary conditions to further improve the visual quality of the reconstructed mesh. ...
Such a design paradigm enables more accurate learning of fine geometric details with even less training time. ...
arXiv:1909.00321v1
fatcat:w5bvpfrcinbclfx5gtia5vb3sy
Learnable Triangulation for Deep Learning-based 3D Reconstruction of Objects of Arbitrary Topology from Single RGB Images
[article]
2021
arXiv
pre-print
to refine the initial mesh. ...
Thus, they are limited to the reconstruction of objects that have the same topology as the template. ...
While Mesh R-CNN can reconstruct arbitrary topologies, the use of coarse voxel grids limits its ability to accurately reconstruct fine structures such as the chair legs [25] . ...
arXiv:2109.11844v1
fatcat:cmz742pllbf7hpy2qvys2wakfy
A Survey on Deep Learning Based Methods and Datasets for Monocular 3D Object Detection
2021
Electronics
Owing to recent advancements in deep learning methods and relevant databases, it is becoming increasingly easier to recognize 3D objects using only RGB images from single viewpoints. ...
For relatively low-cost data acquisition systems without depth sensors or cameras at multiple viewpoints, we first consider existing databases with 2D RGB photos and their relevant attributes. ...
Coarse-to-fine KITTI 3D than Mono3D, due to the lower to-fine method, considerably re-resolution of images in the coarse- No steps for accurate 2D vehicle bounding ducing a search space. boxes. ...
doi:10.3390/electronics10040517
fatcat:rziqhrkefvelpg3vgb6qxfprte
Radiation-Variation Insensitive Coarse-to-Fine Image Registration for Infrared and Visible Remote Sensing Based on Zero-Shot Learning
2024
Remote Sensing
First, RIZER, as a whole, adopts a detector-free coarse-to-fine registration framework, and the data-driven methods use a Transformer based on zero-shot learning. ...
After testing, RIZER achieved a correct matching rate of 99.55% with an RMSE of 1.36 and had an advantage in the number of correct matches, as well as a good generalization ability for other multimodal ...
of image registration for deep learning to efficiently and accurately obtain a wide range of correspondences. ...
doi:10.3390/rs16020214
fatcat:2qxooy776fertkoenz2tywactu
Detecting the brain surface in sparse MRI using boundary models
2000
Medical Image Analysis
A non-linear matching scheme is introduced to estimate the location of the boundary points using only the intensity data within each image plane. ...
We use the term sparse to describe volumetric images in which the sampling resolution within the imaging plane is far higher than that of the perpendicular direction. ...
Our matching strategy is built around a coarse-to-fine approach. ...
doi:10.1016/s1361-8415(00)00020-7
pmid:11145314
fatcat:5mqh4hpucncdpij3m5ijtzuozi
TIMER: Tensor Image Morphing for Elastic Registration
2009
2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
However, there are two major limitations with these approaches. First, the computed regional features might not reflect the actual regional tensor distributions. ...
Subsequently, regional integration and other operations such as edge detection are performed to extract more features to guide the registration. ...
warrant the differentiation of different anatomical structures and hence can be utilized to assist correspondence matching in the course of registration. ...
doi:10.1109/cvprw.2009.5204350
dblp:conf/cvpr/YapWZLS09
fatcat:y72qo53befdzde27l5k5epgmtq
« Previous
Showing results 1 — 15 out of 4,360 results