Showing 1–14 of 14 results for author: Nair, N G

Searching in archive cs.
  1. arXiv:2404.09977  [pdf, other]

    cs.CV

    MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

    Authors: Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M Patel

    Abstract: Large diffusion-based Text-to-Image (T2I) models have shown impressive generative powers for text-to-image generation as well as spatially conditioned image generation. For most applications, we can train the model end-to-end with paired data to obtain photorealistic generation quality. However, to add an additional task, one often needs to retrain the model from scratch using paired data across al…

    Submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2404.09976  [pdf, other]

    cs.CV

    Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers

    Authors: Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Recently, diffusion transformers have gained wide attention with their excellent performance in text-to-image and text-to-video models, emphasizing the need for transformers as a backbone for diffusion models. Transformer-based models have shown better generalization capability compared to CNN-based models for general vision tasks. However, much less has been explored in the existing literature regard…

    Submitted 15 April, 2024; originally announced April 2024.

  3. arXiv:2310.00224  [pdf, other]

    cs.CV cs.AI cs.LG

    Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

    Authors: Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks

    Abstract: Conditional generative models typically demand large annotated training sets to achieve high-quality synthesis. As a result, there has been significant interest in designing models that perform plug-and-play generation, i.e., to use a predefined or pretrained model, which is not explicitly trained on the generative task, to guide the generative process (e.g., using language). However, such guidanc…

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted at ICCV 2023

  4. arXiv:2308.03726  [pdf, other]

    cs.CV

    AdaptiveSAM: Towards Efficient Tuning of SAM for Surgical Scene Segmentation

    Authors: Jay N. Paranjape, Nithin Gopalakrishnan Nair, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel

    Abstract: Segmentation is a fundamental problem in surgical scene analysis using artificial intelligence. However, the inherent data scarcity in this domain makes it challenging to adapt traditional segmentation techniques for this task. To tackle this issue, current research employs pretrained models and finetunes them on the given data. Even so, these require training deep networks with millions of parame…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 10 pages, 6 figures, 5 tables

  5. arXiv:2303.12790  [pdf, other]

    cs.CV

    CrowdDiff: Multi-hypothesis Crowd Density Estimation using Diffusion Models

    Authors: Yasiru Ranasinghe, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: Crowd counting is a fundamental problem in crowd analysis which is typically accomplished by estimating a crowd density map and summing over the density values. However, this approach suffers from background noise accumulation and loss of density due to the use of broad Gaussian kernels to create the ground truth density maps. This issue can be overcome by narrowing the Gaussian kernel. However, e…

    Submitted 4 April, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR'24. The project is available at https://dylran.github.io/crowddiff.github.io

  6. arXiv:2212.07352  [pdf, other]

    cs.CV

    Bi-Noising Diffusion: Towards Conditional Diffusion Models with Generative Restoration Priors

    Authors: Kangfu Mei, Nithin Gopalakrishnan Nair, Vishal M. Patel

    Abstract: Conditional diffusion probabilistic models can model the distribution of natural images and can generate diverse and realistic samples based on given conditions. However, oftentimes their results can be unrealistic with observable color shifts and textures. We believe that this issue results from the divergence between the probabilistic distribution learned by the model and the distribution of nat…

    Submitted 14 December, 2022; originally announced December 2022.

  7. arXiv:2212.00793  [pdf, other]

    cs.CV

    Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models

    Authors: Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: Generating photos satisfying multiple constraints finds broad utility in the content creation industry. A key hurdle to accomplishing this task is the need for paired data consisting of all modalities (i.e., constraints) and their corresponding output. Moreover, existing methods need retraining using paired data across all modalities to introduce a new condition. This paper proposes a solution to t…

    Submitted 20 April, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: Accepted at CVPR 2023

  8. arXiv:2209.09498  [pdf, other]

    cs.CV eess.IV

    NBD-GAP: Non-Blind Image Deblurring Without Clean Target Images

    Authors: Nithin Gopalakrishnan Nair, Rajeev Yasarla, Vishal M. Patel

    Abstract: In recent years, deep neural network-based restoration methods have achieved state-of-the-art results in various image deblurring tasks. However, one major drawback of deep learning-based deblurring networks is that large amounts of blurry-clean image pairs are required for training to achieve good performance. Moreover, deep networks often fail to perform well when the blurry images and the blur…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted at ICIP 2022

  9. arXiv:2209.08814  [pdf, other]

    cs.CV

    T2V-DDPM: Thermal to Visible Face Translation using Denoising Diffusion Probabilistic Models

    Authors: Nithin Gopalakrishnan Nair, Vishal M. Patel

    Abstract: Modern-day surveillance systems perform person recognition using deep learning-based face verification networks. Most state-of-the-art facial verification systems are trained using visible spectrum images. But, acquiring images in the visible spectrum is impractical in scenarios of low-light and nighttime conditions, and often images are captured in an alternate domain such as the thermal infrared…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted at The IEEE conference series on Automatic Face and Gesture Recognition 2023

  10. arXiv:2208.11284  [pdf, other]

    cs.CV

    AT-DDPM: Restoring Faces degraded by Atmospheric Turbulence using Denoising Diffusion Probabilistic Models

    Authors: Nithin Gopalakrishnan Nair, Kangfu Mei, Vishal M. Patel

    Abstract: Although many long-range imaging systems are designed to support extended vision applications, a natural obstacle to their operation is degradation due to atmospheric turbulence. Atmospheric turbulence causes significant degradation to image quality by introducing blur and geometric distortion. In recent years, various deep learning-based single image atmospheric turbulence mitigation methods, inc…

    Submitted 20 September, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted to IEEE WACV 2023

  11. arXiv:2206.11892  [pdf, other]

    cs.CV cs.LG

    DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection

    Authors: Wele Gedara Chaminda Bandara, Nithin Gopalakrishnan Nair, Vishal M. Patel

    Abstract: Remote sensing change detection is crucial for understanding the dynamics of our planet's surface, facilitating the monitoring of environmental changes, evaluating human impact, predicting future trends, and supporting decision-making. In this work, we introduce a novel approach for change detection that can leverage off-the-shelf, unlabeled remote sensing images in the training process by pre-tra…

    Submitted 12 January, 2024; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: Code available at: https://github.com/wgcban/ddpm-cd

  12. arXiv:2206.05039  [pdf, other]

    cs.CV

    Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models

    Authors: Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M Patel

    Abstract: Image synthesis under multi-modal priors is a useful and challenging task that has received increasing attention in recent years. A major challenge in using generative models to accomplish this task is the lack of paired data containing all modalities (i.e. priors) and corresponding outputs. In recent work, a variational auto-encoder (VAE) model was trained in a weakly supervised manner to address…

    Submitted 10 June, 2022; originally announced June 2022.

  13. SAR Despeckling using a Denoising Diffusion Probabilistic Model

    Authors: Malsha V. Perera, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: Speckle is a multiplicative noise which affects all coherent imaging modalities including Synthetic Aperture Radar (SAR) images. The presence of speckle degrades the image quality and adversely affects the performance of SAR image understanding applications such as automatic target recognition and change detection. Thus, SAR despeckling is an important problem in remote sensing. In this paper, we…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Our code is available at https://github.com/malshaV/SAR_DDPM

  14. arXiv:2204.08974  [pdf, other]

    cs.CV eess.IV

    A comparison of different atmospheric turbulence simulation methods for image restoration

    Authors: Nithin Gopalakrishnan Nair, Kangfu Mei, Vishal M. Patel

    Abstract: Atmospheric turbulence deteriorates the quality of images captured by long-range imaging systems by introducing blur and geometric distortions to the captured scene. This leads to a drastic drop in performance when computer vision algorithms like object/face recognition and detection are performed on these images. In recent years, various deep learning-based atmospheric turbulence mitigation metho…

    Submitted 19 April, 2022; originally announced April 2022.