Jun 30, 2023 · In this work, we introduce Semantic Pyramid AutoEncoder (SPAE) for enabling frozen LLMs to perform both understanding and generation tasks ...
The paper introduces SPAE, a method that aligns visual representation with a fixed LLM representation. SPAE effectively captures semantics and visual fine- ...
This work enables a standalone frozen LLM to understand and generate other modalities which are unseen in training. Tokenization via vector quantization. VQ-VAE ...
Nov 14, 2023 · Great work! After reading your paper SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs, I'm very interested in ...
This method marks the first successful attempt to enable a frozen LLM to generate image content while surpassing state-of-the-art performance in image ...
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs. Supplementary Materials. Appendix Overview. This supplementary document provides ...
Feb 5, 2024 · Bibliographic details on SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs.