menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Highlighti...
source image

Arxiv

1w

read

139

img
dot

Image Credit: Arxiv

Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval

  • Ideal text-to-image (T2I) retrievers should prioritize specific visual attributes relevant to queries.
  • CLIP-like retrievers have poor performance on attribute-focused queries due to focusing on global semantics and subjects, leaving out other details.
  • Recent Multimodal Large Language Model (MLLM)-based retrievers also struggle with limitations in handling attribute-focused queries.
  • Proposal to use promptable image embeddings to boost performance by highlighting required attributes, with acceleration strategies to enhance real-world applicability.

Read Full Article

like

8 Likes

For uninterrupted reading, download the app