The research addresses the challenge of room scene discovery and grouping in unstructured vacation rental image collections.
The proposed approach uses machine learning for room-type detection, overlap detection, and clustering of images to help travelers understand property layouts.
A supervised pipeline is introduced, focused on efficiency, low latency, and sample-efficient learning for real-time and data-scarce environments.
The models and pipeline developed in the research show strong performance in room scene discovery and grouping, surpassing existing methods like contrastive learning and clustering with pretrained embeddings.