Dreamweaver is a neural architecture designed to discover hierarchical and compositional representations from raw videos.
The model leverages a Recurrent Block-Slot Unit (RBSU) to decompose videos into their constituent objects and attributes.
Dreamweaver outperforms current state-of-the-art baselines for world modeling and allows the generation of novel videos by recombining attributes from previously seen objects.
The research is evaluated under the DCI framework across multiple datasets.