Researchers have introduced a new two-stage method for robotic object placement called AnyPlace, which demonstrates the ability to predict feasible placement poses.
The system uses a vision language model (VLM) to produce potential placement locations, combined with depth-based models for geometric placement prediction.
The creators of AnyPlace have developed a fully synthetic dataset of 1,489 randomly generated objects, covering insertion, stacking, and hanging.
The model achieved an 80% success rate on the vial insertion task, showing robustness and generalisation.