menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

TransforMe...
source image

Arxiv

1d

read

75

img
dot

Image Credit: Arxiv

TransforMerger: Transformer-based Voice-Gesture Fusion for Robust Human-Robot Communication

  • TransforMerger is a transformer-based reasoning model that infers a structured action command for robotic manipulation based on fused voice and gesture inputs.
  • It merges multimodal data into a single unified sentence and employs probabilistic embeddings to handle uncertainty.
  • The model integrates contextual scene understanding to resolve ambiguous references and is robust to noise, misalignment, and missing information.
  • TransforMerger outperforms deterministic baselines, demonstrating its effectiveness in enabling more robust and flexible human-robot communication.

Read Full Article

like

4 Likes

For uninterrupted reading, download the app