Mamba-3D as Masked Autoencoders for Accurate and Data-Efficient Analysis of Medical Ultrasound Videos

A naukri.com initiative

New

Mamba-3D a...

Arxiv

160

Image Credit: Arxiv

Ultrasound videos are an important form of clinical imaging data for diagnostic analysis.
E-ViM$^3$ is a data-efficient Vision Mamba network that enhances space-time correlations.
Enclosure Global Tokens (EGT) capture and aggregate global features effectively.
With limited labels, E-ViM$^3$ achieves competitive performance in semantic analysis tasks.

Read Full Article

9 Likes

For uninterrupted reading, download the app