MedSegFactory is a medical synthesis framework that generates high-quality paired medical images and segmentation masks across various modalities and tasks to enhance existing segmentation tools.
It uses a dual-stream diffusion model in which one stream synthesizes medical images and the other generates the corresponding segmentation masks, with Joint Cross-Attention (JCA) between the streams keeping each image and its mask precisely aligned.
This bidirectional interaction between the streams improves the consistency of generated image-mask pairs, and the framework supports on-demand generation of pairs from user-defined prompts.
MedSegFactory's approach improves scalability and data quality and is reported to perform strongly on both 2D and 3D segmentation tasks, while addressing the data scarcity and regulatory constraints that limit medical imaging datasets.
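
The bidirectional exchange between the image and mask streams can be illustrated with a minimal NumPy sketch. This is not MedSegFactory's actual implementation: it assumes that each stream's tokens act as queries over the other stream's tokens and that the attended result is added back as a residual. The names `cross_attend`, `joint_cross_attention`, and `params`, the token shapes, and the random weights are all hypothetical and chosen only for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attend(queries, context, w_q, w_k, w_v):
    # Scaled dot-product attention: `queries` read from `context`.
    q = queries @ w_q
    k = context @ w_k
    v = context @ w_v
    scores = q @ k.T / np.sqrt(q.shape[-1])
    return softmax(scores, axis=-1) @ v


def joint_cross_attention(img_tokens, mask_tokens, params):
    # Assumed form of the bidirectional exchange: each stream attends to
    # the other, and the result is added back as a residual update.
    img_update = cross_attend(img_tokens, mask_tokens, *params["img_from_mask"])
    mask_update = cross_attend(mask_tokens, img_tokens, *params["mask_from_img"])
    return img_tokens + img_update, mask_tokens + mask_update


rng = np.random.default_rng(0)
n_tokens, dim = 16, 8  # hypothetical sizes, not taken from the paper


def init_weights():
    # Three (dim, dim) projections: query, key, value.
    return tuple(rng.normal(scale=dim ** -0.5, size=(dim, dim)) for _ in range(3))


params = {"img_from_mask": init_weights(), "mask_from_img": init_weights()}
img_tokens = rng.normal(size=(n_tokens, dim))
mask_tokens = rng.normal(size=(n_tokens, dim))

img_out, mask_out = joint_cross_attention(img_tokens, mask_tokens, params)
print(img_out.shape, mask_out.shape)
```

In this sketch both updates are computed from the same input tokens, so neither stream sees the other's already-updated features within a single call. A real dual-stream denoiser would apply such a block at multiple layers and timesteps.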