Introduction of QoQ-Med-7B/32B, the first open generalist clinical foundation model that reasons across medical images, time-series signals, and text reports.
Trained with Domain-aware Relative Policy Optimization (DRPO) to address performance imbalance from skewed clinical data distributions.
Demonstrated improved diagnostic performance by 43% in macro-F1 on average across all visual domains compared to other training methods like GRPO.
QoQ-Med highlights salient regions related to diagnosis with an IoU 10x higher than open models and reaches the performance of OpenAI o4-mini, aiming to foster reproducibility in research.