GPT-4o has powerful image generation capabilities as a natively multimodal architecture.The inner workings and details of GPT-4o remain largely undisclosed, posing a challenge for researchers and developers.An empirical study compared GPT-4o with competitors and specialized models for image generation tasks.This article explores GPT-4o's strengths, weaknesses, and its position in the quest for unified generative AI.