Generating the fine details of interactions between entities is a long-standing challenge in image generation. A new dataset, InterActing, has been introduced with 1,000 fine-grained prompts covering different interaction scenarios. The proposed approach, DetailScribe, uses LLMs to decompose each interaction into its component details and a VLM to critique the generated images. The results show significantly improved image quality, indicating the potential of enhanced inference-time strategies.
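The decompose-then-critique idea can be sketched as a small control loop. This is a minimal illustration and not the DetailScribe implementation: the function `refine_with_critique`, the `Critique` type, the `max_rounds` parameter, and the toy stand-ins for the LLM, the image generator, and the VLM are all hypothetical names introduced here, and the "image" is just a string so that the sketch runs without any model.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Critique:
    """Hypothetical critic output: a verdict plus textual feedback."""
    acceptable: bool
    feedback: str


def refine_with_critique(
    prompt: str,
    decompose: Callable[[str], List[str]],
    generate: Callable[[str], str],
    critique: Callable[[str, List[str]], Critique],
    max_rounds: int = 3,
) -> Tuple[str, List[str]]:
    """Generate, critique, and regenerate until accepted or out of rounds."""
    # Step 1: an LLM-like callable breaks the interaction into details.
    details = decompose(prompt)
    current_prompt = prompt + " Details: " + "; ".join(details)
    history = [current_prompt]
    image = generate(current_prompt)

    # Step 2: a VLM-like callable critiques the image; feedback is folded
    # back into the prompt before regenerating.
    for _ in range(max_rounds):
        result = critique(image, details)
        if result.acceptable:
            break
        current_prompt = f"{current_prompt} Fix: {result.feedback}"
        history.append(current_prompt)
        image = generate(current_prompt)
    return image, history


# Toy stand-ins (assumptions for illustration only, not real models).
def toy_decompose(prompt: str) -> List[str]:
    return ["hand grips the handle", "cup is slightly tilted"]


def toy_generate(prompt: str) -> str:
    return prompt  # the "image" is simply the prompt text


def toy_critique(image: str, details: List[str]) -> Critique:
    if "Fix:" in image:
        return Critique(True, "")
    return Critique(False, "show fingers wrapped around the handle")


if __name__ == "__main__":
    final_image, prompts = refine_with_critique(
        "a person holding a cup", toy_decompose, toy_generate, toy_critique
    )
    print(len(prompts), "prompt(s) used")
```

In a real setting the three callables would wrap an LLM, a text-to-image model, and a VLM; keeping them as injected functions makes the loop itself easy to test in isolation.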