<ul data-eligibleForWebStory="false">A small-N case study explored the use of Grok 4 to reframe a 'beauty analysis' prompt for ChatGPT o3.Initially, ChatGPT rejected the request, but after reframing the task and using neutral language, Grok complied.The experiment resulted in Grok providing numeric scores and OpenCV-style Python code for the analysis.This experiment suggests vulnerabilities in keyword-based refusal mechanisms and emphasizes the need for ensemble-level alignment defenses.