<ul data-eligibleForWebStory="true">Image restoration methods often struggle to reconstruct textual regions accurately, resulting in text-image hallucination.Text-Aware Image Restoration (TAIR) is introduced to recover visual contents and textual fidelity simultaneously.SA-Text, a large-scale benchmark of scene images annotated with text instances, is presented.A multi-task diffusion framework called TeReDiff integrates features from diffusion models into a text-spotting module.The joint training of components allows for rich text representations used in denoising.Experiments show that the approach outperforms existing methods, improving text recognition accuracy.Project page: https://cvlab-kaist.github.io/TAIR/