Researchers have proposed RAG-check, a comprehensive method to evaluate multi-modal RAG systems. RAG-check consists of three components: relevancy evaluation, span categorization, and correctness assessment. The evaluation results show performance variations across different RAG system configurations. GPT-4o emerges as the most effective model for context generation in RAG systems.