Code review comment generation evaluation is revisited.Traditional evaluation methods based on text similarity face challenges.DeepCRCEval framework integrates human evaluators and Large Language Models (LLMs).LLM-Reviewer baseline shows potential in efficient comment generation.