Auto-Evaluating Chatbots with GenAI: The Pipeline, The Prompts, and The Proof

A new project explores using one LLM to evaluate the chatbot responses produced by other LLMs. The Auto-Eval system rates each chatbot response on four criteria: relevance, helpfulness, clarity, and factual accuracy. The results show that the LLM evaluation method is effective in scoring chatbot responses.
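
To make the four-criterion rating concrete, here is a minimal sketch of what one evaluation step could look like. It is not the project's actual code: the 1 to 5 scale, the JSON reply format, the prompt wording, and every name (`CRITERIA`, `build_judge_prompt`, `parse_scores`, `evaluate`, `stub_judge`) are assumptions made for illustration. The judge model is passed in as a plain callable, so no particular LLM API is assumed.

```python
import json
from typing import Callable, Dict

# The four criteria named in the summary; the identifiers are illustrative.
CRITERIA = ("relevance", "helpfulness", "clarity", "factual_accuracy")


def build_judge_prompt(question: str, response: str) -> str:
    """Build the text sent to the judge LLM (wording is an assumption)."""
    criteria_list = ", ".join(CRITERIA)
    return (
        "You are evaluating a chatbot response.\n"
        f"Rate it from 1 to 5 on each of: {criteria_list}.\n"
        "Reply with a JSON object mapping each criterion to an integer.\n\n"
        f"User question: {question}\n"
        f"Chatbot response: {response}\n"
    )


def parse_scores(raw_reply: str) -> Dict[str, int]:
    """Parse the judge's JSON reply and validate the assumed 1-5 scale."""
    data = json.loads(raw_reply)
    scores: Dict[str, int] = {}
    for name in CRITERIA:
        if name not in data:
            raise ValueError(f"missing score for {name!r}")
        value = data[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int):
            raise ValueError(f"score for {name!r} is not an integer")
        if not 1 <= value <= 5:
            raise ValueError(f"score for {name!r} is outside 1-5")
        scores[name] = value
    return scores


def evaluate(question: str, response: str,
             judge: Callable[[str], str]) -> Dict[str, int]:
    """Score one chatbot response using any judge callable (prompt -> reply)."""
    return parse_scores(judge(build_judge_prompt(question, response)))


def stub_judge(prompt: str) -> str:
    """Hypothetical stand-in for a real LLM call; always returns 4s."""
    return json.dumps({name: 4 for name in CRITERIA})


if __name__ == "__main__":
    print(evaluate("What is the capital of France?", "Paris.", stub_judge))
```

Swapping `stub_judge` for a function that calls a real model is the only change needed to run this against an actual judge. Validating the reply before trusting it matters because a judge LLM can return malformed JSON or out-of-range scores.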