Large language models (LLMs) still struggle with reasoning tasks, and smaller models struggle most. Inference-time methods such as prompting are effective but rely on sequential queries. Ensembling, which runs multiple models in parallel, is a promising way to improve inference-time performance. A novel training-free LLM ensemble framework is proposed to improve reasoning on math tasks.
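
To make the idea of a parallel, training-free ensemble concrete, the following is a minimal sketch using plain majority voting over final answers. It is a generic illustration only and is not the framework proposed here, whose combination rule is not described in this summary. The names `ensemble_answer`, `model_a`, `model_b`, and `model_c` are hypothetical, and each "model" is assumed to be a callable that maps a question to a final answer string.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

# Assumption: a "model" is any callable that maps a question to an answer string.
Model = Callable[[str], str]


def ensemble_answer(models: List[Model], question: str) -> str:
    """Query every model in parallel and return the most common answer.

    Majority voting is a stand-in combination rule chosen for illustration.
    """
    if not models:
        raise ValueError("at least one model is required")
    # Each model is queried once, concurrently, with no weight updates.
    with ThreadPoolExecutor(max_workers=len(models)) as pool:
        answers = list(pool.map(lambda m: m(question), models))
    # Strip whitespace so "42" and " 42 " count as the same vote.
    votes = Counter(a.strip() for a in answers)
    # Ties are broken in favor of the answer that appeared first.
    return votes.most_common(1)[0][0]


# Hypothetical stand-ins for real LLM calls.
def model_a(question: str) -> str:
    return "42"


def model_b(question: str) -> str:
    return "41"


def model_c(question: str) -> str:
    return " 42 "


if __name__ == "__main__":
    print(ensemble_answer([model_a, model_b, model_c], "What is 6 * 7?"))
```

Because the models are queried concurrently rather than one after another, wall-clock latency in this sketch is bounded by the slowest model instead of the sum of all calls, which is the contrast with sequential prompting drawn above.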