<ul data-eligibleForWebStory="true">
<li>Large language models (LLMs) such as ChatGPT have grown rapidly in popularity since November 2022.</li>
<li>Many open-source models are available, but the requirements for deploying them are often unclear.</li>
<li>Tests conducted at the Centre Inria de l'Université de Bordeaux compare the performance of Mistral and LLaMa models.</li>
<li>The study used vLLM, a Python library optimized for LLM inference.</li>
<li>The test results help estimate LLM performance based on the GPUs available.</li>
<li>The study aims to help private and public organizations deploy LLMs by providing practical information.</li>