Large language models (LLMs) are being used to mimic human behavior in sequential decision-making tasks.A study compared the exploration-exploitation strategies of LLMs, humans, and multi-armed bandit (MAB) algorithms.Reasoning enhances LLM decision-making, making them exhibit more human-like behavior with a mix of random and directed exploration.LLMs perform similarly to humans in simple tasks but struggle to match human adaptability in complex, non-stationary environments.