Researchers have proposed a pipeline for generating synthetic data and investigating the factors that influence the validity of the data.They explored the use of Large Language Models (LLMs) to generate synthetic datasets for language detection.The study focused on inclusive language detection in Italian job advertisements.Results show that the fine-tuned models trained on synthetic data performed better than other models on both real and synthetic test datasets.