Cross-lingual open-ended generation is an important yet understudied problem. XL-AlpacaEval is a new benchmark for evaluating cross-lingual generation capabilities in Large Language Models (LLMs). XL-Instruct is a high-quality synthetic data generation method that significantly improves model performance. XL-Instruct shows strong zero-shot transfer to both English-only and multilingual generation tasks.