Model choices based solely on per-token costs can be misleading, as tokenization costs can vary between different models.
Prompt formatting, such as the use of markdown or XML tags, can significantly impact model performance and should be considered during model migration.