Run the same prompt on Haiku / Sonnet / Opus / GPT-4o and compare cost, latency, and output side by side.
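
As a rough illustration of what such a comparison involves, here is a minimal sketch in Python. It is not this tool's actual implementation. Everything named here is hypothetical: the `Client` callable shape, the `compare` and `format_table` helpers, the `model-a` / `model-b` names, and the prices, which are placeholders rather than real provider rates. A real client would wrap a provider SDK call and return the token counts reported by that provider.

```python
import time
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Placeholder prices in USD per million tokens: (input, output).
# These are NOT real prices; substitute current values from each provider.
PRICES_PER_MTOK: Dict[str, Tuple[float, float]] = {
    "model-a": (1.0, 5.0),
    "model-b": (3.0, 15.0),
}

# Assumed client shape: takes a prompt, returns (text, input_tokens, output_tokens).
Client = Callable[[str], Tuple[str, int, int]]


@dataclass
class Result:
    model: str
    output: str
    latency_s: float
    cost_usd: float


def compare(prompt: str, clients: Dict[str, Client]) -> List[Result]:
    """Send the same prompt to every client and record latency and cost."""
    results = []
    for model, client in clients.items():
        start = time.perf_counter()
        text, tokens_in, tokens_out = client(prompt)
        latency = time.perf_counter() - start
        price_in, price_out = PRICES_PER_MTOK[model]
        cost = (tokens_in * price_in + tokens_out * price_out) / 1_000_000
        results.append(Result(model, text, latency, cost))
    return results


def format_table(results: List[Result]) -> str:
    """Render one row per model so the results can be read side by side."""
    lines = [f"{'model':<10} {'latency (s)':>12} {'cost (USD)':>12}  output"]
    for r in results:
        lines.append(
            f"{r.model:<10} {r.latency_s:>12.3f} {r.cost_usd:>12.6f}  {r.output}"
        )
    return "\n".join(lines)


def fake_client(reply: str, tokens_out: int) -> Client:
    """Stand-in for a real API call, so the sketch runs without network access."""
    def call(prompt: str) -> Tuple[str, int, int]:
        # Crude token estimate: whitespace-separated words in the prompt.
        return reply, len(prompt.split()), tokens_out
    return call


if __name__ == "__main__":
    clients = {
        "model-a": fake_client("hi", 10),
        "model-b": fake_client("hey there", 20),
    }
    print(format_table(compare("hello world", clients)))
```

Passing the clients in as plain callables keeps the timing and cost logic independent of any particular SDK, so a new model is just another dictionary entry plus a price.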