Can Opus + Sonnet Advisor Cut Costs While Raising AI Benchmark Scores?
Anthropic’s new advisor strategy lets the more capable but pricier Opus model act as an on‑demand consultant for a cheaper Sonnet or Haiku executor. The pairing delivers higher benchmark scores (e.g., SWE‑bench Multilingual up to 74.8% and BrowseComp up to 41.2%) while reducing per‑task cost to about 15% of solo runs. It also introduces trade‑offs: the executor must recognize when to ask for advice, and the setup carries a risk of vendor lock‑in.
