RouteMoA: Dynamic Routing Without Pre‑Inference for Efficient Multi‑Agent Mixtures
RouteMoA moves model selection ahead of inference: a lightweight scorer predicts each model's suitability from the query alone, so candidate models need not be run before the choice is made. This substantially reduces computation cost and latency while preserving or improving accuracy. On a 15‑model pool, it achieves up to 90% cost reduction and 64% latency reduction.
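
To make the idea of routing before inference concrete, the following is a minimal sketch, not the RouteMoA implementation. It assumes a toy keyword-overlap scorer in place of the learned lightweight scorer; the names `ModelSpec`, `score_models`, `route`, and `POOL`, as well as the model names, keywords, and cost values, are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class ModelSpec:
    name: str                  # hypothetical model identifier
    cost_per_call: float       # hypothetical relative cost of one inference call
    keywords: Tuple[str, ...]  # toy stand-in for what a learned scorer would capture


# Hypothetical three-model pool; the values are illustrative, not measured.
POOL: List[ModelSpec] = [
    ModelSpec("math-model", 1.0, ("integral", "prove", "equation")),
    ModelSpec("code-model", 1.0, ("python", "function", "bug")),
    ModelSpec("general-model", 3.0, ("explain", "summarize", "describe")),
]


def score_models(query: str, pool: List[ModelSpec]) -> Dict[str, float]:
    """Predict a suitability score per model from the query text only.

    Assumption: a keyword-overlap heuristic replaces the learned scorer.
    No model in the pool is called here.
    """
    words = set(query.lower().split())
    return {
        m.name: sum(k in words for k in m.keywords) / len(m.keywords)
        for m in pool
    }


def route(query: str, pool: List[ModelSpec], top_k: int = 1) -> List[ModelSpec]:
    """Return the top_k models by predicted score; only these would be run."""
    scores = score_models(query, pool)
    ranked = sorted(pool, key=lambda m: scores[m.name], reverse=True)
    return ranked[:top_k]


def selected_cost(selected: List[ModelSpec]) -> float:
    """Total relative cost of running only the selected models."""
    return sum(m.cost_per_call for m in selected)
```

In this sketch, calling `route("fix the bug in this python function", POOL)` selects only `code-model`, so the other two models are never invoked. Comparing `selected_cost` for the routed subset with the cost of the full pool shows where the savings come from, although the numbers in this toy pool say nothing about the reported results.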
