Multi-model evaluation