improve validation criteria of webvoyager/gaia evals #2755
ci.yml
on: pull_request
determine-evals
3s
run-e2e-tests
3m 10s
run-e2e-local-tests
4m 1s
run-e2e-bb-tests
2m 7s
run-agent-evals
6s