Add behavioral evals for web tool selection (google_web_search vs web_fetch)

## Summary

The eval suite has no tests covering how the agent chooses between `google_web_search` and `web_fetch`. These are the two web tools available to the agent but they serve different purposes:

- `google_web_search` — for open-ended queries where the agent needs to find information
- `web_fetch` — for fetching a specific URL the user has already provided

Without evals, regressions in this distinction go undetected.

## Expected behavior

- When asked for current information with no URL, agent should use `google_web_search`
- When given a specific URL, agent should use `web_fetch`, not search
- When the answer is in local files, agent should use neither web tool

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add behavioral evals for web tool selection (google_web_search vs web_fetch) #23483

Summary

Expected behavior

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add behavioral evals for web tool selection (google_web_search vs web_fetch) #23483

Description

Summary

Expected behavior

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions