AI Test Stack
1 min read · May 28, 2025

Claude Opus 4 Tool Use: Patterns That Actually Work

Been testing parallel tool calls with Claude Opus 4 in production. Here are the retry patterns and timeout strategies that reduced our latency by 40%.

Observation

Parallel tool calls help only when the agent contract is already explicit. If the model is still unsure what each tool is for, parallelism amplifies noise instead of reducing latency.
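To make "explicit contract" concrete, here is a minimal sketch of one strict and one vague tool definition, using the name / description / input_schema shape that Anthropic's tool-use API expects. The tool names, the wording, and the `find_vague_tools` helper are all hypothetical, and the helper is a crude heuristic for illustration, not something we ran in production.

```python
# Hypothetical tool definitions; names and wording are illustrative only.
STRICT_TOOL = {
    "name": "get_order_status",
    "description": (
        "Look up the current fulfilment status of one order by its order ID. "
        "Use only when the user supplies an order ID. "
        "Does not return payment or refund details."
    ),
    "input_schema": {
        "type": "object",
        "properties": {
            "order_id": {"type": "string", "description": "Order ID, e.g. 'A-1042'."},
        },
        "required": ["order_id"],
    },
}

VAGUE_TOOL = {
    "name": "lookup",
    "description": "Looks things up.",
    "input_schema": {"type": "object", "properties": {}},
}


def find_vague_tools(tools, min_words=10):
    """Flag tools with a very short description or no required inputs.

    Assumption: both checks are rough proxies for ambiguity. A tool that
    legitimately takes no arguments would be flagged too.
    """
    flagged = []
    for tool in tools:
        too_short = len(tool["description"].split()) < min_words
        no_required = not tool["input_schema"].get("required")
        if too_short or no_required:
            flagged.append(tool["name"])
    return flagged


if __name__ == "__main__":
    print(find_vague_tools([STRICT_TOOL, VAGUE_TOOL]))
```

The strict description says what the tool does, when to use it, and what it does not cover. That last part is what seems to stop the model from calling it speculatively alongside other tools.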

What worked

  • strict tool descriptions
  • bounded retries
  • timeout-aware orchestration
  • logging both tool inputs and tool summaries
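
The retry, timeout, and logging items above can be sketched roughly as follows. This is a minimal illustration using only Python's standard `asyncio` and `logging` modules, not our production orchestrator: `call_with_retry`, `run_parallel`, the demo tools, and all default values are hypothetical, and "tool summary" is assumed here to mean a truncated string of the tool result.

```python
import asyncio
import logging

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("tools")


async def call_with_retry(name, fn, args, timeout_s=2.0, max_attempts=3, backoff_s=0.1):
    """Run one tool call with a per-attempt timeout and a bounded retry count."""
    for attempt in range(1, max_attempts + 1):
        log.info("tool=%s attempt=%d input=%r", name, attempt, args)
        try:
            result = await asyncio.wait_for(fn(**args), timeout=timeout_s)
        except (asyncio.TimeoutError, ConnectionError) as exc:
            log.warning("tool=%s attempt=%d failed: %r", name, attempt, exc)
            if attempt == max_attempts:
                # Retries exhausted: return a structured failure instead of raising.
                return {"tool": name, "ok": False, "error": type(exc).__name__}
            # Exponential backoff between attempts.
            await asyncio.sleep(backoff_s * 2 ** (attempt - 1))
        else:
            # Assumption: the "summary" is just the first 80 characters of the result.
            log.info("tool=%s summary=%s", name, str(result)[:80])
            return {"tool": name, "ok": True, "result": result}


async def run_parallel(calls, **kwargs):
    """Run independent tool calls concurrently; one failure does not cancel the rest."""
    tasks = (call_with_retry(name, fn, args, **kwargs) for name, fn, args in calls)
    return await asyncio.gather(*tasks)


# Hypothetical stand-in tools for the demo.
async def fast_tool(x):
    return x * 2


async def slow_tool(x):
    await asyncio.sleep(1.0)
    return x


def demo():
    calls = [("fast", fast_tool, {"x": 21}), ("slow", slow_tool, {"x": 1})]
    return asyncio.run(run_parallel(calls, timeout_s=0.05, max_attempts=2, backoff_s=0.01))


if __name__ == "__main__":
    print(demo())
```

Returning a structured failure rather than raising keeps one slow tool from taking down the whole batch, and the orchestrator can still hand the model a clear "this tool timed out" result.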

Practical note

The biggest improvement came from removing ambiguous tools, not from adding more concurrency.