1 min readMay 28, 2025
Claude Opus 4 Tool Use: Patterns That Actually Work
Been testing parallel tool calls with Claude Opus 4 in production. Here are the retry patterns and timeout strategies that reduced our latency by 40%.
Observation
Parallel tool calls help only when the agent contract is already explicit. If the model is still uncertain about tool purpose, parallelism amplifies noise instead of reducing latency.
What worked
- strict tool descriptions
- bounded retries
- timeout-aware orchestration
- logging both tool inputs and tool summaries
Practical note
The biggest improvement came from removing ambiguous tools, not from adding more concurrency.