v2.2.1

October 24, 2025

Cap historical tool calls to control context size and spend

A new parameter, max_tool_calls_from_history, lets you cap how many historical tool call pairs are loaded into context. This keeps token usage predictable, reduces costs, and lowers the risk of hitting context limits — especially for long-running sessions and tool-heavy agents.

Details

  • Configurable at the agent, team, and workflow level
  • Improves latency and reliability by preventing context bloat
  • Backward compatible: defaults remain unchanged unless you set it

Who this is for: Platform owners and engineering teams running multi-turn or high-volume automations who need tighter cost and performance control.