v2.3.8

December 5, 2025

Model-level retries improve reliability under provider rate limits

We’ve moved retry handling for provider errors from Agents/Teams to the Model layer. When you set retries on a model, Agno now retries at the model execution level, which is more effective for handling provider throttling and transient errors. Agent/Team retries now apply only to run-level exceptions. Because a failed provider call is retried where it fails rather than through the Agent/Team retry path, this change reduces wasted cycles, makes behavior more predictable, and improves throughput under rate limits.

Details

  • Configure retries on the Model to handle LLM/provider errors directly
  • Agent/Team retries now cover run-level (orchestration) exceptions only
  • Action required: move any Agent/Team retry settings intended for LLM/provider errors to the associated Model
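
The split between model-level and run-level retries can be illustrated with a small, library-independent sketch. Everything in it (FakeModel, FakeAgent, TransientProviderError, and the retries/delay parameters) is hypothetical and written for illustration only; it is not Agno's API, and the backoff policy is an assumption. It shows one plausible reading of the change: the retry loop wraps only the provider call, so run-level work happens once even when the provider fails several times.

```python
import time


class TransientProviderError(Exception):
    """Stands in for a provider rate-limit or transient failure (hypothetical)."""


class FakeModel:
    """Hypothetical model wrapper for illustration; not Agno's Model class."""

    def __init__(self, fail_times, retries=0, delay=0.0):
        self.fail_times = fail_times  # number of provider calls that fail first
        self.retries = retries        # extra attempts after the first call
        self.delay = delay            # base delay in seconds between attempts
        self.calls = 0                # total provider calls made

    def _call_provider(self, prompt):
        # Simulated provider: fails `fail_times` times, then succeeds.
        self.calls += 1
        if self.calls <= self.fail_times:
            raise TransientProviderError("rate limited")
        return f"response to: {prompt}"

    def invoke(self, prompt):
        # Model-level retry: only the provider call is repeated.
        for attempt in range(self.retries + 1):
            try:
                return self._call_provider(prompt)
            except TransientProviderError:
                if attempt == self.retries:
                    raise  # attempts exhausted; surface the error to the caller
                # Assumed policy: exponential backoff between attempts.
                time.sleep(self.delay * (2 ** attempt))


class FakeAgent:
    """Hypothetical agent for illustration; performs run-level work once per run."""

    def __init__(self, model):
        self.model = model
        self.setup_runs = 0  # counts how often run-level work is performed

    def run(self, prompt):
        self.setup_runs += 1  # e.g. building context, loading session state
        return self.model.invoke(prompt)


model = FakeModel(fail_times=2, retries=3)
agent = FakeAgent(model)
result = agent.run("hello")
```

In this sketch the provider is called three times (two simulated failures, then a success) while the agent's run-level work executes once. Check the Agno model reference for the actual retry parameter names and defaults before migrating settings.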

Who this is for: Engineering teams running production workloads at scale who need consistent behavior and better resilience under variable provider limits.