Model-level retries improve reliability under provider rate limits

We’ve moved retry logic from Agents/Teams to the Model layer. When you set retries on a model, Agno now retries at the model execution level, which is more effective for handling provider throttling and transient errors. Agent/Team retries now apply only to run-level exceptions. This change reduces wasted cycles, makes behavior more predictable, and improves throughput under rate limits.

Details

Configure retries on the Model to handle LLM/provider errors directly
Agent/Team retries now cover orchestration-level failures only
Action required: move any Agent/Team retry settings to the associated Model

Who this is for: Teams running production workloads at scale who need consistent behavior and better resilience under variable provider limits.