v2.3.8
December 5, 2025
Model-level retries improve reliability under provider rate limits
We’ve moved retry logic from Agents/Teams to the Model layer. When you set retries on a model, Agno now retries at the model execution level, which is more effective for handling provider throttling and transient errors. Agent/Team retries now apply only to run-level exceptions. This change reduces wasted cycles, makes behavior more predictable, and improves throughput under rate limits.
Details
- Configure retries on the Model to handle LLM/provider errors directly
- Agent/Team retries now cover orchestration-level failures only
- Action required: move any Agent/Team retry settings to the associated Model
Who this is for: Teams running production workloads at scale who need consistent behavior and better resilience under variable provider limits.
