v1.5.5
May 27, 2025
1.5.5
New Features:
- Claude File Upload: We can now upload a file to Anthropic directly and then use it as an input to an agent.
- Claude 4 Code Execution Tool: Updated Claude to execute Python code in a secure, sandboxed environment.
- Prompt caching with Anthropic Models: allowed resuming from specific prefixes in your prompts. This approach significantly reduces processing time and costs for repetitive tasks or prompts with consistent elements.
- Vercel v0 Model: Added support for new Vercel v0 models and cookbook examples.
- Qdrant Hybrid Search support
- Markdown Knowledge Base: Added native support for Markdown-based knowledge bases.
- AI/ML API platform integration: Introduced integration with
AI/ML API
, a platform providing AI/ML models. AI/ML API provides 300+ AI models including Deepseek, Gemini, ChatGPT. The models run at enterprise-grade rate limits and uptimes. - Update Pydantic and dataclass in function handling: Added support for
Pydantic
anddataclass
objects as input to a function. See here for an example.
Improvements:
- Timeout handling for API calls in ExaTools class:
- Timeout functionality to Exa API calls to prevent indefinite hanging of search operations. The implementation uses Python's
concurrent.futures
module to enforce timeouts on all Exa API operations (search, get contents, find similar, and answer generation). - This change addresses the issue where Exa search functions would hang indefinitely, causing potential service disruptions and resource leaks.
- Timeout functionality to Exa API calls to prevent indefinite hanging of search operations. The implementation uses Python's
- Fetch messages from last N sessions:
- A tool for the agent, something like
get_previous_session_messages(number_of_sessions: int)
that returns a list of messages that the agent can analyse - Switch on with
search_previous_sessions_history
- A tool for the agent, something like
- Redis Expiration: Added
expire
key to set TTL on Redis keys. - Add Anthropic Cache Write to Agent Session Metrics: Added
cache_creation_input_tokens
to agent session metrics, to allow for tracking Anthropic cache write statistics
Bug Fixes:
- Huggingface Embedder Updates:
- Huggingface has changed some things on their API and they've deprecated
.post
on theirInferenceClient()
- https://discuss.huggingface.co/t/getting-error-attributeerror-inferenceclient-object-has-no-attribute-post/156682 - We can also no longer use
id: str = "jinaai/jina-embeddings-v2-base-code"
as default, because these models are no longer provided by theHF Inference API
. Changed the default toid: str = "intfloat/multilingual-e5-large"
- Huggingface has changed some things on their API and they've deprecated
- Add
role_map
forOpenAIChat
: This allows certain models that don’t adhere to OpenAI’s role mapping to be used virOpenAILike
. - Use Content Hash as ID in Upsert in Pgvector: Use reproducible
content_hash
in upsert as ID. - Insert in Vector DB passes only last chunk meta_data: Insert in vector db passes only last chunk meta_data. issue link- https://discord.com/channels/965734768803192842/1219054452221153463/1376631140047130649
- Remove Argument Sanitization: Replaced with a safer way to do this that won't break arguments that shouldn't be sanitized
- Handle async tools when running async agents on playground: Fixed a regression where using Agents with async tools (e.g. MCP tools) was breaking in the Playground.