Open
Description
I have written a similar (not anywhere near as good framework) to MCP agent and I'm currently attempting to switch to MCP agent.
One of the things that was important for me limiting LLM failures was being able to set max tool response sizes in the calling code (as I don't control the MCP servers).
If the LLM makes a call that returns say 50k tokens that might be fine but one that returns 500k tokens would get rejected with an error telling the LLM to use a more specific query/etc.
I think I can add this by inheriting Agent and customizing the call tool method but I wanted to flag it as a feature request here too