tool-use
Coverage, reference pages, tools, and guides connected to this topic.
-
Claude API Adds Streaming for High-Throughput Agents
New streaming and batching endpoints in the Claude API are optimized for agentic deployments that require real-time processing.
-
Mistral Small 4 Tops Reasoning Benchmarks for Agent Use
22B-parameter Mistral Small 4 outperforms larger closed models on reasoning and instruction benchmarks critical for agents.
-
MCP Toolbox
Reference servers and clients for the Model Context Protocol.
-
Six failure modes in tool-using agents, and the patterns that fix them
An empirical taxonomy of agent tool-use failures across 4,000 traces from production deployments. Schema drift and silent partial failure dominate.