Overview
Performance
Usage
Integrate
Reduce LLM API costs via semantic caching, prompt compression, model routing and context pruning. Zero code changes required.
Integrate this server via the CLI, MCP SDK, or AI SDK. Smithery handles OAuth, token refresh, and session management for you.
1. Install Smithery CLI
2. Create a namespace
3. Use this server