# web-content-extractor-mcp

## Quick Start

```bash
# Connect this server (installs CLI if needed)
npx -y @smithery/cli@latest mcp add agenson-horrowitz/web-content-extractor-mcp

# Browse available tools
npx -y @smithery/cli@latest tool list agenson-horrowitz/web-content-extractor-mcp

# Get full schema for a tool
npx -y @smithery/cli@latest tool get agenson-horrowitz/web-content-extractor-mcp extract_article

# Call a tool (replace '{}' with a JSON object of arguments matching the tool's schema)
npx -y @smithery/cli@latest tool call agenson-horrowitz/web-content-extractor-mcp extract_article '{}'
```
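
The `'{}'` in the last command is an empty argument object. As a minimal sketch of passing real arguments, the snippet below builds a JSON object and wraps the same `tool call` command in a helper. The argument name `url`, the helper name `call_extract_article`, and the example page are assumptions for illustration and are not taken from this README; check the actual schema with `tool get` before relying on them.

```bash
# Assumption: "url" is a hypothetical argument name; confirm it with `tool get` first.
SERVER="agenson-horrowitz/web-content-extractor-mcp"
TARGET_URL="https://example.com/some-article"   # placeholder page

# Build the JSON argument object. This is plain string interpolation, so
# URLs containing quotes or backslashes would need proper JSON escaping.
ARGS=$(printf '{"url": "%s"}' "$TARGET_URL")

# Hypothetical helper wrapping the same `tool call` command shown above.
call_extract_article() {
  npx -y @smithery/cli@latest tool call "$SERVER" extract_article "$1"
}
```

Running `call_extract_article "$ARGS"` then sends the request. Keeping the arguments in a variable makes it easy to print and inspect the JSON before calling the tool.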

## Direct MCP Connection

Endpoint: `https://web-content-extractor-mcp--agenson-horrowitz.run.tools`
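
For clients that do not go through the CLI, a connection could start with a handshake like the one sketched below. It assumes the endpoint accepts MCP's Streamable HTTP transport directly at the URL above and needs no extra path or credentials, none of which this README states; the helper name `mcp_initialize`, the `protocolVersion`, and the `clientInfo` values are illustrative.

```bash
ENDPOINT="https://web-content-extractor-mcp--agenson-horrowitz.run.tools"

# JSON-RPC 2.0 "initialize" request, the first message in an MCP session.
# protocolVersion and clientInfo are illustrative values, not taken from this README.
INIT_REQUEST='{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"protocolVersion":"2025-03-26","capabilities":{},"clientInfo":{"name":"example-client","version":"0.1.0"}}}'

# Hypothetical helper: POST the request, accepting either a JSON or an SSE response.
# Assumption: no authentication header is required; add one if the server rejects the call.
mcp_initialize() {
  curl -sS -X POST "$ENDPOINT" \
    -H "Content-Type: application/json" \
    -H "Accept: application/json, text/event-stream" \
    -d "$INIT_REQUEST"
}
```

Calling `mcp_initialize` should return the server's capabilities if the assumptions hold. An HTTP 401 or 404 would suggest that an API key or a different path is needed.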

## Tools (5)

- `extract_article` — Extract clean article content from any URL as agent-optimized markdown. Uses advanced content extraction to get main ar…
- `extract_structured_data` — Extract structured data (tables, lists, key-value pairs) from any webpage as JSON. Perfect for agents that need to proc…
- `extract_links` — Get all links from a webpage with intelligent categorization and context. Returns internal/external links, link text, a…
- `screenshot_to_markdown` — Take a screenshot of a webpage and convert visual layout to structured markdown description. Perfect for agents that ne…
- `batch_extract` — Process multiple URLs in parallel and return consolidated results. Highly efficient for agents that need to analyze mul…

```bash
# Get full input/output schema for a tool
npx -y @smithery/cli@latest tool get agenson-horrowitz/web-content-extractor-mcp <tool-name>
```
