This skill provides guidance for using Firecrawl MCP for web scraping and content extraction...
This skill enables web scraping and content extraction using Firecrawl through MCP. Firecrawl provides powerful capabilities for scraping single pages, crawling entire websites, searching with content extraction, and extracting structured data.
| Task | Tool | Notes |
|---|---|---|
| Single page content | firecrawl_scrape |
Fast, reliable |
| Find URLs on site | firecrawl_map |
Discovery only |
| Web search | firecrawl_search |
With optional scraping |
| Multiple pages | firecrawl_map + firecrawl_scrape |
Better than crawl |
| Structured data | firecrawl_extract |
Uses LLM extraction |
| Full site crawl | firecrawl_crawl |
Use with caution |
Scrape content from a single URL. Most powerful and reliable scraper.
Tool: mcp__firecrawl__firecrawl_scrape
mcp__firecrawl__firecrawl_scrape({
url: "https://docs.example.com/api",
formats: ["markdown"],
onlyMainContent: true,
maxAge: 172800000
})
Key Parameters:
url: Target URLformats: Output formats - "markdown", "html", "links"onlyMainContent: Extract only main content (recommended)maxAge: Cache TTL in ms (500% faster with cache)Search the web and optionally scrape results.
Tool: mcp__firecrawl__firecrawl_search
Without scraping (preferred):
mcp__firecrawl__firecrawl_search({
query: "xterm.js terminal tutorial",
limit: 5
})
With scraping:
mcp__firecrawl__firecrawl_search({
query: "VS Code extension API",
limit: 3,
scrapeOptions: {
formats: ["markdown"],
onlyMainContent: true
}
})
Search Operators:
| Operator | Example | Description |
|---|---|---|
"" |
"exact phrase" |
Exact match |
- |
-deprecated |
Exclude term |
site: |
site:github.com |
Specific site |
inurl: |
inurl:docs |
URL contains |
intitle: |
intitle:tutorial |
Title contains |
Discover all URLs on a website.
Tool: mcp__firecrawl__firecrawl_map
mcp__firecrawl__firecrawl_map({
url: "https://docs.example.com",
limit: 100,
search: "api"
})
Extract structured data using LLM.
Tool: mcp__firecrawl__firecrawl_extract
mcp__firecrawl__firecrawl_extract({
urls: ["https://shop.example.com/product/123"],
prompt: "Extract product details",
schema: {
type: "object",
properties: {
name: { type: "string" },
price: { type: "number" }
},
required: ["name", "price"]
}
})
Crawl multiple pages (use sparingly).
Tool: mcp__firecrawl__firecrawl_crawl
mcp__firecrawl__firecrawl_crawl({
url: "https://docs.example.com/guide",
maxDiscoveryDepth: 2,
limit: 10
})
Warning: Crawl can return large responses. Keep limits low.
map + scrape for better controlmaxAge for repeated requestsmaxDiscoveryDepth and limit lowFor complete tool parameters, see references/tools.md.