PDF processing API for conversion, extraction, merging, splitting and more
All-in-one PDF processing API. Convert, extract, merge, split, compress PDFs and more. Supports OCR for scanned documents.
Official docs: https://docs.pdf.co/
Use this skill when you need to:
Set environment variable:
export PDFCO_API_KEY="your-email@example.com_your-api-key"
Extract text from PDF with OCR support:
Write to /tmp/request.json:
{
"url": "https://pdfco-test-files.s3.us-west-2.amazonaws.com/pdf-to-text/sample.pdf",
"inline": true
}
curl --location --request POST "https://api.pdf.co/v1/pdf/convert/to/text" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
With specific pages (1-indexed):
Write to /tmp/request.json:
{
"url": "https://pdfco-test-files.s3.us-west-2.amazonaws.com/pdf-to-text/sample.pdf",
"pages": "1-3",
"inline": true
}
curl --location --request POST "https://api.pdf.co/v1/pdf/convert/to/text" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
Convert PDF tables to CSV:
Write to /tmp/request.json:
{
"url": "https://pdfco-test-files.s3.us-west-2.amazonaws.com/pdf-to-csv/sample.pdf",
"inline": true
}
curl --location --request POST "https://api.pdf.co/v1/pdf/convert/to/csv" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
Combine multiple PDFs into one:
Write to /tmp/request.json:
{
"url": "https://pdfco-test-files.s3.us-west-2.amazonaws.com/pdf-merge/sample1.pdf,https://pdfco-test-files.s3.us-west-2.amazonaws.com/pdf-merge/sample2.pdf",
"name": "merged.pdf"
}
curl --location --request POST "https://api.pdf.co/v1/pdf/merge" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
Split PDF by page ranges:
Write to /tmp/request.json:
{
"url": "https://pdfco-test-files.s3.us-west-2.amazonaws.com/pdf-split/sample.pdf",
"pages": "1-2,3-"
}
curl --location --request POST "https://api.pdf.co/v1/pdf/split" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
Reduce PDF file size:
Write to /tmp/request.json:
{
"url": "https://pdfco-test-files.s3.us-west-2.amazonaws.com/pdf-optimize/sample.pdf",
"name": "compressed.pdf"
}
curl --location --request POST "https://api.pdf.co/v1/pdf/optimize" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
Convert HTML or URL to PDF:
Write to /tmp/request.json:
{
"html": "<h1>Hello World</h1><p>This is a test.</p>",
"name": "output.pdf"
}
curl --location --request POST "https://api.pdf.co/v1/pdf/convert/from/html" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
From URL:
Write to /tmp/request.json:
{
"url": "https://example.com",
"name": "webpage.pdf"
}
curl --location --request POST "https://api.pdf.co/v1/pdf/convert/from/url" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
Extract structured data from invoices:
Write to /tmp/request.json:
{
"url": "https://pdfco-test-files.s3.us-west-2.amazonaws.com/ai-invoice-parser/sample-invoice.pdf",
"inline": true
}
curl --location --request POST "https://api.pdf.co/v1/ai-invoice-parser" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
Upload a local file first, then use the returned URL:
Step 1: Get presigned upload URL
curl -s "https://api.pdf.co/v1/file/upload/get-presigned-url?name=myfile.pdf&contenttype=application/pdf" --header "x-api-key: $(printenv PDFCO_API_KEY)" | jq -r '.presignedUrl, .url'
Copy the presigned URL and file URL from the response.
Step 2: Upload file
Replace <presigned-url> with the URL from Step 1:
curl -X PUT "<presigned-url>" --header "Content-Type: application/pdf" --data-binary @/path/to/your/file.pdf
Step 3: Use file URL in subsequent API calls
Replace <file-url> with the file URL from Step 1:
Write to /tmp/request.json:
{
"url": "<file-url>",
"inline": true
}
curl --location --request POST "https://api.pdf.co/v1/pdf/convert/to/text" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
For large files, use async mode to avoid timeouts:
Step 1: Start async job
Write to /tmp/request.json:
{
"url": "https://example.com/large-file.pdf",
"async": true
}
curl -s --location --request POST "https://api.pdf.co/v1/pdf/convert/to/text" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json | jq -r '.jobId'
Copy the job ID from the response.
Step 2: Check job status
Replace <job-id> with the job ID from Step 1:
Write to /tmp/request.json:
{
"jobid": "<job-id>"
}
curl --location --request POST "https://api.pdf.co/v1/job/check" --header "x-api-key: $(printenv PDFCO_API_KEY)" --header "Content-Type: application/json" -d @/tmp/request.json
| Parameter | Type | Description |
|---|---|---|
url |
string | URL to source file (required) |
inline |
boolean | Return result in response body |
async |
boolean | Run as background job |
pages |
string | Page range, 1-indexed (e.g., "1-3", "1,3,5", "2-") |
name |
string | Output filename |
password |
string | PDF password if protected |
expiration |
integer | Output link expiration in minutes (default: 60) |
{
"url": "https://pdf-temp-files.s3.amazonaws.com/.../result.pdf",
"pageCount": 5,
"error": false,
"status": 200,
"name": "result.pdf",
"credits": 10,
"remainingCredits": 9990
}
With inline: true, the response includes body field with extracted content.
| Endpoint | Description |
|---|---|
/pdf/convert/to/text |
PDF to text (OCR supported) |
/pdf/convert/to/csv |
PDF to CSV |
/pdf/convert/to/json |
PDF to JSON |
/pdf/merge |
Merge multiple PDFs |
/pdf/split |
Split PDF by pages |
/pdf/optimize |
Compress PDF |
/pdf/convert/from/html |
HTML to PDF |
/pdf/convert/from/url |
URL to PDF |
/ai-invoice-parser |
AI-powered invoice parsing |
/document-parser |
Template-based document parsing |
/file/upload/get-presigned-url |
Get upload URL |
/job/check |
Check async job status |
async: true for files over 40 pages or 10MBlang for non-English)remainingCredits in response