Smithery Logo
MCPsSkillsDocsPricing
Login
Smithery Logo

Accelerating the Agent Economy

Resources

DocumentationPrivacy PolicySystem Status

Company

PricingAboutBlog

Connect

© 2026 Smithery. All rights reserved.

    ratacat

    ebook-extractor

    ratacat/ebook-extractor
    Data & Analytics
    17
    1 installs

    About

    SKILL.md

    Install

    Install via Skills CLI

    or add to your agent
    • Claude Code
      Claude Code
    • Codex
      Codex
    • OpenClaw
      OpenClaw
    • Cursor
      Cursor
    • Amp
      Amp
    • GitHub Copilot
      GitHub Copilot
    • Gemini CLI
      Gemini CLI
    • Kilo Code
      Kilo Code
    • Junie
      Junie
    • Replit
      Replit
    • Windsurf
      Windsurf
    • Cline
      Cline
    • Continue
      Continue
    • OpenCode
      OpenCode
    • OpenHands
      OpenHands
    • Roo Code
      Roo Code
    • Augment
      Augment
    • Goose
      Goose
    • Trae
      Trae
    • Zencoder
      Zencoder
    • Antigravity
      Antigravity
    ├─
    ├─
    └─

    About

    Use when user wants to extract text from ebooks (EPUB, MOBI, PDF). Use for converting ebooks to plain text for analysis, processing, or reading. Handles all common ebook formats.

    SKILL.md

    Ebook Text Extractor

    Overview

    Extract plain text from EPUB, MOBI, and PDF files using Python scripts. No LLM calls - pure text extraction.

    Supported Formats

    Format Tool Used Notes
    EPUB ebooklib + BeautifulSoup Direct parsing, preserves structure
    MOBI Calibre ebook-convert Converts to EPUB first, then extracts
    PDF PyMuPDF (fitz) Fast, handles most PDFs well

    Usage

    Unified extractor (auto-detects format):

    python3 ~/.claude/skills/ebook-extractor/scripts/extract.py /path/to/book.epub
    python3 ~/.claude/skills/ebook-extractor/scripts/extract.py /path/to/book.mobi
    python3 ~/.claude/skills/ebook-extractor/scripts/extract.py /path/to/book.pdf
    

    Output options:

    # To stdout (default)
    python3 scripts/extract.py book.epub
    
    # To file
    python3 scripts/extract.py book.epub -o output.txt
    python3 scripts/extract.py book.epub > output.txt
    

    Format-specific scripts:

    python3 scripts/extract_epub.py book.epub
    python3 scripts/extract_mobi.py book.mobi
    python3 scripts/extract_pdf.py book.pdf
    

    Setup

    # One-command setup (installs all dependencies)
    ~/.claude/skills/ebook-extractor/setup.sh
    
    # Or manually:
    pip install -r ~/.claude/skills/ebook-extractor/requirements.txt
    brew install calibre  # macOS, for MOBI support
    

    Script Location

    ~/.claude/skills/ebook-extractor/scripts/

    Common Issues

    Problem Solution
    Missing package Run setup.sh or pip install -r requirements.txt
    MOBI fails Ensure Calibre is installed: brew install calibre
    PDF garbled Some PDFs are image-based; OCR needed (not supported)
    Recommended Servers
    Jina AI
    Jina AI
    Apify
    Apify
    ScrapeGraph AI Integration Server
    ScrapeGraph AI Integration Server
    Repository
    ratacat/claude-skills
    Files