Smithery Logo
MCPsSkillsDocsPricing
Login
Smithery Logo

Give agents more agency

Resources

DocumentationPrivacy PolicySystem Status

Company

PricingAboutBlog

Connect

© 2026 Smithery. All rights reserved.

    sanjay3290

    imagen

    sanjay3290/imagen
    Design
    53
    4 installs

    About

    SKILL.md

    Install

    Install via Skills CLI

    or add to your agent
    • Claude Code
      Claude Code
    • Codex
      Codex
    • OpenClaw
      OpenClaw
    • Cursor
      Cursor
    • Amp
      Amp
    • GitHub Copilot
      GitHub Copilot
    • Gemini CLI
      Gemini CLI
    • Kilo Code
      Kilo Code
    • Junie
      Junie
    • Replit
      Replit
    • Windsurf
      Windsurf
    • Cline
      Cline
    • Continue
      Continue
    • OpenCode
      OpenCode
    • OpenHands
      OpenHands
    • Roo Code
      Roo Code
    • Augment
      Augment
    • Goose
      Goose
    • Trae
      Trae
    • Zencoder
      Zencoder
    • Antigravity
      Antigravity
    ├─
    ├─
    └─

    About

    Generate images using Google Gemini's image generation capabilities. Use this skill when the user needs to create, generate, or produce images for any purpose including UI mockups, icons,...

    SKILL.md

    Imagen - AI Image Generation Skill

    Overview

    This skill generates images using Google Gemini's image generation model (gemini-3-pro-image-preview). It enables seamless image creation during any Claude Code session - whether you're building frontend UIs, creating documentation, or need visual representations of concepts.

    Cross-Platform: Works on Windows, macOS, and Linux.

    When to Use This Skill

    Automatically activate this skill when:

    • User requests image generation (e.g., "generate an image of...", "create a picture...")
    • Frontend development requires placeholder or actual images
    • Documentation needs illustrations or diagrams
    • Visualizing concepts, architectures, or ideas
    • Creating icons, logos, or UI assets
    • Any task where an AI-generated image would be helpful

    How It Works

    1. Takes a text prompt describing the desired image
    2. Calls Google Gemini API with image generation configuration
    3. Saves the generated image to a specified location (defaults to current directory)
    4. Returns the file path for use in your project

    Usage

    Python (Cross-Platform - Recommended)

    # Basic usage
    python scripts/generate_image.py "A futuristic city skyline at sunset"
    
    # With custom output path
    python scripts/generate_image.py "A minimalist app icon for a music player" "./assets/icons/music-icon.png"
    
    # With custom size
    python scripts/generate_image.py --size 2K "High resolution landscape" "./wallpaper.png"
    

    Requirements

    • GEMINI_API_KEY environment variable must be set
    • Python 3.6+ (uses standard library only, no pip install needed)

    Output

    Generated images are saved as PNG files. The script returns:

    • Success: Path to the generated image
    • Failure: Error message with details

    Examples

    Frontend Development

    User: "I need a hero image for my landing page - something abstract and tech-focused"
    -> Generates and saves image, provides path for use in HTML/CSS
    

    Documentation

    User: "Create a diagram showing microservices architecture"
    -> Generates visual representation, ready for README or docs
    

    UI Assets

    User: "Generate a placeholder avatar image for the user profile component"
    -> Creates image in appropriate size for component use
    
    Recommended Servers
    Nanobanana
    Nanobanana
    JigsawStack Image Generation
    JigsawStack Image Generation
    OpenZeppelin
    OpenZeppelin
    Repository
    sanjay3290/ai-skills
    Files