Generate images using Google's Nano Banana 2 (gemini-3.1-flash-image-preview) with workflow-based prompting
Generate professional images, infographics, and diagrams using Google's Nano Banana 2 model (gemini-3.1-flash-image-preview).
Nano Banana 2 (released February 26, 2026):
All scripts use Python via uv run with inline dependencies.
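As a minimal sketch of what "inline dependencies" can look like: uv reads a PEP 723 metadata comment block at the top of a script and installs the listed packages before running it. The file name `example.py`, the `google-genai` dependency, and the Python version bound are assumptions for illustration; the actual scripts may declare something different.

```python
# /// script
# requires-python = ">=3.10"
# dependencies = ["google-genai"]
# ///
# The block above is PEP 723 inline metadata; the dependency name is an assumption.
import sys


def summarize_args(args: list[str]) -> str:
    """Describe the CLI arguments this hypothetical script received."""
    if len(args) < 2:
        return "usage: example.py PROMPT OUTPUT"
    prompt, output = args[0], args[1]
    return f"would generate {output!r} from prompt {prompt!r}"


# Run as: uv run example.py "A cozy coffee shop in autumn" coffee.png
print(summarize_args(sys.argv[1:]))
```

Because the metadata lives in comments, the same file still runs under plain `python`, provided the listed packages are already installed.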
uv run scripts/generate.py "prompt" output.png [aspect_ratio] [size]
Examples:
# Basic image
uv run scripts/generate.py "A cozy coffee shop in autumn" coffee.png
# Infographic with specific aspect ratio
uv run scripts/generate.py "Infographic explaining how neural networks work" nn.png 16:9 2K
# 4K professional image
uv run scripts/generate.py "Professional headshot, studio lighting" headshot.png 3:2 4K
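When calling the generator from another Python program, the command line above can be assembled programmatically. The sketch below is a hypothetical helper, not part of the skill: `build_generate_command` and `generate` are illustrative names, and the rule that `size` needs `aspect_ratio` is an assumption based on both arguments being positional.

```python
import subprocess

# Sizes that appear in this document's examples and pricing table.
VALID_SIZES = {"1K", "2K", "4K"}


def build_generate_command(prompt, output, aspect_ratio=None, size=None):
    """Build the argv list for scripts/generate.py (hypothetical helper)."""
    if size is not None and aspect_ratio is None:
        # Assumption: arguments are positional, so size cannot be passed alone.
        raise ValueError("size requires aspect_ratio to be given as well")
    if size is not None and size not in VALID_SIZES:
        raise ValueError(f"unknown size {size!r}; expected one of {sorted(VALID_SIZES)}")
    cmd = ["uv", "run", "scripts/generate.py", prompt, output]
    if aspect_ratio is not None:
        cmd.append(aspect_ratio)
    if size is not None:
        cmd.append(size)
    return cmd


def generate(prompt, output, aspect_ratio=None, size=None):
    """Run the generator and raise CalledProcessError if it fails."""
    subprocess.run(build_generate_command(prompt, output, aspect_ratio, size), check=True)
```

Passing the arguments as a list avoids shell quoting problems with prompts that contain spaces or quotes.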
uv run scripts/edit.py input.png "edit instructions" output.png
Examples:
# Edit existing image
uv run scripts/edit.py photo.png "Change the background to a beach sunset" edited.png
uv run scripts/compose.py "prompt" output.png --refs image1.png image2.png
Examples:
# Combine styles from multiple images
uv run scripts/compose.py "Combine these styles into a logo" logo.png --refs style1.png style2.png
Workflows provide structured approaches for specific visual types. Each workflow follows the PAI 6-step editorial process:
When generating images, follow the appropriate workflow:
Infographics:
1. What data/concept needs visualization?
2. What's the key insight or takeaway?
3. Aspect ratio: 16:9 (landscape) recommended
4. Include: clear hierarchy, minimal text, supporting icons
5. Generate at 2K minimum for text clarity
Diagrams:
1. What system/process is being illustrated?
2. What are the key components and relationships?
3. Style: flat colors, clean lines, minimal detail
4. Generate at 2K for label clarity
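The two checklists above (the first for infographics, the second for diagrams) can be turned into prompt templates. This is a sketch under assumptions: the function names, the prompt wording, and the returned dictionary shape are all hypothetical, and only the style hints, the 16:9 ratio, and the 2K size come from the checklists.

```python
def infographic_request(concept: str, insight: str) -> dict:
    """Assemble a generation request from the infographic checklist."""
    prompt = (
        f"Infographic explaining {concept}. Key takeaway: {insight}. "
        "Clear visual hierarchy, minimal text, supporting icons."
    )
    # 16:9 and 2K follow the checklist's recommendations.
    return {"prompt": prompt, "aspect_ratio": "16:9", "size": "2K"}


def diagram_request(system: str, components: list[str]) -> dict:
    """Assemble a generation request from the diagram checklist."""
    prompt = (
        f"Diagram of {system} showing {', '.join(components)} "
        "and their relationships. Flat colors, clean lines, minimal detail."
    )
    # The diagram checklist names no aspect ratio, so none is set here.
    return {"prompt": prompt, "aspect_ratio": None, "size": "2K"}


request = infographic_request("how neural networks work", "layers transform inputs step by step")
print(request["prompt"])
```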
Requires the GEMINI_API_KEY environment variable. Load it from Geoffrey's secrets file:
source ~/Library/Mobile\ Documents/com~apple~CloudDocs/Geoffrey/secrets/.env
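A script can fail early with a clear message when the key is missing. The sketch below assumes nothing about the real scripts; `require_api_key` is a hypothetical helper name.

```python
import os


def require_api_key(env=None) -> str:
    """Return GEMINI_API_KEY or raise a descriptive error (hypothetical helper)."""
    env = os.environ if env is None else env
    key = env.get("GEMINI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "GEMINI_API_KEY is not set; source the secrets .env file before running"
        )
    return key
```

Note that `source` only makes the variable visible to child processes such as `uv run` if the file exports it (for example with `export GEMINI_API_KEY=...`); a plain `KEY=value` line sets a shell variable that Python will not see.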
By default, save images to /tmp/ or user-specified paths. For persistent storage, use:
~/Library/Mobile Documents/com~apple~CloudDocs/Geoffrey/images/
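The storage rule above could be expressed as a small path resolver. This is an illustrative sketch: `resolve_output` and its parameters are hypothetical, and only the two directories come from this document.

```python
from pathlib import Path

TMP_DIR = Path("/tmp")
PERSISTENT_DIR = (
    Path.home() / "Library" / "Mobile Documents" / "com~apple~CloudDocs" / "Geoffrey" / "images"
)


def resolve_output(filename, persistent=False, explicit_dir=None) -> Path:
    """Pick the output path: explicit directory, persistent storage, or /tmp."""
    if explicit_dir is not None:
        base = Path(explicit_dir).expanduser()
    elif persistent:
        base = PERSISTENT_DIR
    else:
        base = TMP_DIR
    # Keep only the file name so a stray directory prefix cannot escape the base.
    return base / Path(filename).name
```

Building the path with `pathlib` sidesteps the need to escape the space in "Mobile Documents" that the shell command requires.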
DO NOT use the Read tool on generated images.
Why:
Workflow:
| Size | Cost per Image |
|---|---|
| 1K | Free tier / $0.04 |
| 2K | $0.134 |
| 4K | $0.24 |
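For budgeting a batch, the table can be encoded as a lookup. This sketch uses the paid rate of $0.04 for 1K and ignores any free-tier allowance; `estimate_cost` is a hypothetical helper name.

```python
from decimal import Decimal

# Per-image prices in USD, copied from the table above (1K uses the paid rate).
COST_PER_IMAGE = {
    "1K": Decimal("0.04"),
    "2K": Decimal("0.134"),
    "4K": Decimal("0.24"),
}


def estimate_cost(size: str, count: int = 1) -> Decimal:
    """Return the estimated USD cost of generating `count` images at `size`."""
    if size not in COST_PER_IMAGE:
        raise ValueError(f"unknown size {size!r}")
    if count < 0:
        raise ValueError("count must be non-negative")
    return COST_PER_IMAGE[size] * count


print(estimate_cost("2K", 3))  # 0.402
```

`Decimal` keeps the arithmetic exact, which avoids floating-point noise when totals are summed across many images.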