Generate and edit high-quality AI images using Google's Gemini 3 Pro Image model (Nano Banana Pro) via MCP...
Generate stunning 4K images, edit photos, and create graphics with perfect text rendering using Google's latest Gemini 3 Pro Image model via MCP.
Invoke when user:
Get a free API key from Google AI Studio:
Recommended: NanoBanana-MCP (uses Gemini 3 Pro for highest quality)
# Quick install via Claude Code CLI
claude mcp add nano-banana --env GEMINI_API_KEY=your-key-here -- npx -y nanobanana-mcp
Or add to ~/.claude/settings.json manually:
{
"mcpServers": {
"nano-banana": {
"command": "npx",
"args": ["-y", "nanobanana-mcp"],
"env": {
"GEMINI_API_KEY": "your-api-key-here"
}
}
}
}
Alternative: Nano-Banana-MCP by ConechoAI (Gemini 2.5 Flash - faster, lower cost)
{
"mcpServers": {
"nano-banana": {
"command": "npx",
"args": ["nano-banana-mcp"],
"env": {
"GEMINI_API_KEY": "your-api-key-here"
}
}
}
}
Once MCP is configured, these tools become available:
| Tool | Purpose | Key Parameters |
|---|---|---|
gemini_generate_image |
Create new images from text prompts | prompt, model, aspectRatio, imageSize |
gemini_edit_image |
Modify existing images with instructions | imagePath, instructions, model |
continue_editing |
Refine the last generated image | instructions |
get_image_history |
List all generated images in session | - |
| Model ID | Description |
|---|---|
gemini-3-pro-image-preview |
Default. Highest quality, 4K support, best text rendering |
gemini-2.0-flash-exp |
Faster generation, good quality, lower cost |
gemini-2.0-flash-preview-image-generation |
Alternative 2.0 model |
| Size | Use Case |
|---|---|
4K |
Final assets, print, marketing materials |
2K |
Balanced quality and speed |
1K |
Fast iteration, prototyping |
| Feature | Capability |
|---|---|
| 4K Output | Up to 5632×3072 pixels |
| Text Rendering | Accurate text in images (signs, labels, UI) |
| Multi-Image Composition | Combine up to 14 reference images |
| Character Consistency | Maintain same character across 5+ images |
| Google Search Grounding | Real-world accurate imagery |
[Subject] + [Style] + [Details] + [Technical Specs]
Example:
"A cozy coffee shop interior, watercolor illustration style, warm lighting, wooden furniture, steaming cup on table, 4K resolution, soft morning light through windows"
For high-stakes work requiring exact reproducibility, use structured JSON schemas.
Trigger phrases:
| Type | Use Case | Key Controls |
|---|---|---|
marketing_image |
Product shots, hero images | subject, props, lighting, camera, brand locks |
ui_builder |
App screens, dashboards | tokens, screens, containers, components |
diagram_spec |
Flowcharts, infographics | nodes, edges, data constraints |
User: "I need a hero shot for Aurora Lime seltzer"
Claude asks: "For the Aurora Lime hero shot:
Result: Structured JSON with exact specifications that can be iterated field-by-field.
JSON enables changing ONE thing without regenerating everything:
| Change | What Stays Fixed |
|---|---|
| Swap lighting direction | Subject, props, background |
| Try different camera angle | Lighting, props, environment |
| Change background color | Subject geometry, lighting setup |
| Add/remove props | Everything else |
references/json-prompting.md - Full JSON prompting guidereferences/translator-prompt.md - Translator system promptreferences/schemas/ - Template schemas for each typereferences/examples-json.md - Filled-out examplesNano Banana Pro excels at text rendering:
"A vintage movie poster for 'COSMIC ADVENTURE' with bold retro typography, starfield background, astronaut silhouette, 1970s sci-fi aesthetic"
For consistent characters across images:
history:0 reference in subsequent promptsFirst: "A young woman with red curly hair, freckles, green eyes, wearing a blue jacket"
Then: "The same woman from history:0, now sitting at a café, reading a book"
User: "Create an image of a futuristic city at sunset"
Claude uses: gemini_generate_image
Prompt: "Futuristic cityscape at golden hour sunset, towering glass skyscrapers with holographic advertisements, flying vehicles, warm orange and purple sky, photorealistic, 4K resolution, cinematic lighting"
User: "Edit this photo to make it look like winter"
Claude uses: gemini_edit_image
Input: [user's image path]
Instructions: "Transform to winter scene: add snow on ground and surfaces, frost on windows, visible breath, overcast sky, cool blue color grading"
User: "Make the lighting warmer"
Claude uses: continue_editing
Instructions: "Adjust lighting to warmer tones, add golden hour glow, enhance orange/yellow highlights, softer shadows"
Images save to: ~/Documents/nanobanana_generated/
Naming format: generated-[timestamp]-[id].png
.env files for key management in projects| Model | Speed | Quality | Cost | Best For |
|---|---|---|---|---|
gemini-3-pro-image-preview |
Slower | Highest (4K) | Higher | Final assets, print, marketing |
gemini-2.0-flash-exp |
Fast | Good | Lower | Prototyping, iteration, drafts |
| Issue | Solution |
|---|---|
| "API key invalid" | Verify key at AI Studio |
| "Rate limited" | Wait 60s, or upgrade API tier |
| "MCP not connected" | Restart Claude Code, check config syntax |
| "Image not saving" | Check write permissions on output directory |
Works well with:
references/prompting-guide.md - Detailed prompting techniquesreferences/examples.md - Sample prompts by category