Nano Banana Pro image preview

Nano Banana Pro

Array·nano-banana·by Google

Nano Banana Pro generates high quality images from text with sharp details, smooth rendering and impressively accurate visual output

Runtime (p50)
15s
Estimated price
From $0.15
Call the API
prediction.sh
sh
curl -X POST \
  -H "X-API-Key: $EACHLABS_API_KEY" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "nano-banana-pro",
    "version": "0.0.1",
    "input": {
        "prompt": "An ultra-realistic photo captured at golden hour on a wide, open countryside dirt road. In the center stands a single weathered wooden post with a rustic white wooden sign attached to it, featuring clean black sans-serif text that reads ‘NANO BANANA PRO.’ Below the sign, a yellow banana is taped to the post using slightly wrinkled gray duct tape. Warm sunset light, long shadows, soft pastel sky tones, and expansive fields of dry grass fill the background. Shallow depth of field, natural contrast, smooth bokeh, and professional outdoor commercial photography aesthetics.",
        "num_images": 1,
        "aspect_ratio": "16:9",
        "output_format": "png",
        "resolution": "1K"
    },
    "webhook_url": ""
}' \
  https://api.eachlabs.ai/v1/prediction/
Documentation8 sections
  • Overview

    nano-banana-pro — Image Generation AI Model

    Transform text prompts into stunning 4K images with nano-banana-pro, Google DeepMind's advanced text-to-image AI model powered by Gemini 3 Pro Image architecture. This professional upgrade to the nano-banana family delivers breakthrough 94% text accuracy, enabling crystal-clear rendering of complex text, infographics, and multilingual content that most AI image generators struggle with. Ideal for developers seeking a Google text-to-image solution or creators needing text-to-image AI model with studio-grade precision, nano-banana-pro generates production-ready visuals in under 10 seconds, perfect for marketing materials and commercial assets.

  • Capabilities
    • Generates high-quality images with sharp details and smooth rendering
    • Supports native 2K resolution and optional 4K upscaling for print and professional use
    • Excels at multi-image fusion and character identity preservation
    • Advanced text rendering for legible typography in posters, UI mockups, and marketing assets
    • Superior prompt understanding, especially for technical photographic terms and brand-specific colors
    • Real-time processing with stable outputs across batches
    • Contextual grounding using Google Search for fact-based image generation
  • Use cases

    Use Cases for nano-banana-pro

    Marketing teams use nano-banana-pro's superior text rendering for global campaigns, generating localized posters like "Create a billboard ad with 'Summer Sale 50% Off' in Japanese, golden hour lighting on a beach scene"—delivering accurate multilingual text without manual design work.

    Developers building AI image generator API tools for e-commerce input product photos and prompts such as "Place this sneaker on a urban street with neon signs and rainy reflections," fusing multiple references for photorealistic mockups that speed up prototyping.

    Designers leverage its thinking mode and professional controls for infographics, prompting "Chart of solar system orbits with labeled distances in km, deep space background, telephoto view"—producing data-rich visuals grounded in real web knowledge for educational content.

    Content creators benefit from 4K outputs for high-res print materials, combining style references to maintain character consistency in series, perfect for professional presentations or social media assets requiring sharp details.

  • Tips & tricks

    How to Use nano-banana-pro on Eachlabs

    Access nano-banana-pro seamlessly on Eachlabs via the Playground for instant testing, API for scalable integrations, or SDK for custom apps. Input a text prompt, optional up to 14 reference images, aspect ratio, and controls like lighting; receive 4K PNG/JPEG outputs in under 10 seconds with SynthID watermarking for responsible use. Start generating high-accuracy visuals today.

    ---
  • Technical spec

    What Sets nano-banana-pro Apart

    nano-banana-pro stands out in the competitive text-to-image landscape with native 4K resolution (4096x4096), supporting aspect ratios like 1:1, 16:9, 9:16, 4:3, and 3:4, plus output formats including PNG, JPEG, and WebP—all processed in ~5-10 seconds. Its 94% text accuracy handles legible text in multiple languages, typography styles, and complex layouts, allowing users to create posters or infographics without post-editing fixes.

    • Multi-image fusion combines up to 8-14 reference images (like logos and product photos) with natural language prompts, ensuring brand consistency and character continuity across outputs—ideal for nano-banana-pro API integrations in e-commerce apps.
    • Web search grounding pulls real-time data for factually accurate visuals, such as current events in news graphics, setting it apart from models without live knowledge integration.
    • Professional controls for camera angles, lighting, depth of field, and color grading provide precise edits via conversational prompts, enabling studio-quality results for designers.
  • Things to be aware of
    • Experimental features like multi-stage planning and reasoning-aware upscaling may require careful prompt structuring
    • Some users report occasional inconsistencies in micro-textures or lighting gradients, especially at higher resolutions
    • Performance can vary based on prompt complexity, with more detailed prompts requiring longer processing times
    • Resource requirements are higher for 4K generation and multi-image fusion, which may impact workflow efficiency
    • Consistency is generally strong but can degrade over very long editing sessions or with highly abstract prompts
    • Positive feedback highlights the model's speed, accuracy, and ability to handle complex instructions
    • Common concerns include occasional text rendering issues for longer phrases and minor artifacts in upscaling
  • Key considerations
    • The model excels with clear, detailed prompts and benefits from iterative refinement for optimal results
    • Best practices include specifying technical terms (e.g., lens types, lighting cues) and using multi-turn conversations for complex edits
    • Quality vs speed: While generation is fast, higher resolutions and complex prompts may require more processing time
    • Prompt engineering tips: Use precise language, reference real-world objects or styles, and leverage Google Search grounding for contextually accurate outputs
    • Avoid overly abstract or ambiguous prompts, as these can lead to inconsistent or less accurate results
  • Limitations
    • Primary technical constraints include occasional artifacts in micro-textures and lighting gradients at higher resolutions
    • May not be optimal for highly abstract or ambiguous prompts, where outputs can be less consistent or accurate

        Note: The model won't always follow the exact number of image outputs that the user explicitly asks for.

Related models

4 models
* FAQ

About Nano Banana Pro

01 / 03

What is Nano Banana Pro text-to-image and what sets it apart from the base model?

Nano Banana Pro is Google's professional-tier text-to-image model in the Nano Banana family. It generates higher-fidelity images from natural language prompts compared to the standard Nano Banana, offering improved photorealism, richer compositional detail, and better rendering of complex scenes and fine textures for professional content creation use cases.