GEMINI-3
Gemine 3 Pro Edit transforms uploaded images through prompt based editing with smooth, accurate and high quality results
Avg Run Time: 0.000s
Model Slug: gemini-3-pro-image-preview-edit
Playground
Input
Output
Example Result
Preview and download your result.

API & SDK
Create a Prediction
Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.
Get Prediction Result
Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.
Readme
Overview
gemini-3-pro-image-preview-edit — Image Editing AI Model
gemini-3-pro-image-preview-edit, Google's advanced image-to-image AI model from the Gemini 3 family, enables precise editing of uploaded images using natural language prompts for smooth, high-quality transformations while preserving original composition. Known internally as part of Gemini 3 Pro Image Edit (Nano Banana Pro), this preview model excels in controlled modifications like object replacement, lighting adjustments, and background swaps, delivering professional-grade results up to 4K resolution. Developers and creators searching for a reliable Google image-to-image solution find it ideal for iterative workflows without regenerating entire visuals.
Technical Specifications
What Sets gemini-3-pro-image-preview-edit Apart
gemini-3-pro-image-preview-edit stands out in the image-to-image AI model landscape with its ability to perform localized edits on existing images, maintaining layout and structure unlike full regeneration models. This enables rapid refinements such as typography corrections or perspective shifts, cutting iteration time for designers. It supports up to 4K output resolution with sharp text rendering in multiple languages, outperforming predecessors in artifact reduction and physical accuracy.
- Advanced conversational editing: Uses thought signatures for multi-turn refinements, allowing iterative changes like "adjust lighting to golden hour" on prior outputs for consistent series.
- Real-world knowledge grounding: Integrates Google Search data for context-aware edits, such as accurate product mockups with current trends, ideal for AI image editor API integrations.
- High-resolution control: Generates 1K default, 2K, or 4K images with aspect ratios like 16:9; processes inputs via media_resolution parameters for fine details.
Average processing favors 2K for speed and quality balance, with formats optimized for professional assets like print-ready visuals.
Key Considerations
- The model uses a "Thinking" process by default, generating interim images to refine composition and logic before producing the final output
- Multi-turn conversational editing is supported, preserving context with "Thought Signatures" for each edit step
- Higher resolutions improve detail and text clarity but increase token usage and latency; balance quality and speed based on project needs
- For best results, provide clear, specific prompts and leverage reference images when consistency is critical
- Editing workflows require returning all "Thought Signatures" to avoid errors in multi-step processes
- Prompt engineering is important: detailed, structured prompts yield more accurate and controllable results
Tips & Tricks
How to Use gemini-3-pro-image-preview-edit on Eachlabs
Access gemini-3-pro-image-preview-edit seamlessly on Eachlabs via the Playground for instant testing, API for production gemini-3-pro-image-preview-edit API integrations, or SDK for custom apps. Upload an image, provide a descriptive prompt like object swaps or lighting changes, specify resolution (up to 4K) and aspect ratio, then generate high-fidelity PNG outputs with preserved structure in seconds.
---Capabilities
- Generates and edits images from text prompts with high fidelity and accuracy
- Supports multi-turn, conversational editing workflows with preserved context
- Excels at rendering clear, legible text and complex diagrams within images
- Maintains character and object consistency across edits using reference images
- Integrates real-world knowledge via grounding for factual, data-driven outputs
- Handles professional asset production, including UI mockups, infographics, and creative visual content
- Offers fine-grained control over image physics (lighting, focus, color grading) and composition
What Can I Use It For?
Use Cases for gemini-3-pro-image-preview-edit
E-commerce marketers upload product photos and prompt edits like "replace background with marble kitchen counter, add morning light," yielding photorealistic composites for listings without studio reshoots—leveraging its layout preservation for AI photo editing for e-commerce.
UI/UX designers refine mockups by describing "swap button color to brand blue, enhance shadow depth," maintaining composition fidelity across iterations to speed prototyping with edit images with AI precision.
Developers building automated image editing APIs integrate gemini-3-pro-image-preview-edit for apps handling user uploads, using prompts such as "correct text to 'Sale 50% Off' in elegant font, adjust perspective to eye-level," ensuring scalable, high-fidelity outputs with 4K support.
Content creators perform style transfers on portraits, like "apply magazine color grading and background replacement to tropical beach," achieving client-ready results with minimal artifacts via its reasoning-driven edits.
Things to Be Aware Of
- Some experimental features, such as multi-turn editing and grounding, may behave unpredictably in edge cases or with ambiguous prompts
- Users have reported occasional glitches in the API during early access, especially with editing workflows
- High-resolution outputs (2K/4K) require more computational resources and may increase latency
- Consistency across multiple edits is generally strong, but complex compositions may still require manual refinement
- Positive feedback highlights the model's realistic image generation, strong composition, and improved text rendering over previous versions
- Common concerns include occasional imperfections in text lettering and the need for precise prompt engineering to achieve desired results
- All generated images include a SynthID watermark for provenance and authenticity
Limitations
- The model's parameters and full technical details are not publicly disclosed, limiting transparency for some advanced users
- May not be optimal for ultra-fast, high-volume generation tasks where speed is prioritized over quality
- Complex or highly abstract prompts may still yield imperfect results, especially in text rendering or intricate scene composition
Pricing
Pricing Type: Dynamic
Charge $0.15 per image generation
Pricing Rules
| Parameter | Rule Type | Base Price |
|---|---|---|
| num_images | Per Unit Example: num_images: 1 × $0.15 = $0.15 | $0.15 |
Related AI Models
You can seamlessly integrate advanced AI capabilities into your applications without the hassle of managing complex infrastructure.
