
Imagen 4 | Fast
Use this fast version of Imagen 4 when speed and cost are more important than quality
Avg Run Time: 13.000s
Model Slug: imagen-4-fast
Category: Text to Image
Input
Output
Example Result
Preview and download your result.

Create a Prediction
Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.
Get Prediction Result
Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.
Overview
A text-to-image diffusion model developed by Google DeepMind. It generates high-quality, photorealistic images from text prompts. Bring your ideas to life—faster, sharper, and truer to your vision.
Technical Specifications
Aspect ratios supported:
- 1:1: 1024×1024
- 3:4: 896×1280
- 4:3: 1280×896
- 9:16: 768×1408
- 16:9: 1408×768
Prompt languages: English, Chinese (Simplified & Traditional), Hindi, Japanese, Korean,Portuguese, Spanish.
Prompt length: On average, 1 token equals about 4 characters, so prompts can be up to approximately 1,900 characters long.
Key Considerations
Prompt quality matters: Clear, detailed prompts lead to better results.
Not fact-grounded: Expect some inconsistencies in details and realism.
Style & text limits: Handles diverse styles and text better—but not perfectly.
Tips & Tricks
Be specific: Use clear, detailed prompts. Define the subject, key features, and any actions it’s performing.
Set the scene: Describe the environment and mood—include background elements, lighting, weather, or time of day.
Specify a style: Mention the desired artistic style, such as photorealism, vector art, or a specific art movement.
Guide composition: Include parameters for camera angle and compositional elements. Structured, descriptive language helps generate more targeted, intentional visuals.
Capabilities
Photorealism: Create lifelike images of people, animals, landscapes, and more—down to the finest detail.
Ultra-sharp details: Rich textures, vibrant colors, and stunning close-ups with natural depth and gradients.
Smarter text rendering: Better spelling, longer passages, and more sophisticated typography—ideal for comics, collectibles, and design work.
More styles, more control: From hyper-realistic to abstract, it handles a wide range of visual styles with improved accuracy.
Fast mode: Explore ideas at lightning speed—up to 10× faster than Google’s earlier models.
High-resolution output: Generate crisp, creative visuals at up to 2K resolution.
What Can I Use It For?
Comics and storybooks: Generate characters, scenes, and panels with readable text and clear visuals.
Concept art: Quickly explore visual ideas for games, films, and animation.
Commercial and marketing visuals: Produce eye-catching imagery for product mockups, posters, and digital content.
Collectibles and packaging: Design greeting cards, covers, and layouts with improved text rendering.
Things to Be Aware Of
No customization tools: It doesn’t support style transfer, subject tuning, or few-shot personalization.
No image editing: Features like inpainting, outpainting, masking, or image upscaling are not available.
No negative prompting: You can’t exclude elements (e.g., “no text,” “no watermark”) via prompts.
Limitations
Lack of factual grounding: Imagen isn’t built for real-world accuracy. It can introduce artifacts in complex scenes, especially
with small faces, text, or thin structures.
Centering issues: Struggles with perfect alignment, such as placing a circle exactly in the center.
Unclear prompts: Nonsensical input (like random characters or emojis) can lead to unpredictable results.
Output Format: JPG,PNG
Pricing Detail
This model runs at a cost of $0.020 per execution.
Pricing Type: Fixed
The cost remains the same regardless of which model you use or how long it runs. There are no variables affecting the price. It is a set, fixed amount per run, as the name suggests. This makes budgeting simple and predictable because you pay the same fee every time you execute the model.
Related AI Models
You can seamlessly integrate advanced AI capabilities into your applications without the hassle of managing complex infrastructure.