← Home Gemini / How to Generate Product Mockups in Gemini…
10 min
Gemini

How to Generate Product Mockups in Gemini 2.0 Flash with Imagen 3

promptyze
Editor · Promptowy
01.04.2026 Date
10 min Reading time
How to Generate Product Mockups in Gemini 2.0 Flash with Imagen 3
AI-generated product mockup concept promptowy.com

E-commerce photography is expensive. A single product shoot with proper lighting, backgrounds, and retouching can run $500-2000 per SKU. Now multiply that by your entire catalog. Google’s Gemini 2.0 Flash integrated with Imagen 3 won’t replace professional photography for hero shots, but for everyday product listings, lifestyle mockups, and A/B testing? It cuts costs dramatically and delivers results in minutes instead of weeks.

This tutorial walks through the current state of Google’s AI image generation pipeline — what works, what doesn’t, and how to prompt your way to usable product mockups. We’re focusing on Gemini 2.0 Flash with Imagen 3 because that’s what actually exists and ships today, not vaporware announcements.

nn

What You’ll Need

n

Before diving in, gather your tools. You’ll need a Google AI Studio account (free tier works fine for testing), product reference images or detailed descriptions, and patience for iteration. Imagen 3 is good, but it’s not magic — you’ll refine prompts multiple times before hitting gold.

Google AI Studio gives you access to Gemini 2.0 Flash, which can generate images through Imagen 3. The interface is cleaner than most AI playgrounds, and you can switch between text generation and image generation modes seamlessly. If you’re generating mockups for client work, consider the paid tier — rate limits on the free version will slow you down during batch generation.

nnnn

Understanding the Imagen 3 Workflow

n

Imagen 3 doesn’t work like Midjourney or DALL-E where you throw a prompt at it and pray. It performs best when you structure requests in layers: subject description, material properties, lighting setup, background context, camera angle. Think like a photographer writing a shot list, not a poet describing a dream.

The model handles photorealistic renders well but struggles with brand-specific details like logos, packaging text, or intricate patterns. If your product has embossed text or complex graphics, you’ll need to composite those in post. Imagen 3 excels at shapes, textures, lighting, and spatial relationships — lean into those strengths.

nn

Pro tip ✅

Generate your base mockup first, then use traditional photo editing tools like Photoshop or Canva to add logos and text overlays. AI-generated text still looks like alphabet soup.

nn

Crafting Your First Product Mockup Prompt

n

Start simple. Here’s a foundational prompt structure for a basic product shot:

nn

Product photography of a ceramic coffee mug, white matte finish, cylindrical shape with curved handle, placed on light gray surface, soft natural lighting from left side, clean white background, shallow depth of field, shot with 50mm lens, professional studio quality, high resolution

nn

This prompt hits the key elements: product type and material, color and finish, shape details, surface placement, lighting direction and quality, background choice, camera characteristics. Each element gives Imagen 3 concrete instructions instead of vague aspirations.

nn

Now let’s add context for lifestyle mockups:

nn

Product photography of a stainless steel water bottle, brushed metal finish, placed on wooden picnic table outdoors, dappled sunlight filtering through trees, forest background slightly out of focus, condensation droplets on bottle surface, natural environment, 35mm lens perspective, editorial photography style

nn

The lifestyle approach embeds your product in a scene. You’re selling the experience, not just the object. Imagen 3 handles environmental context surprisingly well — just be specific about backgrounds, lighting conditions, and mood.

nn

Controlling Lighting and Shadows

n

Lighting makes or breaks product photography, and the same applies to AI-generated mockups. Imagen 3 responds to directional cues, but you need to spell them out.

nn

Product photography of a leather wallet, dark brown cognac color, lying flat on black marble surface, dramatic side lighting from right creating long shadows, key light with soft fill, luxury product aesthetic, high contrast, commercial photography style, tack-sharp focus

nn

Notice the specificity: side lighting from a direction, shadow length, key and fill light mention, contrast level. Imagen 3 won’t always nail the shadow physics perfectly, but you’ll get closer to intentional lighting than random results.

nn

For softer, more approachable product shots:

nn

Product photography of a skincare serum bottle, frosted glass with gold cap, standing upright on white seamless backdrop, soft diffused lighting from above and front, minimal shadows, bright and airy atmosphere, beauty product photography, clean aesthetic, professional retouching look

nn

Diffused lighting eliminates harsh shadows. This works well for beauty, wellness, and lifestyle products where you want an inviting, not dramatic, presentation.

nnnn

Background Replacement and Scene Setting

n

You’re not limited to studio backdrops. Imagen 3 can place products in kitchens, offices, living rooms, outdoor settings — anywhere you can describe. The trick is balancing detail in the environment with focus on the product.

nn

Product photography of a smartwatch with black band, displayed on wrist of person wearing business casual outfit, modern minimalist office background with blurred MacBook and coffee cup on desk, natural window light from left, shallow depth of field keeping watch in sharp focus, lifestyle editorial style

nn

Here we’re using a partial human element (wrist) without showing faces, which avoids the uncanny valley problem. The background has identifiable props (MacBook, coffee) that suggest context without stealing focus. Shallow depth of field keeps the watch sharp while softening distractions.

nn

For e-commerce catalog shots where you need consistency across multiple products:

nn

Product photography of a backpack, navy blue canvas with leather straps, centered on pure white background, evenly lit from all sides with no shadows, floating product style, technical catalog photography, orthogonal front view, every detail visible, commercial product shot

nn

The floating product on white background is e-commerce standard. Even lighting from all sides eliminates shadows, giving you clean cutout potential if you need transparent backgrounds later.

nn

Note 💡

Orthogonal views (straight-on, no perspective distortion) work best for products where size and shape need to be accurately represented. Save dramatic angles for marketing materials, not technical catalogs.

nn

Material and Texture Specification

n

Imagen 3 understands material properties, but you need to name them explicitly. Generic terms like “shiny” or “soft” produce generic results. Use proper material vocabulary.

nn

Product photography of a silk scarf, lightweight fabric with subtle sheen, draped over minimalist concrete pedestal, soft indirect lighting highlighting fabric texture and flow, muted color palette, fine art product photography, macro detail visible in weave, shallow focus on fabric folds

nn

Silk has specific qualities: lightweight, sheen, drape, weave. Naming these helps the model render believable fabric physics. Same principle applies to any material:

nn

Product photography of a chef's knife, Damascus steel blade with visible folded pattern, ebony wood handle with brass rivets, resting on dark slate cutting board, dramatic low-key lighting emphasizing steel texture and grain, professional culinary photography, macro sharpness on blade edge

nn

Damascus steel, ebony wood, brass rivets — each material has distinct visual properties. The more specific your material callouts, the more believable your mockup.

nn

Camera Angle and Composition

n

Photographers think in focal lengths and perspectives. So should you when prompting Imagen 3. Different angles serve different purposes.

nn

Product photography of a sneaker, white and red colorway, shot from 45-degree angle showing side profile and top, placed on reflective black acrylic surface, studio lighting with rim light separating product from background, commercial footwear photography, 85mm lens compression, sharp focus throughout

nn

The 45-degree angle shows multiple sides without requiring separate shots. Rim lighting creates separation. 85mm compression makes the product look heroic without distortion.

nn

For flat lay compositions popular in social media:

nn

Flat lay product photography of a notebook, fountain pen, reading glasses, and coffee cup arranged on light wood surface, shot directly from above, natural morning light from window, lifestyle desk scene, editorial magazine style, all items in focus, warm color grading

nn

Flat lays require even focus across the frame. Mention that explicitly, or Imagen 3 might blur parts of your composition thinking it’s being artistic.

nn

Avoid 🚫

Don’t ask for extreme fish-eye or ultra-wide perspectives unless distortion is intentional. Imagen 3 struggles with perspective correction on severe angles, and your product will look wonky.

nn

Batch Generation Strategy

n

Once you’ve dialed in a prompt that works, batch generation becomes straightforward. In Google AI Studio, save your prompt as a template and create variations by swapping color, angle, or background parameters.

nn

For example, if you nailed the lighting and composition for a water bottle mockup, create color variants:

nn

Product photography of a stainless steel water bottle, [COLOR] powder-coated finish, placed on wooden surface, soft natural lighting from left, clean white background, 50mm lens, professional quality

nn

Swap [COLOR] with midnight blue, forest green, coral pink, charcoal gray — you’ve just generated four catalog variations in the time it takes to type. The base lighting and composition remain consistent, giving your product line visual coherence.

nn

For A/B testing different backgrounds:

nn

Product photography of a laptop stand, aluminum construction, placed on [BACKGROUND], bright even lighting, commercial product photography, sharp focus, professional quality

nn

Test [BACKGROUND] options: white seamless backdrop, modern office desk, home workspace, minimalist studio setup. See which environment resonates with your target audience before committing to a full photo shoot.

nnnn

Post-Processing and Refinement

n

AI-generated mockups need finishing touches. Imagen 3 produces solid foundations, but you’ll want to refine details in post. Common fixes include sharpening product edges, adjusting color balance, removing artifacts, and adding brand elements.

nn

Use your standard photo editing workflow — Photoshop, Lightroom, or even Canva for simpler edits. The advantage of AI mockups isn’t that they’re perfect out of the box, it’s that you skip the most expensive part: the initial shoot and setup. Post-processing time stays roughly the same.

nn

Pro tip ✅

Generate multiple variations of each mockup and composite the best parts. AI doesn’t always nail everything in one shot, but it might produce perfect lighting in one generation and perfect product shape in another. Blend them.

nn

Real-World Cost Comparison

n

A traditional product photography shoot for a single SKU with three angle variations typically costs $300-800 depending on your market. That includes photographer time, studio rental, equipment, and basic retouching. For a catalog of 50 products, you’re looking at $15,000-40,000.

nn

Generating the same mockups with Gemini 2.0 Flash and Imagen 3 costs about $0.10-0.50 per image on paid tiers (free tier is slower but functional). For 150 images (50 products, three angles each), you’re spending $15-75 plus maybe 10-20 hours of your time prompting and refining. Even factoring in labor, the cost difference is substantial.

nn

This doesn’t mean AI replaces professional photography entirely. Hero shots for ad campaigns, lifestyle imagery with models, and products where every detail matters still benefit from human photographers and proper sets. But for routine catalog updates, seasonal color variations, and fast iteration? AI mockups make financial sense.

nn

Warning ⚠️

AI-generated images can’t be copyrighted in most jurisdictions (yet). If intellectual property protection matters for your mockups, consult legal advice. For disposable marketing content and internal testing, it’s less critical.

nn

Common Pitfalls and Solutions

n

Imagen 3 has quirks. It sometimes adds random objects you didn’t request, misinterprets material properties, or generates products that look slightly off. Here’s how to course-correct.

nn

If you’re getting unwanted elements in frame, add explicit negative cues:

nn

Product photography of a wine bottle, dark green glass, centered on white background, studio lighting, no other objects, no text on label, clean minimalist composition, professional catalog shot

nn

The “no other objects, no text” instructions help prevent Imagen 3 from adding creative flourishes you didn’t ask for.

nn

If material rendering looks wrong, increase specificity:

nn

Product photography of a glass vase, clear transparent glass with high refraction, subtle light caustics visible through glass, placed on white surface, backlit with soft glow, luxury product photography, crystal clarity, no frosting or texture

nn

Calling out refraction, caustics, and explicitly stating “no frosting” guides the model toward the exact glass properties you want.

nn

Pro tip ✅

If a prompt isn’t working after three attempts, start over with simpler language. Sometimes you’ve overcomplicated the request and the model is confused. Strip it back to basics, then add detail incrementally.

nn

When to Use AI Mockups vs Real Photography

n

Not every situation calls for AI-generated product imagery. Here’s a practical breakdown.

nn

Use AI mockups for: catalog updates where you need dozens of variations quickly, A/B testing different backgrounds or contexts before committing to shoots, seasonal color launches where the product shape stays the same, internal presentations and mockups for stakeholder approval, and fast iteration during product development phases.

nn

Stick with real photography for: products where tactile qualities matter (fabrics, textures), anything involving food (AI food looks plastic), shots with human models or hands interacting with products, hero images for major ad campaigns, and products with intricate branding or text that needs to be legible.

nn

The sweet spot is hybrid workflows. Use AI to generate background plates and product foundations, then composite in real product detail shots or professionally photographed hero elements. You get speed and cost savings without sacrificing quality where it matters.

nn

What This Means for Your Workflow

n

Google’s Gemini 2.0 Flash integrated with Imagen 3 isn’t revolutionary, but it’s practical. E-commerce teams can cut catalog photography budgets significantly, especially for routine updates and testing. The learning curve is steeper than clicking “generate,” but once you understand prompt structure and material specification, you’ll produce usable mockups faster than scheduling a photographer.

nn

The technology will improve. Imagen 4, whenever it arrives, will likely handle brand details and text better. Until then, treat AI mockups as a foundation, not a finished product. Generate fast, refine smart, and save the big photography budget for shots that actually need a human behind the camera.

author avatar
promptyze
promptyze
Founder · Editor · Promptowy

Piszę o AI i automatyzacji od 3 lat. Prowadzę promptowy.com.

More →