How to Generate Stunning Images with Nano Banana: The Complete Prompt Guide
Master Nano Banana image generation with 8 copy-paste prompts covering portraits, products, editorial art, social media, and text-in-image design.
Nano Banana is Promptyze’s home-grown AI image generation playground, powered by Google’s Gemini Flash model under the hood. Think of it as Gemini’s image capabilities with a focused, no-fluff interface tuned for people who actually want results — not a settings menu with 47 sliders they’ll never touch. Whether you’re generating product shots, editorial art, social media visuals, or portrait photography, the prompting logic is the same: be specific, be structured, and stop writing prompts like you’re texting a friend.
This guide covers everything from your first image to multi-character consistency, high-detail renders, and readable text inside images. You’ll walk away with a stack of copy-paste prompts and a clear mental model of how Nano Banana interprets your input. No vague advice, no “experiment and see” cop-outs — just what works, and why.
What You’ll Achieve
By the end of this tutorial, you’ll know how to write prompts that produce sharp, high-detail images across five use cases: portrait photography, product photography, editorial/concept art, social media graphics, and text-in-image designs. You’ll also understand how to maintain subject consistency across multiple generations and avoid the most common prompting mistakes that produce muddy, off-brief results.
How Nano Banana Reads Your Prompt
Nano Banana doesn’t treat your prompt as a wish — it treats it as a technical brief. The model parses your input for subject, environment, lighting, style, mood, and technical specs roughly in that order of priority. If you leave any of those out, it fills the gaps itself, and its defaults are… fine. Just fine. If you want great, you supply the detail. The golden structure is: Subject → Action/Pose → Environment → Lighting → Style → Technical spec. That’s the order that gets results.

Portrait Photography Prompts
Portrait generation is where most people start, and where most people write their worst prompts. “A woman with brown hair” is not a portrait brief — it’s barely a description. You need to specify the lens, the lighting setup, the mood, and the context. Here’s a prompt that actually delivers:
Portrait of a woman in her late 30s, close-up shot, natural window light from the left, soft shadows, relaxed expression, wearing a cream linen shirt, blurred urban background, shot on 85mm lens, shallow depth of field, editorial photography style, high detail, photorealistic
That prompt works because it mimics how a photographer would brief a shoot. The 85mm lens reference tells the model to compress the background and flatten facial features in a flattering way. “Natural window light from the left” creates a specific, believable shadow pattern. Remove any of those details and you get a more generic result — still fine, but not distinctive.
For a more dramatic editorial look, swap the lighting and style parameters:
Portrait of a man in his 50s, dramatic side lighting, deep shadows, high contrast, weathered face with strong bone structure, wearing a dark wool coat, black background, shot on 50mm lens, noir photography style, ultra-detailed, photorealistic
Pro tip ✅
Adding a specific lens focal length (35mm, 50mm, 85mm, 135mm) is one of the highest-leverage moves in portrait prompting. Each focal length has a different relationship to facial proportions — 35mm introduces slight distortion for a reportage feel, 85mm flatters faces for classic portraiture, 135mm compresses features for a cinematic editorial look. One word, dramatically different output.
Product Photography Prompts
Product shots are where Nano Banana genuinely earns its keep for commercial users. The key is surface, light source, and negative space — because product photography is essentially a conversation between the object and the light around it.
Minimalist product photography, glass perfume bottle with gold cap, placed on white marble surface, soft diffused studio lighting, subtle reflections, clean white background, luxury aesthetic, macro lens, ultra-sharp focus, commercial photography style
Notice the “subtle reflections” call-out — without it, you often get either no reflections (looks flat and fake) or harsh mirror reflections (looks wrong). Specifying the surface material and then asking for “subtle” reflections threads that needle.
Product shot of a matte black leather wallet, slightly open to show card slots, placed on dark textured concrete, single spotlight from above, deep shadows, high contrast, luxury menswear aesthetic, top-down angle, sharp focus, commercial photography

Pro tip ✅
For product shots, always specify both the surface the product sits on AND the background. They can be the same (white table, white background) or deliberately different (dark concrete surface, warm amber background). The contrast between surface and background is what gives product images visual depth — without it, products look pasted-in rather than placed.
Editorial and Concept Art Prompts
Editorial images live between photography and illustration — they need to feel real enough to be credible, but composed enough to carry a concept. This is where environment and mood carry the most weight in your prompt.
Editorial concept image, lone figure standing at the edge of a vast salt flat at golden hour, long shadow stretching behind them, warm amber and rose tones, wide-angle composition, sense of solitude and scale, cinematic photography, high detail, photorealistic
The phrase “sense of solitude and scale” is doing real work here — it’s an instruction about compositional intent, not just a description. Nano Banana responds to these mood directives and uses them to make compositional choices like subject placement, negative space, and color temperature.
Surrealist editorial image, oversized vintage clock melting over the edge of a glass table in a sunlit mid-century modern living room, photorealistic rendering, warm afternoon light, dust particles in the air, hyperdetailed, 4:3 ratio, magazine cover composition
Pro tip ✅
Saying “magazine cover composition” or “editorial spread composition” tells the model to leave deliberate negative space for text overlay — which is exactly what you want if you’re actually building visual content. It also tends to produce more intentional, less cluttered compositions than unconstrained generation.
Social Media Visual Prompts
Social media visuals have specific constraints: they need to pop at small sizes, work in square or portrait crops, and communicate instantly. That means bold color contrast, clear subject hierarchy, and minimal visual clutter. Your prompt needs to reflect those constraints explicitly.
Social media visual, square format, bold typographic composition, minimalist design, deep navy background, single centered object — a steaming coffee cup — warm orange accent lighting, high contrast, clean and modern aesthetic, graphic design style, ultra-sharp
Instagram-style portrait, young woman laughing, candid moment, golden hour backlight creating rim lighting, bokeh background of soft green foliage, warm color grade, film grain overlay, lifestyle photography aesthetic, vertical 4:5 ratio, vibrant and natural
Note 💡
Specifying the aspect ratio or format (square, vertical 4:5, horizontal 16:9) in your prompt gives you much more useful output for actual publishing. A horizontally-composed image dropped into a square crop loses its subject half the time — getting the ratio right at generation stage saves you a painful cropping session later.
Text-in-Image Prompts
Getting readable, correctly spelled text inside an AI-generated image has historically been the Achilles heel of every image model. Nano Banana handles it better than most, but you need to set it up correctly. The key rules: keep the text short (under 6 words per element), specify its placement precisely, and use high-contrast backgrounds.
Minimalist motivational poster, bold sans-serif white text reading "FOCUS WINS" centered on a deep black background, subtle geometric texture behind text, clean modern graphic design, high contrast, print-ready quality, sharp typography
Coffee shop chalkboard sign, hand-lettered style text reading "DAILY SPECIAL" at the top and "COLD BREW $4" below, chalk texture, dark slate background, warm café lighting, photorealistic, high detail
Warning ⚠️
The longer and more complex the text you ask for, the higher the chance of character errors or garbled letters. Stick to 1-6 words per text element and be specific about font style (bold sans-serif, hand-lettered, serif headline) rather than leaving it to chance. If you need longer text in an image, generate the visual first and overlay text yourself in a design tool — that’s not cheating, that’s just being a professional.
Maintaining Subject Consistency Across Generations
One of the trickier challenges with any AI image generator is generating the same character or object across multiple images without it morphing into something slightly different each time. The approach that works best with Nano Banana is what you might call “character anchoring” — building a dense, specific physical description and using it verbatim across every prompt in a series.
Portrait of a woman, early 40s, sharp angular jawline, dark brown eyes, short silver-streaked black hair worn tucked behind one ear, small scar above left eyebrow, olive skin tone, wearing a charcoal grey blazer — standing in a glass-walled office, natural daylight, professional editorial photography, photorealistic
That character description — the angular jaw, silver-streaked hair, scar, olive skin — is your consistency anchor. Copy those exact descriptors into your next prompt and change only the environment and context. It’s not perfect, but it gets you significantly closer to a coherent visual identity across a set of images than a loose description will.
Portrait of a woman, early 40s, sharp angular jawline, dark brown eyes, short silver-streaked black hair worn tucked behind one ear, small scar above left eyebrow, olive skin tone, wearing a black turtleneck — sitting at a café table, warm ambient light, shallow depth of field, lifestyle photography, photorealistic

Pro tip ✅
Build a “character card” — a short block of text with your subject’s physical anchors — and keep it in a notes doc or clipboard manager. Every time you generate that character, paste the card at the start of your prompt. It takes 10 seconds and saves you 10 rounds of regeneration trying to get the face back to where it was.
Getting Maximum Detail and Sharpness
Nano Banana responds well to technical quality descriptors appended at the end of a prompt. Think of them as a finishing instruction — they don’t describe the subject, they describe the render quality you expect. A small set of reliable terms to append: ultra-detailed, hyperrealistic, sharp focus, high resolution, 4K quality, photorealistic rendering, professional photography. Stack 2-3 of these at the end of any prompt and you’ll consistently get crisper, more detailed output than the default.
Macro photograph of a honeybee on a lavender flower, extreme close-up, individual hairs on the bee's body visible, pollen dusting on legs, soft bokeh background of purple blooms, natural outdoor lighting, ultra-detailed, sharp focus, hyperrealistic, professional macro photography
Pro tip ✅
Don’t front-load quality modifiers — put them at the end of your prompt where they function as a render quality instruction rather than a subject description. “Ultra-detailed portrait of a woman” reads differently than “portrait of a woman, ultra-detailed” — the second version applies the quality modifier to the whole output, not just the character description.
What’s Actually Worth Your Time
Nano Banana rewards structured thinking, not longer prompts. The photographers and designers who get the best results aren’t writing essays — they’re writing precise briefs: subject, light, style, technical spec, done. The prompts in this guide average maybe 30-40 words, which is the sweet spot between “too vague” and “overspecified chaos.” Start with the portrait and product templates above, swap in your own subjects and contexts, and you’ll have usable images inside your first five minutes. The character anchoring technique is worth spending real time on if you’re building any kind of recurring visual content — it’s the closest thing to reliable consistency that current AI image generation offers, and it costs you nothing but a well-organized clipboard.


