Powered by the new Gemini 3 model, Nano Banana Pro was released today.

More news in the world of AI wouldn’t you know. Google today announced Nano Banana Pro, the newest version of its AI image generator built on the Gemini 3 Pro reasoning engine. The name may still sound like a Mario Kart power-up, but the upgrade itself is substantial. And I continue to be impressed with how rapidly the company is iterating on this impressive bit of tech.

Three major improvements headline the release:

  • Sharper, multilingual text rendering (arguably Google’s best yet)
  • Better real-world reasoning powered by Gemini 3
  • Creator-grade controls including 14 reference images, 5 person consistency, and 4K outputs

For filmmakers, designers, advertisers and anyone generating ideas at 1 a.m. (a familiar situation here at Stark Insider), this matters. Nano Banana Pro is built for practical creation, not just amusement. Though as you can see in my testing below, amusement is certainly possible if you so choose.

Why This Update Matters for Creators

Google keeps emphasizing that Nano Banana Pro is “for creation, not just generation.” In practical terms, the model can understand and apply context in ways earlier versions could not.

Context-Aware Image Generation

Nano Banana Pro now pulls information from Google Search to provide up-to-date environmental knowledge. Ask for “a San Francisco skyline in tomorrow’s weather,” and it can adjust lighting, clouds, and atmosphere based on real conditions. Pretty cool.

For creators, this helps with:

  • Shotlist planning
  • Storyboards set in actual cities or landmarks
  • Product mockups that reference real brands or signage
  • Mood boards that reflect real-world atmospheric details

It’s not perfect, but it’s meaningfully better at connecting images to real locations and scenarios. In my testing this all worked well, with a few caveats here and there. Consistency from one prompt to the next, for instance, continues to be a challenge for all gen AI models in my experience.

Best-in-Class Text Rendering

“Nano Banana Pro is Google’s best text-rendering model yet.”

Signs, labels, menus, posters and storefronts look much more believable. The improvement is noticeable even on casual tests. For Stark Insider, where we often generate mock film posters or visual concepts for short films, legible text makes a big difference. I formerly painstakingly created this stuff in local Starbucks plugging away in Illustrator. The results were decent at times, but now I can iterate so fast, and get all the proper text in place so easily that it ultimately frees up more of my time. And yours too!

Advanced Creative Controls

One of the biggest surprises is the new reference system. You can supply up to 14 images and maintain visible consistency for up to 5 unique people.

This supports:

  • Character continuity in storyboards
  • Visual development for short films
  • Continuity planning for shoots
  • Product angle consistency for reviews or mockups

It’s not quite at the artistic level of Midjourney, but it sits in an interesting place for creators who need accuracy, not just aesthetic flair.

The feature is slightly reminiscent of Google’s Mixboard. If you haven’t tried that one you should give it a go.

My Hands-On Tests: A Quick Reality Check

I tested several prompts to understand the model better. This wasn’t a full review, just quick first impressions.

Standard Holiday Scene: Solid Results

Create a snowy wonderland scene with a cute cabin, some Christmas decorations. A billboard in the distance says WELCOME TO SNOWY HAVEN. Children play. A dog watches from a window. Some people are running around nervously because an APB issued suggested that a criminal may be on the loose nearby. Also there’s an ornery film critic (who refused to place Messiah of Evil on to his best horror movies list) observing the entire situation unfold!

Cozy snowy winter village scene with children building a snowman, a dog in the cabin window, and townspeople reacting to an APB alert.

The initial prompt described a snowy cabin, children playing, a dog in the window, and a “WELCOME TO SNOWY HAVEN” billboard. Nano Banana Pro handled the composition well. The billboard text was clean and readable.

Surreal Version: Surprisingly Good

Thanks. Ok, let’s try another version, this time more surreal. Like Bergman, Lynch and Fellini in a blender.

Alternate version of the Snowy Haven winter scene with a cozy cabin, a dog in the window, children playing, and townspeople reacting to distant danger.

Alternate surreal Snowy Haven scene with large eye imagery, eerie winter village, and stylized motion words like “Rumble” and “Squish.”

I asked for a surreal version inspired by Bergman, Lynch, and Fellini. The result had an atmospheric, slightly dreamlike quality. Not mind-blowing, but definitely more expressive than earlier Google models. I especially appreciate the reversed words on the billboard.

Multi-Panel Comic Layout: Complete Miss

Nice. Alright, let’s do a third version, this time from a different camera angle, and also using multiple-panels to set the scene in motion like a comic book. Maybe 3-5 panels.

Chaotic multi-panel comic layout produced by Nano Banana Pro, with warped panels, distorted characters, and surreal winter imagery.

Next, I asked for a new camera angle and a layout similar to a comic book with three to five panels. This didn’t work at all. Panels merged into one another, perspective collapsed, and continuity disappeared.

Worth noting: ChatGPT produces far stronger multi-panel outputs. Nano Banana Pro simply isn’t ready for this use case.

Criterion Collection Mockup: Amusing but Unconvincing

Ok now create a Criterion Blu-Ray with artwork for this upcoming movie release thanks!

Surreal Criterion-style Blu-ray mockup titled “Snowy Haven,” featuring a clown critic, melting clocks, swirling sky, and distorted comic-style panels.

Finally, I asked for a faux Criterion Blu-ray cover. The design didn’t resemble a Criterion release; it felt generic and lacked clear visual identity. Also, the some of the text did not turn out well (see back cover copy). The most entertaining part? The model added “A Film by Lars von Trier” without being asked. That alone deserves an award for sheer audacity. Actually, I need to see this film.

Where Nano Banana Pro Fits in the AI Landscape

Google isn’t trying to match Midjourney’s artistic output, at least not yet. Instead, Nano Banana Pro highlights areas where Google already has advantages: search, accuracy, multilingual text, and seamless integration with existing workflows. I think this is a smart approach.

“Google is betting that Gemini 3’s reasoning edge will differentiate Nano Banana Pro from competitors. Early results suggest they may be right.”

The strategy is clear: provide an image generator that slots directly into Workspace, Ads, and enterprise pipelines. Midjourney remains the stronger choice for abstract artistry. Nano Banana Pro, however, is quickly becoming an appealing option for creators who need clear text, consistent branding, and reliable visual reasoning.

What You Can Actually Do With It

Here are some practical benefits off the top of my mind:

  • Storyboards with consistent characters and settings
  • Infographics with readable labels and clean layout
  • Product shots from multiple angles for reviews or SEO
  • Localized imagery with proper text translation
  • Workspace integration for Slides, Docs, and Vids
  • Structured templates for Google Ads
  • Color palettes and visual concepts for film development

This is the kind of tool that speeds up the early stages of creative planning.

Final Take: A Step Forward, Not a Leap

Nano Banana Pro represents a clear improvement. It’s sharper, more consistent, and noticeably better at text. The reasoning improvements from Gemini 3 Pro help with scene accuracy and environmental details.

Complex layouts, such as comic-style panels, still need work, at least based on my preliminary tests. Also note: free-tier users will run into quota limits fairly quickly. This is increasingly par for the course. Gen AI requires a massive amount of compute.

Overall, this is a meaningful step forward. Google is building toward a creative platform, not just another image generator. For filmmakers, designers, and anyone crafting visual stories, Nano Banana Pro is worth watching as it evolves.

Along with Claude Code, and ChatGPT, I can easily see Nano Banana Pro slotting into our workflow here at Stark Insider.

Clinton Stark
Filmmaker and editor at Stark Insider, covering arts, AI & tech, and indie film. Inspired by Bergman, slow cinema and Chipotle. Often found behind the camera or in the edit bay. Peloton: ClintTheMint.