Google Gemini’s AI image model gets a ‘bananas’ upgrade

Google is upgrading its Gemini chatbot with a new AI image model that gives users finer control over editing photos, a step meant to catch up with OpenAI’s popular image tools and draw users from ChatGPT.

The update, called Gemini 2.5 Flash Image, rolls out starting Tuesday to all users in the Gemini app, as well as to developers via the Gemini API, Google AI Studio, and Vertex AI platforms.

Gemini’s new AI image model is designed to make more precise edits to images — based on natural language requests from users — while preserving the consistency of faces, animals, and other details, something that most rival tools struggle with. For instance, ask ChatGPT or xAI’s Grok to change the color of someone’s shirt in a photo, and the result might include a distorted face or an altered background.

Google’s new tool has already drawn attention. In recent weeks, social media users raved over an impressive AI image editor on the crowdsourced evaluation platform LMArena. The model appeared to users anonymously under the pseudonym “nano-banana.”

strange object spotted under the microscope over the weekend in the lab… pic.twitter.com/t1SBhqAnL0— Demis Hassabis (@demishassabis) August 25, 2025

Google says it’s behind the model (if it wasn’t obvious already from all the banana-related hints), which is really the native image capability within its flagship Gemini 2.5 Flash AI model. Google says the image model is state-of-the-art on LMArena and other benchmarks.

“We’re really pushing visual quality forward, as well as the model’s ability to follow instructions,” said Nicole Brichtova, a product lead on visual generation models at Google DeepMind, in an interview with TechCrunch.

“This update does a much better job making edits more seamlessly, and the model’s outputs are usable for whatever you want to use them for,” said Brichtova.

AI image models have become a critical battleground for Big Tech. When OpenAI launched GPT-4o’s native image generator in March, it drove ChatGPT’s usage through the roof thanks to a frenzy of AI-generated Studio Ghibli memes that, according to OpenAI CEO Sam Altman, left the company’s GPUs “melting.”

To keep up with OpenAI and Google, Meta announced last week that it would license AI image models from the startup Midjourney. Meanwhile, the a16z-backed German unicorn Black Forest Labs continues to dominate benchmarks with its FLUX AI image models.

Perhaps Gemini’s impressive AI image editor can help Google close its user gap with OpenAI. ChatGPT now logs more than 700 million weekly users. On Google’s earnings call in July, the tech giant’s CEO Sundar Pichai revealed that Gemini had 450 million monthly users, implying that its weekly user count is lower still.

Brichtova says Google specifically designed the image model with consumer use cases in mind, such as helping users visualize their home and garden projects. The model also has better “world knowledge” and can combine multiple references in a single prompt; for example, merging an image of a sofa, a living room photo, and a color palette into one cohesive render.

While Gemini’s new AI image generator makes it easier for users to make and edit realistic images, the company has safeguards that limit what users can create. Google has struggled with AI image generator safeguards in the past. At one point, the company apologized for Gemini generating historically inaccurate pictures of people, and rolled back the AI image generator altogether.

Now, Google feels that it’s struck a better balance.

“We want to give users creative control so that they can get from the models what they want,” said Brichtova. “But it’s not like anything goes.”

The generative AI section of Google’s terms of service prohibits users from generating “non-consensual intimate imagery.” Those same kinds of safeguards don’t seem to exist for Grok, which allowed users to create AI-generatedexplicit imagesresembling celebrities, such as Taylor Swift.

To address the rise of deepfake imagery, which can make it hard for users to discern what’s real online, Brichtova says that Google applies visual watermarks to AI-generated images, as well as identifiers in their metadata. However, someone scrolling past an image on social media may not look for such identifiers.

Source: TechCrunch
