Modern Artifacts

Blogging the Contemporary Artifacts of Tomorrow

New Technology

Google Bard is now able to generate images and Gemini Pro has been upgraded to be more capable.

Advertisement: Click here to learn how to Generate Art From Text

Google is updating their search engine. Bard AI chatbot to step up its competition with rival OpenAI’s ChatGPT. The Sundar Pichai-led Internet giant announced today that it will expand Bard to include image generation capabilities powered by its Imagen 2 AI model as well as an enhanced version of Gemini Pro.

The move gives more people access to Bard’s AI smarts, including a new free tool to create AI images.

“These updates make Bard an even more helpful and globally accessible AI collaborator for everything from big, creative projects to smaller, everyday tasks,” Jack Krawczyk, product lead for Bard, noted in a blog post.

Separately the company announced that it is also experimenting with a second image generator, called ImageFXStarting today, you can start using.

VB Event

The AI Impact Tour – NYC

We’ll be in New York on February 29 in partnership with Microsoft to discuss how to balance risks and rewards of AI applications. Please fill out the form below to request an invitation to this exclusive event.

 


Request an invitation

Gemini Pro supports multiple languages

Google has been around for over a month. Gemini in three sizes: Nano for mobile devices, Pro for more intermediate use cases, and Ultra, what it claimed to be the most powerful and capable large language model (LLM) yet developed by any company — even more powerful than GPT-4 — though this one is not due out until later this year.

Comparisons by third-parties between Gemini Pro and other models, including the most powerful LLM available from Google. found that it actually lags behind even OpenAI’s older GPT-3.5 TurboGoogle’s attempt to prove to the world that they can take on the new entrants in the AI race is a worrying sign. Google did release an improved version of Gemini Pro on Bard in the last monthOnly in English. 

But today’s flurry of new consumer-facing AI announcements should help Google close the gap. The latest update to Bard, Gemini Pro can be downloaded in over 40 languages — including Korean, Spanish, Tamil, Italian and Russian — across more than 230 countries and territories.

This not only gives more people access to Gemini Pro’s advanced understanding, summarizing, reasoning and coding capabilities but also Bard’s double-check feature, which validates a response by searching across the web.

Imagen-2 on Bard will take on ChatGPT+ with DALL-3

It is also bringing in the long-awaited AI Image Generation capabilities. This is being delivered with the help of the Imagen 2 model, which, Google says, can produce high-quality, photorealistic outputs from text inputs, turning Bard into more of a direct and capable competitor to OpenAI’s ChatGPT Plus with DALL-E 3 image generator model, which has been available to users of OpenAI’s subscription tiers since October 2023.

“Just type in a description — like “create an image of a dog riding a surfboard” — and Bard will generate custom, wide-ranging visuals to help bring your idea to life,” Krawczyk noted.

Imagen 2 on Bard

We tested Bard to generate images and found it produced outputs in 30-40 seconds, with good consistency. In some cases, however, it failed to generate the image altogether – even when it did not involve any famed individual, which Google filters out (likely in an effort to avoid scandalous deepfakes similar to What happened to Taylor Swift? and users of Microsoft’s Designer AI image generator powered by OpenAI’s DALL-E 3).

There’s also no support to change the aspect ratio of outputs or any prompt in any other language apart from English at this stage — at least not from our initial usage of the tool.

However, what’s good is that given the AI-generated media and copyright infringementGoogle Bard gives users the option of reporting legal issues relating to data protection, copyright, and other laws, for all generated media.

The company also stated that it limits production of violent, offensive, or sexually explicit contentDeepmind developed SynthIDWatermarks can be embedded into pixels to make them digitally identifiable. This can help people differentiate if a visual has been generated with Google’s AI or an actual human artist.

AI Images: A new way to iterate

Google has announced that in addition to the updates for Bard it is also testing ImageFX. ImageFX is a new tool powered by Imagen 2 for image generation. 

Available today AI Test Kitchen, Google’s app for experimental AI projects, ImageFX tries to spur creative ideas with “expressive chips” that give users adjacent dimensions and suggestions to iterate on their prompt. This feature is also available in other tools, such as Ideogram.

The AI Test Kitchen includes a number of other interesting Google experimental projects, including MusicFXTextFX is an AI experiment for lyricists.

VentureBeat’s missionThis is a digital town square where technical decision-makers can learn about transformative business technology and transact. Discover our Briefings.



LEAVE A RESPONSE

Your email address will not be published. Required fields are marked *