Google's Gems, a feature that lets users customize the company's Gemini chatbots as AI experts on any topic, will soon be available to Gemini Advanced subscribers.
Google made the announcement in a blog post on Wednesday, also saying that its latest image generation model, Imagen 3, will be released to all Gemini users soon.
These features were previewed at Google's I/O 2024 developer conference in May, where AI was a major theme. Google showed off updates to make the Gemini chatbot more conversational and overall smarter, declaring this the “Gemini era.” But just as Google fell behind ChatGPT maker OpenAI in releasing its generative AI chatbot, it's now following OpenAI's lead again by rolling out the ability to create custom versions of its bots.
jewelry
Gemini Advanced subscribers, who pay $20 per month for access to the Gemini 1.5 Pro model, will get access to Gems “in the coming days,” Dave Citron, senior director of product management for Gemini Experiences, said in the post.
Gems act as topic experts or cheerleaders for goals such as reviewing projects, brainstorming ideas for events, writing captions for social media posts, and acting as gym buddies.
To create a Gem, you give it instructions, give it a name, and chat with it; it remembers your instructions so you don't have to start over with each interaction, like with traditional chatbots, Citron says.
Google has also released pre-made Gems that include a learning coach, brainstorming, career guide, writing editor, coding partner, and more.
According to the article, Gems will be available in 150 countries and “most languages,” and will also be available for business and enterprise users.
In November, OpenAI unveiled GPTs, its own version of customized chatbots.
Like Gems, OpenAI's equivalent tool lets you create GPTs for specific purposes, like learning the rules of a board game, teaching math to kids, or designing stickers. First, you give it instructions and choose what it can do, and you're ready to go. OpenAI's GPT store currently has millions of GPTs, including a Canva design tool, a fitness trainer, and a haiku writer.
Image 3
Google also announced that the Imagen 3 image generation model will be available to all Gemini users in the coming days.
At I/O, Google said the new model can create more photorealistic images, including details like sunlight and animal whiskers, and in longer prompts, remembers to include specific details like wildflowers and birds. Additionally, Imagen 3 can generate styles like “photorealistic landscapes, textured oil paintings, and whimsical claymation scenes,” the post said.
Citron noted that Google has made “significant progress” in generating images of people with Imagen 3. But the model doesn't generate realistic images of identifiable individuals, minors, or excessively gory, violent, or sexual scenes.
Imagen 3 uses the SynthID tool to watermark AI-generated images.
Gemini Advance
In addition to access to Gems and Gemini 1.5 Pro, Gemini Advanced subscribers are given a window of 1 million tokens, which CEO Sundar Pichai said in May would increase to 2 million tokens later this year.
Large language models like Gemini break down words into tokens so that they can understand queries. The tokens in the context window help the model remember things. The more tokens there are, the better the memory. According to Google's numbers, a context window of 1 million tokens can ingest 1,500 pages of text.
But in his review of Gemini Advanced, CNET's Imad Khan said the model was plagued with errors and wasn't comparable to OpenAI's ChatGPT Plus subscription, which also costs $20 per month.