At first, Gemini Live will be available only in English and only to Android users.
Google has introduced a new feature called ‘Gemini Live’ that lets users talk to Gemini as naturally as possible. Google aims to provide an AI-powered personal assistant that can help with complex tasks and save users hours, all while being more natural, conversational and intuitive. The Alphabet-owned company announced the tool on Tuesday.
Users can talk to the generative artificial intelligence tool the same way they would speak to a friend over the phone. The tool supports a back-and-forth conversation in which users can ask for an opinion, brainstorm strategies and share ideas in a natural, interactive way.
The significance of the new feature is that it moves AI from one-way or static conversations toward a more interactive and responsive assistant. During a conversation, users can interrupt to make specific points, or pause the discussion and return to it much later.
To make the conversation feel more natural, Google has introduced 10 new voices that users can choose from, along with their preferred tone and style.
The tool is hands-free, so users can converse with the app in the background even when their phones are locked, much like talking over the phone while driving or doing other tasks. It’s like having a buddy in your pocket with whom you can discuss new ideas or practice an important talk.
The California-based company is aiming for a bigger share of the generative AI market and is competing with Microsoft’s Bing and the Microsoft-backed ChatGPT.
In December, Google launched Gemini 1.0, its first multimodal model, in three sizes: Ultra, Pro and Nano. It was followed by an advanced version, 1.5 Pro, which comes with a context window of 1 million tokens. The latest, 1.5 Flash, is trained on up to a two-million-token context window.
Despite the huge investments, Gemini has yet to break even or contribute significantly to the company’s sales. Google’s services business accounted for 87.2 percent of the company’s total sales in the second quarter of this year. The segment’s revenue rose 11.5 percent on an annual basis to about $73.9 billion.
At first, Gemini Live will be available only in English and only to Android users. Gemini is fully integrated into the Android user experience, offering additional context-aware capabilities that are available only on Android. Gemini provides assistance just when you need it, regardless of what you’re doing on your Android phone.
Google aims to expand the feature to Apple’s iOS operating system and to many more languages in the coming weeks.
Gemini Live is available to users who subscribe to the paid Gemini Advanced tier, which requires a Google One AI Premium subscription costing $19.99 per month. It is available in about 150 countries, including the US, the UK, the UAE and Saudi Arabia.
The introduction of Gemini Live has made a mark in the competitive generative AI market, giving OpenAI’s ChatGPT tough competition. In May, OpenAI introduced GPT-4o, an updated version of the model technology that powers ChatGPT.
Google intends to distinguish Gemini Live from the latest version of ChatGPT by emphasizing natural interaction and personalization.
Its capacity to provide real-time, conversational support, along with customizable voices, establishes it as a serious rival in the AI assistant space. For users, this means more tailored experiences.
Regional players are also taking on the market leaders. In May, Abu Dhabi’s Technology Innovation Institute released Falcon 2, the second edition of its large language model, to compete with models produced by Google and OpenAI. In the same month, Core42, a unit of Abu Dhabi’s AI and cloud business G42, debuted Jais Chat, a bilingual Arabic and English chatbot built in the UAE.