ChatGPT-4o vs Gemini Live.

Abith Ahamed
3 min read · May 17, 2024

We are living in the era of the AI revolution. Tech giants and startups are racing to build AI products. OpenAI and Google have announced new AI capabilities with GPT-4o and Gemini Live. Both chatbots can now respond in real time, as the demos showed. They can also handle more advanced tasks, such as reading the environment through a video feed and giving feedback, detecting the things around us, explaining what we want to know, and translating in real time. AI has evolved quickly over the past few years, and that helps a lot in day-to-day life.

Now, let's take a closer look at each service.

ChatGPT-4o.

GPT-4o gives a much more natural, human-like interaction. We can give it any combination of text, image, or audio input and get output back. It matches GPT-4 Turbo performance on English text and code, and it improves on non-English text. The exciting part is that this model understands audio and visuals better than existing models. The existing voice mode with GPT-3.5 and GPT-4 uses a pipeline of three models to handle voice input: one converts the voice to text, one produces a text reply, and one converts that reply back to voice. Users can experience a delay in this process, and some information is lost along the way.

But with GPT-4o, all inputs and outputs are processed by the same neural network, so it can respond in real time and express laughter, singing, and other emotions.
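To make the older three-model voice pipeline easier to picture, here is a minimal Python sketch. It is only a toy illustration, not OpenAI's actual code: the `AudioClip` class and the functions `transcribe`, `generate_reply`, and `synthesize` are hypothetical stand-ins for the three models. It shows why information such as tone can get lost when only plain text passes between the stages.

```python
from dataclasses import dataclass


@dataclass
class AudioClip:
    words: str  # what was said
    tone: str   # how it was said, e.g. "excited"


def transcribe(clip: AudioClip) -> str:
    """Stage 1 (hypothetical stub): speech-to-text keeps only the words."""
    return clip.words


def generate_reply(text: str) -> str:
    """Stage 2 (hypothetical stub): a text-only model answers the text."""
    return f"You said: {text}"


def synthesize(text: str) -> AudioClip:
    """Stage 3 (hypothetical stub): text-to-speech in a fixed neutral voice."""
    return AudioClip(words=text, tone="neutral")


def cascaded_voice_reply(clip: AudioClip) -> AudioClip:
    """Chain the three stages; only plain text passes between them."""
    return synthesize(generate_reply(transcribe(clip)))


question = AudioClip(words="I passed my exam!", tone="excited")
answer = cascaded_voice_reply(question)
print(answer.words)  # You said: I passed my exam!
print(answer.tone)   # neutral
```

The speaker's excited tone never reaches the second stage, so the reply cannot react to it. A single model that takes audio in and gives audio out, as described for GPT-4o, does not have this text-only bottleneck.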

Gemini Live.

Google announced a competitor to GPT-4o called Gemini Live at its Google I/O event. It is part of Project Astra, which is coming to smart glasses in the future. But for now, it is limited to smartphones. Unlike GPT-4o, Gemini relies on Imagen 3 and Veo to output images and videos. Users can speak at their own pace and interrupt Gemini to add more information and get clearer answers. Gemini Live will also get Google Lens-style camera features, so Gemini can see the world and provide feedback.

Gemini Live is not yet available, even for premium users. Once it is, users can access it through the Gemini app, which is available on Android and iOS.

Difference between ChatGPT-4o and Gemini Live.

This was a simple introduction to the new AI models announced by OpenAI and Google. We will have to wait and see how they perform in the real world. I think both could be useful for making our work easier. Which one you choose as your AI partner is up to you.

Written by Abith Ahamed

Passionate about technology, networking, and cybersecurity. Network Engineer | Cybersecurity Specialist | Constantly exploring the ever-evolving tech landscape.