So far, ChatGPT users have been able to use Voice Mode to talk to the chatbot and get answers to their queries. However, the average latency is around 2.8 seconds with GPT-3.5 and around 5.4 seconds with GPT-4. While Voice Mode works, it doesn’t feel as natural and intuitive as having a regular conversation, something OpenAI has improved with its latest GPT-4o model.
What Is OpenAI’s GPT-4o?
OpenAI’s GPT-4o is a multimodal model that can interact via text, visuals, or audio. According to the official release, the new model can respond to audio inputs in as little as 232 milliseconds (around 0.2 seconds), with an average of 320 milliseconds, which is similar to human response time in a conversation. The model matches GPT-4 Turbo performance on text in English and on code, with significant improvements in non-English languages.
The current Voice Mode relies on a pipeline of three separate models: the first transcribes audio to text, the second (GPT-3.5 or GPT-4) answers the query, and the third converts the text response back to audio. In the process, the main model can’t observe users’ tone, distinguish multiple speakers, or hear background noises, and it can’t express emotion, either.
While one might argue whether this is a genuine problem, OpenAI seems to have solved it with GPT-4o. The new tool is a single model trained end-to-end across text, vision, and audio, so it takes the input as text or audio, answers the query, and relays the response in the user’s desired output format. That’s how GPT-4o functions differently from the current model.
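To make the contrast concrete, here is a minimal sketch of what the old three-stage pipeline looks like when rebuilt on OpenAI’s public Python SDK. This is an illustrative assumption, not OpenAI’s internal implementation; the model names (whisper-1, gpt-4-turbo, tts-1) are just plausible stand-ins for the three stages.

```python
# Illustrative sketch of the legacy three-model Voice Mode pipeline,
# rebuilt with OpenAI's public Python SDK. NOT OpenAI's internal
# implementation; model choices here are assumptions for demonstration.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def voice_mode_pipeline(audio_path: str, out_path: str = "reply.mp3") -> str:
    # Stage 1: transcribe the user's speech to text. Tone, multiple
    # speakers, and background noise are all lost at this step.
    with open(audio_path, "rb") as f:
        transcript = client.audio.transcriptions.create(
            model="whisper-1", file=f
        )

    # Stage 2: answer the transcribed query with a text-only model,
    # which never sees the original audio.
    answer = client.chat.completions.create(
        model="gpt-4-turbo",
        messages=[{"role": "user", "content": transcript.text}],
    ).choices[0].message.content

    # Stage 3: synthesize the text answer back into speech. The TTS
    # model cannot mirror the user's emotion, since it only gets text.
    speech = client.audio.speech.create(
        model="tts-1", voice="alloy", input=answer
    )
    speech.stream_to_file(out_path)
    return out_path
```

Every hop in that chain adds latency and throws away information, which is precisely what a single audio-in, audio-out model like GPT-4o avoids.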
GPT-4o New Features
Since GPT-4o handles text, audio, and visuals natively, it opens up new ways of interacting with ChatGPT. For example, you could upload an image and discuss it with the AI model (see the API sketch after the list below), or you could ask it to recognize something on the screen and provide more information about it. Here’s a list of all the features that GPT-4o will provide.
- GPT-4 level intelligence
- Responses from the model and the web
- Analyze data and create charts
- Chat about photos
- Upload files for assistance in summarizing, writing, or analyzing
- Discover and use GPTs
- Build a more helpful experience with Memory
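As an illustration of the “chat about photos” capability, here is a minimal sketch using the public Chat Completions API, which accepts image inputs for GPT-4o. The image URL and the prompt are placeholders, not part of OpenAI’s announcement.

```python
# Minimal sketch: discussing a photo with GPT-4o through the public
# Chat Completions API. The image URL and prompt are placeholders.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What is shown in this photo?"},
                {
                    "type": "image_url",
                    "image_url": {"url": "https://example.com/photo.jpg"},
                },
            ],
        }
    ],
)
print(response.choices[0].message.content)
```

The same messages array can mix multiple text and image parts, so a follow-up question about the photo is simply another turn in the conversation.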
During the Spring Update launch event, the company showcased GPT-4o in several demo videos. In these videos, the model, running on a smartphone, recognized real-world objects, people, and their surroundings while answering users’ queries. However, not all of GPT-4o’s abilities will immediately make it to users’ phones; for now, OpenAI is rolling out the upgraded text and image capabilities.
In the coming days, OpenAI will release the audio and vision capabilities. What’s important is that unlike GPT-4, GPT-4o will be available to all ChatGPT users without a subscription fee. Even so, ChatGPT Plus users will get a message limit up to five times higher.
ChatGPT Gets A New Desktop App For Simplified Usage
Apart from GPT-4o, OpenAI also released a new ChatGPT desktop app, starting with macOS. Per CTO Mira Murati, the app features refreshed UI elements that aim to make interactions more natural. It also supports a new keyboard shortcut (Option + Space) that lets users ask ChatGPT a question instantly. “You can now have voice conversations with ChatGPT directly from your computer, starting with Voice Mode that has been available in ChatGPT at launch,” reads the official blog post.