SAN FRANCISCO — OpenAI’s most recent version to its artificial intelligence model can emulate human cadences in spoken responses and even attempt to discern people’s emotions.
The effect is reminiscent of Spike Jonze’s 2013 film Her, in which the (human) main character falls in love with an artificially intelligent operating system, which results in some issues.
Open AI Launches GPTo, Improving ChatGPT’s Text, Visual And Audio Capabilities
While few will find the new model appealing, Open AI claims it performs quicker than earlier versions and can reason across text, audio, and video in real-time.
GPT-4o, which stands for “omni,” will power Open AI’s popular ChatGPT chatbot and will be available to everyone, including those using the free version, in the coming weeks, the company revealed during a brief live-streamed update. CEO Sam Altman, who was not a presenter at the event, put the word “her” on the social networking platform X.
During a presentation with Chief Technology Officer Mira Murati and other executives, the AI bot interacted in real time, adding emotion — especially “more drama” — to its voice when asked. It also helped guide through the processes required to solve a simple math issue without initially spitting out the answer, as well as a more difficult software coding challenge on a computer screen.
It also attempted to predict a person’s emotional state by analyzing a selfie video of their face (deciding he was pleased because he was smiling) and translated English and Italian to demonstrate how it could help people who speak different languages communicate.
Open AI Launches GPTo, Improving ChatGPT’s Text, Visual And Audio Capabilities
Gartner analyst Chirag Dekate said the upgrade, which lasted less than 30 minutes, created the impression that Open AI is catching up to larger competitors.
“Many of the demos and capabilities showcased by Open AI seemed familiar because we had seen advanced versions of these demos showcased by Google in their Gemini 1.5 pro launch,” Dekate stated. “While OpenAI had a first-mover advantage last year with ChatGPT and GPT3, when compared to their peers, especially Google, we now are seeing capability gaps emerge.”
Open AI Launches GPTo, Improving ChatGPT’s Text, Visual And Audio Capabilities
Google plans to have its I/O developer conference on Tuesday and Wednesday, which will likely reveal upgrades to its own Gemini model.
SOURCE – (AP)