OpenAI has unveiled GPT-4o,Watch I Don't Love You Yet Online a new AI model that combines text, vision, and audio.
At its highly anticipated livestream event, OpenAI CTO Mira Murati shared that GPT-4o can process text, audio, and vision in one model. GPT-4o will be available for free to all ChatGPT users. It is also available in the API, and is half the price and twice as fast as GPT -4 Turbo. The "o" in the name stands for "omni," referencing its combined modalities in one model.
SEE ALSO: Apple and OpenAI 'finalizing terms' for bringing ChatGPT to iOS 18The announcement confirmed previous rumors of a voice assistant. Previously, there were separate models for voice and image modalities. But GPT-4o is "natively multimodal," said OpenAI CEO Sam Altman on X.
This Tweet is currently unavailable. It might be loading or has been removed.
Now, the GPT-4o brings the modalities together, decreasing lag and making it responsive in real-time. That means you can interrupt the model. It can also sense emotions and tones and express its own emotions and tones, making it sound extremely dramatic or robotic. It can even sing (if you want it to).
The soothing female voice used in the demo also sounds a lot like Scarlett Johansson's voice assistant character in the film Her.
Another demo showcased GPT-4o's ability to help with math problems using its vision modality. It can walk the user through a basic math problem when solving for X. By highlighting code on the screen, ChaGPT with GPT-4o can process and understand what the code is and help improve it.
From user inquiries, ChatGPT with GPT-4o showed off its ability to translate in real-time and understand emotions.
This Tweet is currently unavailable. It might be loading or has been removed.
Murati started the event by sharing the availability of a new desktop app.
Previously, OpenAI was rumored to announce a ChatGPT search engine or a new transformer model GPT-5 ahead of Google I/O. CEO Sam Altman shot down those rumors ahead of Monday's event, but they are still believed to be in development.
Topics ChatGPT OpenAI
(Editor: {typename type="name"/})
Trump's foreign aid freeze halts funding for digital diplomacy bureau
Scenes from the Brooklyn Botanic Garden in Wintertime
Windows on the World: The View from Himeji City, Japan
Nishioka vs. Alcaraz 2025 livestream: Watch Australian Open for free
Say “I Love You” with Vintage Issues of “The Paris Review”
Google 'Ask for me:' AI that calls businesses on your behalf for pricing and availability
History and Mystery: A Century of Chinese Photobooks
接受PR>=1、BR>=1,流量相当,内容相关类链接。