OpenAI announces new multimodal desktop GPT with new voice and vision capabilities – Computerworld


coder_prem



Chirag Dekate, a Gartner vice president analyst, said that while he was impressed with OpenAI’s multimodal large language model (LLM), the company was clearly playing catch-up to competitors, in contrast to its earlier status as an industry leader in generative AI tech.

“You’re now starting to see GPT enter into the multimodal era,” Dekate said. “But they’re playing catch-up to where Google was three months ago when it announced Gemini 1.5, which is its native multimodal model with a one-million-token context window.”

Still, the capabilities demonstrated by GPT-4o and its accompanying ChatGPT chatbot are impressive for a natural language processing engine. It showed improved conversational ability: users can interrupt it to begin new or modified queries, and it is versed in 50 languages. In one onstage live demonstration, the Voice Mode translated back and forth between Murati speaking Italian and Barret Zoph, OpenAI’s head of post-training, speaking English.

During a live demonstration, Zoph also wrote out an algebraic equation on paper while ChatGPT watched through his phone’s camera lens. Zoph then asked the chatbot to talk him through the solution.

While the voice recognition and conversational interactions were extremely human-like, the interactive bot also glitched noticeably, cutting out during conversations and picking things back up moments later.

The chatbot was then asked to tell a bedtime story. The presenters were able to interrupt the chatbot and have it add more emotion to its voice intonation and even switch to a computer-like rendition of the story.
