OpenAI has released the system card for its advanced GPT-4o model and explained the novel risks its audio capabilities present.
It’s been a few months since the impressive demos of GPT-4o’s voice assistant engaging in almost real-time dialogue. OpenAI said the voice capability would require extensive testing before it could be safely deployed, and has so far only allowed a few alpha testers access to the feature.
The newly released system card gives us an insight into some of the weird ways the voice assistant behaved during testing and what OpenAI has put in place to make it behave.
At one point during testing, the voice assistant shouted “No!” and then continued with its response, but this time it imitated the user’s voice. This wasn’t in response to a jailbreak attempt and seems to be related to the background noise in the input prompt audio.
https://dailyai.com/wp-content/uploads/2024/08/voice_generation1.wav
OpenAI says it “observed rare instances where the model would unintentionally generate an output emulating the user’s voice.” GPT-4o has the capability to imitate any voice it hears, but the risk of giving users access to this feature is significant.
To mitigate this, the system prompt restricts the model to its preset voices. OpenAI also “built a standalone output classifier to detect if the GPT-4o output is using a voice that’s different from our approved list.”
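OpenAI hasn’t published the classifier’s design, but a common approach to this kind of gate is to compare a speaker embedding of the generated audio against embeddings of the approved preset voices. Here’s a minimal sketch of that idea in Python; the `embed_voice` function is a toy stand-in (a normalized, fixed-length spectrum vector), and the function names and threshold are assumptions, not OpenAI’s implementation.

```python
# Illustrative voice-allowlist gate, not OpenAI's actual classifier.
# A real system would use a trained speaker-verification model in
# place of the toy spectrum-based embedding below.
import numpy as np

def embed_voice(audio: np.ndarray, dim: int = 64) -> np.ndarray:
    """Toy speaker embedding: a normalized, fixed-length magnitude spectrum."""
    spectrum = np.abs(np.fft.rfft(audio))
    # Resample to a fixed length so clips of any duration are comparable.
    fixed = np.interp(np.linspace(0, 1, dim),
                      np.linspace(0, 1, spectrum.size), spectrum)
    norm = np.linalg.norm(fixed)
    return fixed / norm if norm > 0 else fixed

def is_approved_voice(output_audio: np.ndarray,
                      approved: list[np.ndarray],
                      threshold: float = 0.85) -> bool:
    """True if the generated audio is close to any preset voice embedding."""
    emb = embed_voice(output_audio)
    # Embeddings are unit-normalized, so a dot product is cosine similarity.
    return any(float(np.dot(emb, ref)) >= threshold for ref in approved)

# Example: a clip always matches its own embedding.
rng = np.random.default_rng(0)
clip = rng.standard_normal(16_000)          # one second of fake 16 kHz audio
presets = [embed_voice(clip)]
print(is_approved_voice(clip, presets))     # True
```

In a production gate, a failed check would presumably block playback and trigger regeneration; the embedding model and threshold here are purely illustrative.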
OpenAI says it’s still working on a fix for the drop in safety robustness when the input audio is poor quality, has background noise, or contains echoes. We’re likely to see some creative audio jailbreaks.
For now, it doesn’t look like we’ll be able to trick GPT-4o into speaking in Scarlett Johansson’s voice. However, OpenAI says that “unintentional voice generation still exists as a weakness of the model.”
Powerful features shut down
OpenAI also shut down GPT-4o’s ability to identify a speaker based on audio input, saying this protects the privacy of private individuals and mitigates “potential surveillance risks.”
When we do eventually get access to the voice assistant, it won’t be able to sing, unfortunately. OpenAI disabled that feature, among other measures, to stay on the right side of any copyright issues.
It’s an open secret that OpenAI used copyrighted content to train its models, and this risk mitigation seems to confirm it. OpenAI said, “We trained GPT-4o to refuse requests for copyrighted content, including audio, consistent with our broader practices.”
During testing, red teamers were also “able to compel the model to generate inaccurate information by prompting it to verbally repeat false information and produce conspiracy theories.”
This is a known issue with ChatGPT’s text output, but the testers were concerned that the model could be more persuasive or harmful if it delivered conspiracy theories in an emotive voice.
Emotional risks
Some of the biggest risks associated with GPT-4o’s advanced Voice Mode might not be fixable at all.
Anthropomorphizing AI models or robots is a trap that’s easy to fall into. OpenAI says the risk of attributing human-like behaviors and characteristics to an AI model is heightened when it speaks using a voice that sounds human.
It noted that some users involved in early testing and red teaming used language indicating they had formed a connection with the model. When users form emotional attachments to AI, it could affect their human-to-human interactions.
When a user interrupts GPT-4o, it happily lets them rather than berating them for being rude. Interrupting someone mid-sentence isn’t appropriate in human social interactions.
OpenAI says, “Users might form social relationships with the AI, reducing their need for human interaction—potentially benefiting lonely individuals but possibly affecting healthy relationships.”
The company is clearly putting a lot of work into making GPT-4o’s voice assistant safe, but some of these challenges may be insurmountable.