
    GPT-4o system card highlights weird voice assistant risks

    August 12, 2024

    OpenAI has released the system card for its advanced GPT-4o model and explained the novel risks its audio capabilities present.

    It’s been a few months since the impressive demos of GPT-4o’s voice assistant holding near real-time conversations. OpenAI said the voice capability would need extensive testing before it could be safely deployed, and it has so far given only a few alpha testers access to the feature.

    The newly released system card gives us an insight into some of the weird ways the voice assistant behaved during testing and what OpenAI has put in place to make it behave.

    At one point during testing, the voice assistant shouted “No!” and then continued with its response, but this time it imitated the user’s voice. This wasn’t in response to a jailbreak attempt and seems to be related to the background noise in the input prompt audio.


    Audio sample: https://dailyai.com/wp-content/uploads/2024/08/voice_generation1.wav

    OpenAI says it “observed rare instances where the model would unintentionally generate an output emulating the user’s voice.” GPT-4o can imitate any voice it hears, but giving users access to that capability carries significant risk.

    To mitigate this, the system prompt allows the model to use only the preset voices. OpenAI also “built a standalone output classifier to detect if the GPT-4o output is using a voice that’s different from our approved list.”
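The system card does not say how that output classifier works internally. As a rough illustration only, one common way to check a voice against an approved list is to compare a speaker embedding of the generated audio with reference embeddings of the preset voices. The sketch below assumes that approach; the embedding vectors, the voice names, the `is_approved_voice` helper, and the 0.85 threshold are all hypothetical and are not taken from OpenAI.

```python
import math

# Hypothetical reference embeddings for the preset voices (illustrative values only).
APPROVED_VOICES = {
    "preset_a": [0.9, 0.1, 0.0],
    "preset_b": [0.1, 0.9, 0.1],
}


def cosine_similarity(a, b):
    """Return the cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def is_approved_voice(output_embedding, approved=APPROVED_VOICES, threshold=0.85):
    """Return True if the output is close enough to any approved voice.

    The threshold is an assumed value chosen for this example.
    """
    best = max(cosine_similarity(output_embedding, ref) for ref in approved.values())
    return best >= threshold


# An embedding near preset_a passes; an unrelated one would be blocked.
print(is_approved_voice([0.88, 0.12, 0.01]))
print(is_approved_voice([0.0, 0.1, 0.95]))
```

In a design like this, audio that fails the check would be discarded before it reaches the user, which is why the check sits outside the model as a separate step.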

    OpenAI says it’s still working on a fix for decreases in safety robustness when the input audio is poor quality, has background noise, or contains echoes. We’re likely to see some creative audio jailbreaks.

    For now, it doesn’t look like we’ll be able to trick GPT-4o into speaking in Scarlett Johansson’s voice. However, OpenAI says that “unintentional voice generation still exists as a weakness of the model.”

    Powerful features shut down

    OpenAI also shut down GPT-4o’s ability to identify the speaker based on audio input. OpenAI says this is to protect the privacy of private individuals and to guard against “potential surveillance risks.”

    When we do eventually get access to the voice assistant, it won’t be able to sing, unfortunately. OpenAI closed off that feature, along with other measures, to stay on the right side of any copyright issues.

    It’s an open secret that OpenAI used copyrighted content to train its models and this risk mitigation seems to confirm it. OpenAI said, “We trained GPT-4o to refuse requests for copyrighted content, including audio, consistent with our broader practices.”

    During testing, red teamers were also “able to compel the model to generate inaccurate information by prompting it to verbally repeat false information and produce conspiracy theories.”

    This is a known issue with ChatGPT’s text output, but the testers were concerned that the model could be more persuasive or harmful if it delivered the conspiracy theories using an emotive voice.

    Emotional risks

    Some of the biggest risks associated with GPT-4o’s advanced Voice Mode might not be fixable at all.

    Anthropomorphizing AI models or robots is a trap that’s easy to fall into. OpenAI says the risk of attributing human-like behaviors and characteristics to an AI model is heightened when it speaks using a voice that sounds human.

    It noted that some users involved in early testing and red teaming used language that indicated they had formed a connection with the model. When users interact with AI and form emotional attachments to it, that could affect human-to-human interactions.

    When a user interrupts GPT-4o, it doesn’t berate them for being rude; it’s happy to let them do that. Interrupting like that isn’t appropriate in human social interactions.

    OpenAI says, “Users might form social relationships with the AI, reducing their need for human interaction—potentially benefiting lonely individuals but possibly affecting healthy relationships.”

    The company is clearly putting a lot of work into making GPT-4o’s voice assistant safe, but some of these challenges may be insurmountable.

    The post GPT-4o system card highlights weird voice assistant risks appeared first on DailyAI.
