Back in May, OpenAI announced that it was forming a new Safety and Security Committee (SSC) to evaluate its current processes and safeguards and recommend changes. At the time, the company said the SSC would conduct its evaluation over 90 days and then present its findings to the board.
Now that the process is complete, OpenAI is sharing five changes it will make based on the SSC's evaluation.
First, the SSC will become an independent oversight committee of the OpenAI board, continuing to provide independent governance on safety and security. The board committee will be led by Zico Kolter, director of the Machine Learning Department in the School of Computer Science at Carnegie Mellon University. Other members include Adam D'Angelo, co-founder and CEO of Quora; Paul Nakasone, a retired US Army general; and Nicole Seligman, former EVP and general counsel of Sony Corporation.
The board committee has already reviewed the safety of the o1 release and will continue reviewing future releases both during development and after launch. It will also have oversight of model launches, with the authority to delay a release over safety concerns until those concerns have been sufficiently addressed.
Second, the SSC will work to advance the company's security measures by expanding internal information segmentation, adding staff to deepen its around-the-clock security operations teams, and continuing to invest in initiatives that enhance the security of the company's research and product infrastructure.
"Cybersecurity is a critical component of AI safety, and we've been a leader in defining the security measures that are needed for the protection of advanced AI. We will continue to take a risk-based approach to our security measures, and evolve our approach as the threat model and the risk profiles of our models change," OpenAI wrote in a post.
The third recommendation is that the company be more transparent about the work it is doing. It already produces system cards that detail the capabilities and risks of its models, and it will continue evaluating new ways to share and explain its safety work.
Its system cards for the GPT-4o and o1-preview releases included the results of external red teaming, results of frontier risk evaluations within the Preparedness Framework, and an overview of risk mitigations built into the systems.
Fourth, it will explore new ways to independently test its systems by collaborating with more external organizations. For instance, OpenAI is building new partnerships with safety organizations and non-governmental labs to conduct model safety assessments.
It is also working with government organizations such as Los Alamos National Laboratory to study how AI can be used safely in labs to advance bioscientific research.
OpenAI also recently made agreements with the U.S. and U.K. AI Safety Institutes to work on researching emerging AI safety risks.
The final recommendation by the SSC is to unify the company's safety frameworks for model development and monitoring.
"Ensuring the safety and security of our models involves the work of many teams across the organization. As we've grown and our work has become more complex, we are building upon our model launch processes and practices to establish an integrated safety and security framework with clearly defined success criteria for model launches," said OpenAI.
The framework will be based on risk assessments by the SSC and will evolve as complexity and risks increase. To help with this process, the company has already reorganized its research, safety, and policy teams to improve collaboration.