ai safety research - Search News

AI, o3 and o1

OpenAI trained o1 and o3 to ‘think’ about its safety policy

OpenAI announced a new family of AI reasoning models on Friday, o3, which the startup claims to be more advanced than o1 or anything else it’s released. These improvements appear to have come from scaling test-time compute,

Hosted on MSN · 5h

OpenAI to advance o1 and o3 AI models with new safety training paradigm

OpenAI announced the release of a new family of AI models, dubbed o3. The company claims the new products are more advanced than its previous models, including o1. The advancements, according to the startup,

DMR News (English) on MSN · 12h

OpenAI Announces o3 Model Launching Next Year

OpenAI is set to introduce its next-generation AI model, o3, early next year, with safety researchers gaining access to the technology. The announcement, made during OpenAI’s “12 Days of OpenAI” livestream,

HT Tech on MSN · 14h

OpenAI introduces new AI models, o3 and o3 mini: Know their capabilities and launch timeline

OpenAI o3 and o3 mini capabilities are showcased during “12 Days of OpenAI”. Know how these powerful AI models are closer to AGI.

Hosted on MSN · 13h

OpenAI announces o3, o3 Mini AI models; due for release in 2025: Details

As OpenAI’s ‘Shipmas’ comes to an end, the artificial intelligence (AI) company announced its latest launch. The company said on December 20 that it was testing new reasoning AI models named o3 and o3 mini.

Every on MSN · 1d

Everything You Need to Know About OpenAI's New Model, o3

Context Window Hello, and happy Sunday! It’s yet another week in which a new best AI model, OpenAI's o3, was released. Scroll down to read about it in Alex Duffy's analysis of the company's 12 days of Shipmas.

The Fundamentals Of Designing Autonomy Evaluations For AI Safety

Developing AI safety tests offers opportunities to meaningfully contribute to AI safety while advancing our understanding of ...

5d on MSN

Exclusive: New Research Shows AI Strategically Lying

Experiments by Anthropic and Redwood Research show how Anthropic's model, Claude, is capable of strategic deceit ...

cc.gatech.edu · 11d

Research in AI Safety Lands Recent Graduate on Forbes 30 Under 30

Forbes named a recent Georgia Tech graduate to its 30 Under 30 in Science for 2025. Announced days before Fall 2024 ...

openaccessgovernment · 7h

North American research special focus

Funding and advancing science in technology standards, artificial intelligence, the humanities, social sciences, and ...

1d on MSN

AI Models Were Caught Lying to Researchers in Tests — But It's Not Time To Worry Just Yet

OpenAI's o1 model, which users can access on ChatGPT Pro, showed "persistent" scheming behavior, according to Apollo Research ...

Tech Xplore on MSN · 3d

Can we convince AI to answer harmful requests?

New research from EPFL demonstrates that even the most recent large language models (LLMs), despite undergoing safety ...

AI Safety Clock Ticks Closer To ‘Midnight,’ Signifying Rising Risk

IMD business school created an AI Safety Clock — similar to the Doomsday Clock — ticking toward global calamity. While not as ...

6d on MSN

New Tests Reveal AI’s Capacity for Deception

A paper by Apollo Research found that in certain contrived scenarios, AI systems can engage in deceptive behavior.

New Anthropic study shows AI really doesn’t want to be forced to change its views

A study from Anthropic's Alignment Science team shows that complex AI models may engage in deception to preserve their ...

AI News Round-Up 2024: 10 Biggest Stories That Dominated the Year

TechRepublic looks back at the biggest AI news of 2024, from Apple putting AI in phones to global governments weighing in.

18h

10 AI Predictions For 2025

Meta is the world’s standard bearer for open-weight AI. In a fascinating case study in corporate strategy, while rivals like ...
