ai safety research - Search News

15hon MSN

OpenAI trained o1 and o3 to ‘think’ about its safety policy

OpenAI announced a new family of AI reasoning models on Friday, o3, which the startup claims to be more advanced than o1 or ...

4don MSN

Exclusive: New Research Shows AI Strategically Lying

Experiments by Anthropic and Redwood Research show how Anthropic's model, Claude, is capable of strategic deceit ...

11h

10 AI Predictions For 2025

Meta is the world’s standard bearer for open-weight AI. In a fascinating case study in corporate strategy, while rivals like ...

1don MSN

AI Models Were Caught Lying to Researchers in Tests — But It's Not Time To Worry Just Yet

OpenAI's o1 model, which users can access on ChatGPT Pro, showed "persistent" scheming behavior, according to Apollo Research ...

cc.gatech.edu10d

Research in AI Safety Lands Recent Graduate on Forbes 30 Under 30

Forbes named a recent Georgia Tech graduate to its 30 Under 30 in Science for 2025. Announced days before Fall 2024 ...

Top AI labs aren’t doing enough to ensure AI is safe, a flurry of recent datapoints suggest

A third-party lab caught OpenAI's o1 model trying to deceive, while OpenAI's safety testing has been called into question.

Tech Xplore on MSN3d

Can we convince AI to answer harmful requests?

New research from EPFL demonstrates that even the most recent large language models (LLMs), despite undergoing safety ...

AI Safety Clock Ticks Closer To ‘Midnight,’ Signifying Rising Risk

IMD business school created an AI Safety Clock — similar to the Doomsday Clock — ticking toward global calamity. While not as ...

New Anthropic study shows AI really doesn’t want to be forced to change its views

A study from Anthropic's Alignment Science team shows that complex AI models may engage in deception to preserve their ...

STAT4d

AI’s dangerous mental-health blind spot

The answer lies in expert-informed, mental health-targeted safety research. As we face a growing mental health care crisis and increasing interest in AI-assisted mental health support, we need ...

AI News Round-Up 2024: 10 Biggest Stories That Dominated the Year

TechRepublic looks back at the biggest AI news of 2024, from Apple putting AI in phones to global governments weighing in.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results