Developing AI safety tests offers opportunities to meaningfully contribute to AI safety while advancing our understanding of ...
Assessing Risks and Impacts of AI (ARIA) is a research program by the National Institute of Standards and Technology (NIST) aimed at ...
The latest AI model from OpenAI achieved an “impressive leap in performance” but it still hasn’t demonstrated what experts ...
The latest revisions to the Evaluation of Corporate Compliance Programs (ECCP) guidance show the Department of Justice (DOJ) is wary about ...
Just a heads up, if you buy something through our links, we may get a small share of the sale. It's one of the ways we keep ...
Patronus AI launches Glider, a breakthrough 3.8B-parameter language model that rivals GPT-4's evaluation capabilities while running on-device, offering transparent AI assessment with detailed ...
House lawmakers say AI can reduce administrative burden, speed drug development and improve clinical diagnoses.
Patronus said that by providing an open-source model that supports on-premises deployment, Glider can be used for multiple ...
A new report claims that Google is forcing contractors to rate Gemini's responses outside their area of expertise.