Patronus AI launches Glider, a breakthrough 3.8B-parameter language model that rivals GPT-4's evaluation capabilities while running on-device, offering transparent AI assessment with detailed ...
Assessing Risks and Impacts of AI (ARIA) is a research program by the National Institute of Standards and Technology (NIST) aimed at ...
Developing AI safety tests offers opportunities to meaningfully contribute to AI safety while advancing our understanding of ...
Hempfield Township supervisors approved a five-year partnership with a company called Vialytics. They’ll be getting four ...
FACTS Grounding benchmark is seen as a significant step in promoting trust and accuracy in AI-generated content.
Google’s new AI evaluation rules for Gemini are sparking concerns about accuracy on sensitive topics like healthcare. Read ...
Artificial intelligence (AI) can be used to tell the difference between American whiskey and Scotch, and identify their ...
The new o3 model by OpenAI sets new AI performance records with adaptability and reasoning, but is it truly Artificial ...
House lawmakers say AI can reduce administrative burden, speed drug development and improve clinical diagnoses.
A new report claims that Google is forcing contractors to rate Gemini's responses outside their area of ​​expertise.
The latest revisions to the Evaluation of Corporate Compliance Programs (ECCP) guidance show the Department of Justice (DOJ) is wary about ...