资讯
Mark Warner, vice-chair of the Senate Intelligence Committee, says that America’s intelligence community ( IC ), a group of ...
Chain-of-thought monitorability could improve generative AI safety by assessing how models come to their conclusions and ...
EdgeRunner AI trains models on military doctrine to create specialized AI for warfighters, addressing security concerns with ...
Artificial intelligence company Anthropic has released new research claiming that artificial intelligence (AI) models might resort to blackmailing engineers when they try to turn them off. This ...
Nevertheless, when it’s their last resort, the researchers found that most leading AI models will turn to blackmail in Anthropic’s aforementioned test scenario. Anthropic’s Claude Opus 4 ...
Anthropic just dropped a bombshell study that reads like a corporate thriller. They gave 16 leading AI models access to a fictional company’s emails and told them they were about to be replaced.
Another AI model Anthropic tested, Meta’s Llama 4 Maverick, also did not turn to blackmail. When given an adapted, custom scenario, Anthropic was able to get Llama 4 Maverick to blackmail 12% of ...
Most leading AI models turn to unethical means when their goals or existence are under threat, according to a new study by AI company Anthropic. The AI lab said it tested 16 major AI models from ...
Had Anthropic stuck to this approach from the beginning, it might have achieved the first legally sanctioned case of AI fair use. Instead, the company's earlier piracy undermined its position.
The artificial intelligence (AI) startup Anthropic laid out a “targeted” framework on Monday, proposing a series of transparency rules for the development of frontier AI models. The framework ...
A federal judge has sided with Anthropic in an AI copyright case, ruling that training — and only training — its AI models on legally purchased books without authors’ permission is fair use ...
Anthropic’s flagship model, Claude 3.7 Sonnet, dominated coding benchmarks when it launched in February, proving that AI models can excel at both performance and safety.
一些您可能无法访问的结果已被隐去。
显示无法访问的结果