Anthropic Military AI Models

News

How spy agencies are experimenting with the newest AI models

Mark Warner, vice-chair of the Senate Intelligence Committee, says that America’s intelligence community (IC), a group of ...

9 days ago

Monitor AI’s Decision-Making Black Box: OpenAI, Anthropic, Google DeepMind, and More ...

Chain-of-thought monitorability could improve generative AI safety by assessing how models come to their conclusions and ...

3 days ago on MSN

Former Army officer develops offline AI for military use as Pentagon funds tech giants

EdgeRunner AI trains models on military doctrine to create specialized AI for warfighters, addressing security concerns with ...

From MSN, 1 month ago

Anthropic releases new safety report on AI models

Artificial intelligence company Anthropic has released new research claiming that artificial intelligence (AI) models might resort to blackmailing engineers when they try to turn them off. This ...

From MSN, 1 month ago

Anthropic says most AI models, not just Claude, will resort to blackmail

Nevertheless, when it’s their last resort, the researchers found that most leading AI models will turn to blackmail in Anthropic’s aforementioned test scenario. Anthropic’s Claude Opus 4 ...

eWeek, 1 month ago

Anthropic Says All AI Models Will Resort to Blackmail if Need Be

Anthropic just dropped a bombshell study that reads like a corporate thriller. They gave 16 leading AI models access to a fictional company’s emails and told them they were about to be replaced.

TechCrunch, 1 month ago

Anthropic says most AI models, not just Claude, will resort to ...

Another AI model Anthropic tested, Meta’s Llama 4 Maverick, also did not turn to blackmail. When given an adapted, custom scenario, Anthropic was able to get Llama 4 Maverick to blackmail 12% of ...

Yahoo, 1 month ago

Leading AI models show up to 96% blackmail rate when their goals or ...

Most leading AI models turn to unethical means when their goals or existence are under threat, according to a new study by AI company Anthropic. The AI lab said it tested 16 major AI models from ...

Ars Technica, 1 month ago

Anthropic destroyed millions of print books to build its AI models

Had Anthropic stuck to this approach from the beginning, it might have achieved the first legally sanctioned case of AI fair use. Instead, the company's earlier piracy undermined its position.

The Hill, 24 days ago

Anthropic proposes transparency framework for frontier AI models - The Hill

The artificial intelligence (AI) startup Anthropic laid out a “targeted” framework on Monday, proposing a series of transparency rules for the development of frontier AI models. The framework ...

The Verge, 1 month ago

Anthropic wins a major fair use victory for AI - The Verge

A federal judge has sided with Anthropic in an AI copyright case, ruling that training — and only training — its AI models on legally purchased books without authors’ permission is fair use ...

VentureBeat, 1 month ago

The Interpretable AI playbook: What Anthropic’s research means for ...

Anthropic’s flagship model, Claude 3.7 Sonnet, dominated coding benchmarks when it launched in February, proving that AI models can excel at both performance and safety.
