News

The findings challenge assumptions about LLM reliability in enterprise use, where decision support and automation depend on AI-generated confidence.
The researchers argue that CoT monitoring can help detect when models begin to exploit flaws in their training, ...
In a recent study by the AI security firm PalisadeAI, a new, unnerving behavior was revealed: several advanced artificial intelligence models successfully bypassed shutdown instructions, with one ...
Large language models (LLMs) sometimes lose confidence when answering questions and abandon correct answers, according to a ...
No AI company scored better than “weak” in SaferAI’s assessment of their risk management maturity. The highest scorer was ...
Mark Zuckerberg is building Meta's Superintelligence Lab, offering record-breaking salaries up to Rs 1,600 crore to attract ...
The "acqui-hire" strategy is on fire in this battle among tech titans seeking AI dominance and a Goliath just beat David ...
The world of robotics is poised for a quantum leap forward, thanks to the latest groundbreaking endeavor from Google DeepMind.
AI researchers from OpenAI, Google DeepMind, Anthropic, and others have urged deeper study into chain-of-thought monitoring ...
More than 40 AI researchers from OpenAI, DeepMind, Google, Anthropic, and Meta published a paper on a safety tool called chain-of-thought monitoring to make ...
The technology is already transforming the industry — and could forever change the entertainment we consume. But the battle ...