OpenAI detects accidental CoT grading in models, finds no monitorability loss
METR: METR is an AI evaluation organization performing independent assessments of frontier model capabilities and risks, including reviews […]
METR: METR is an AI evaluation organization performing independent assessments of frontier model capabilities and risks, including reviews […]
Jan Leike is embarking on a new research project at Anthropic, stepping back from his role leading the alignment team, which he has transferred to Ethan Perez and Spencer Price.…
OpenAI’s Codex saw a remarkable spike in installs, reaching 90 million last week—12 times more than its competitor Claude code and 14 times more than the previous week. This surge…
The White House is reconsidering its firm approach to artificial intelligence as officials respond to security risks highlighted by new AI models capable of uncovering significant vulnerabilities in computer code.…
A recent convention in Phoenix highlighted a surge in competition within the border-security technology industry, driven by advancements in artificial intelligence (AI). Companies showcased AI systems capable of distinguishing between…
Anthropic has announced that its Claude integrations for Excel, PowerPoint, and Word are now generally available, while Claude for Outlook has entered public beta, featuring cross-app conversation context support. This…
Wall Street is witnessing a shift in the AI landscape as shares of Intel and AMD surge, reflecting a growing demand for CPUs amid the rise of AI agents, while…
The Trump administration is set to issue an executive order directing US agencies to collaborate with artificial intelligence companies to bolster defenses against AI-enabled cyber attacks. This initiative marks a…
Google is making a significant move in India with a $15 billion investment plan aimed at transforming Visakhapatnam into a hub for artificial intelligence, dubbed “AI Patnam,” through 2030. This…
Cerebras Systems is set to target an initial public offering (IPO) price range of $125 to $135 per share, amid reports that demand for the IPO is over 20 times…