Forget data labeling: Tencent’s R-Zero shows how LLMs can train themselves
By using two co-evolving AI models, the R-Zero framework generates its own learning curriculum, moving
OpenAI–Anthropic cross-tests expose jailbreak and misuse risks — what enterprises must add to GPT-5 evaluations
OpenAI and Anthropic tested each other’s AI models and found that even though reasoning models
The AI Hype Index: AI-designed antibiotics show promise
Separating AI reality from hyped-up fiction isn’t always easy. That’s why we’ve created the AI
Unlocking enterprise agility in the API economy
Across industries, enterprises are increasingly adopting an on-demand approach to compute, storage, and applications. They
Salesforce builds ‘flight simulator’ for AI agents as 95% of enterprise pilots fail to reach production
Salesforce launches CRMArena-Pro, a simulated enterprise AI testing platform, to address the 95% failure rate
The Download: introducing: the Security issue
This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s
AI comes for the job market, security, and prosperity: The Debrief
When I picked up my daughter from summer camp, we settled in for an eight-hour
How procedural memory can cut the cost and complexity of AI agents
Memp takes inspiration from human cognition to give LLM agents “procedural memory” that can adapt
Anthropic launches Claude for Chrome in limited beta, but prompt injection attacks remain a major concern
Anthropic launches a limited pilot of Claude for Chrome, allowing its AI to control web
‘Bubbles’ turn air into drinkable water
Today, 2.2 billion people in the world lack access to safe drinking water. But the
Fix damaged art in hours with AI
Art restoration takes steady hands and a discerning eye. For centuries, conservators have identified areas
Emergency help for low blood sugar
Most people with type 1 diabetes inject insulin to prevent their blood sugar levels from
MIT is worth fighting for
As I write in late July, we’re contending with a major tax increase on the
Recent books from the MIT community
Empire of AI: Dreams and Nightmares in Sam Altman’s OpenAI By Karen Hao ’15 PENGUIN RANDOM HOUSE,
This website lets you blind-test GPT-5 vs. GPT-4o—and the results may surprise you
Take this blind test to discover whether you truly prefer OpenAI’s GPT-5 or the older