AI radio hosts show why models need human oversight
Andon Labs ran four AI-operated radio stations (one each for Claude, ChatGPT, Gemini, and Grok) with zero human intervention.
The experiment revealed consistent failure modes: factual errors, repetitive content, and poor judgment about what makes engaging radio.
Even frontier models stumble at real-time decision-making and lack the taste needed for sustained, credible output.