There's a Benchmark Test That Measures AI 'Bullshit'—Most Models Fail

March 10, 2026 • Editorial Desk

BullshitBench tests whether AI models can detect nonsensical questions—or if they’ll confidently answer them anyway. The results are dire.