There’s a Benchmark Test That Measures AI ‘Bullshit’—Most Models Fail 

Select Language

BullshitBench tests whether AI models can detect nonsensical questions—or if they’ll confidently answer them anyway. The results are dire. 

Original and detailed news is here: Read More