‘Embarrassingly simple’ probe finds AI in medical image diagnosis ‘worse than random’
‘Embarrassingly simple’ probe finds AI in medical image diagnosis ‘worse than random’

‘Embarrassingly simple’ probe finds AI in medical image diagnosis ‘worse than random’

We have models that are specifically made to be good at these kinds of tasks. Why would you choose the ones that aren't and then make generalizing claims about how AI sucks in this domain?
Yeah this is probably just straight up misinformation. By no means is a diagnosis going to be made by a generalist multimodal LLM. Diagnosis is a literally a binary classification (although that is an oversimplification) and on medical CV you are optimizing on that directly.
They did not use a LLM.
Not defending this article, but companies & big tech are generalizing the crap out of AI right now, and forcing it into everything.
They could have (and definitely should've) promoted the strengths and weaknesses of their models, specifically regarding what it can and can't do. But they don't. They get more money when their shareholders & customers think it's the next best thing for everything.