ChatGPT generates cancer treatment plans that are full of errors — Study finds that ChatGPT provided false information when asked to design cancer treatment plans. Researchers at Brigham and Women’s Hospital found that cancer treatment plans generated by OpenAI’s revolutionary chatbot were full of errors.
I suppose most sensible people already know that ChatGPT is not the answer for medical diagnosis.
Prompts were input to the GPT-3.5-turbo-0301 model via the ChatGPT (OpenAI) interface.
If the researchers wanted to investigate whether an LLM could be helpful, they should have fine-tuned a GPT-4/3.5 model specifically on cancer treatment plans and then tested it thoroughly, rather than just entering prompts into the off-the-shelf model available from OpenAI.
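For what that would even look like: a minimal sketch of fine-tuning via OpenAI's API, assuming a hypothetical JSONL dataset of oncology prompt/response pairs (file name and data are invented; at the time of the study only the GPT-3.5 family was open to fine-tuning):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload a (hypothetical) JSONL dataset where each line looks like:
# {"messages": [{"role": "user", "content": ...}, {"role": "assistant", "content": ...}]}
training_file = client.files.create(
    file=open("cancer_treatment_plans.jsonl", "rb"),
    purpose="fine-tune",
)

# Start a fine-tuning job on top of the base chat model
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-3.5-turbo",
)
print(job.id)  # poll this job until it finishes, then query the resulting model
```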
There have been a number of articles about how GPT has been out-diagnosing doctors in various domains. To me, that isn’t that surprising, as diagnosis is a pattern-matching problem, something a neural net will be very good at. Human doctors have been observed discounting rare conditions just because they were rare, reasoning that “it was much more likely to be something else” even when the symptoms backed up the conclusion. A computer can be more objective about such things.
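To make the base-rate point concrete, here's a toy Bayes calculation (every number is invented for illustration): with a sufficiently specific symptom pattern, the rare condition can still be the most probable explanation despite a 50× lower prior.

```python
# Toy Bayes update: rare disease vs. common disease given the same symptoms.
# Only two hypotheses are considered; all probabilities are made up.
prior_rare, prior_common = 0.001, 0.05   # base rates
lik_rare, lik_common = 0.90, 0.01        # P(this symptom pattern | disease)

unnorm_rare = prior_rare * lik_rare      # 0.0009
unnorm_common = prior_common * lik_common  # 0.0005
total = unnorm_rare + unnorm_common

print(f"P(rare | symptoms)   = {unnorm_rare / total:.2f}")    # 0.64
print(f"P(common | symptoms) = {unnorm_common / total:.2f}")  # 0.36
```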
…but none of that needs AI/ML. We’ve had expert systems since the 60s.
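For context, “expert system” here means hand-written if-then rules fired by an inference engine, in the spirit of MYCIN. A minimal forward-chaining sketch (the rules and facts are made up, not real clinical logic):

```python
# Minimal forward-chaining expert system. Rules map a set of required
# facts to a conclusion; inference re-applies rules until nothing new fires.
RULES = [
    ({"fever", "stiff_neck"}, "suspect_meningitis"),
    ({"suspect_meningitis", "photophobia"}, "order_lumbar_puncture"),
]

def infer(facts: set[str]) -> set[str]:
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for conditions, conclusion in RULES:
            if conditions <= facts and conclusion not in facts:
                facts.add(conclusion)
                changed = True
    return facts

print(infer({"fever", "stiff_neck", "photophobia"}))
# {'fever', 'stiff_neck', 'photophobia', 'suspect_meningitis', 'order_lumbar_puncture'}
```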
It’s also very different from constructing a treatment plan, which is what we’re discussing here.
Or they could feed the current model a reputable source of medical information.
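One plausible reading of “feeding” it a source is retrieval-augmented generation: look up relevant passages in a vetted corpus and prepend them to the prompt. A rough sketch, assuming a tiny in-memory corpus and OpenAI embeddings (the snippets, model choices, and function names are all assumptions, not the study's setup):

```python
from openai import OpenAI

client = OpenAI()

# Hypothetical vetted snippets, e.g. excerpted from clinical guidelines.
CORPUS = [
    "Stage I NSCLC: surgical resection is the preferred treatment ...",
    "HER2-positive breast cancer: trastuzumab-based regimens ...",
]

def embed(texts: list[str]) -> list[list[float]]:
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return [d.embedding for d in resp.data]

def answer(question: str) -> str:
    doc_vecs = embed(CORPUS)
    q_vec = embed([question])[0]
    # Dot-product ranking (OpenAI embeddings are unit-normalized, so this
    # is equivalent to cosine similarity).
    best = max(
        range(len(CORPUS)),
        key=lambda i: sum(a * b for a, b in zip(q_vec, doc_vecs[i])),
    )
    resp = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[
            {"role": "system",
             "content": f"Answer using only this source:\n{CORPUS[best]}"},
            {"role": "user", "content": question},
        ],
    )
    return resp.choices[0].message.content
```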
That wouldn’t guarantee correct answers.
It’s arguably more dangerous if ChatGPT gives mostly sane specific medical advice because it makes people put more trust in it than they should.