The problem is that a 20% failure rate comes with no validation, and you are 100% liable for the failures of an AI you’re using as a customer support agent, which can end up costing you a ton and killing your reputation. The unfixable problem is that an AI solution takes a ton of effort to validate, way more than just double-checking a human’s answer.
It’s not a 20% failure rate when the chatbot routes calls to a human agent whenever it’s more than x% unsure about what to say.
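A minimal sketch of the kind of routing described here, assuming the chatbot exposes some confidence score between 0 and 1 for each reply. The names `BotReply`, `route`, and `CONFIDENCE_THRESHOLD` are hypothetical, and the 0.80 cutoff is just a placeholder for the "x%":

```python
from dataclasses import dataclass

# Hypothetical cutoff standing in for the "x%" in the comment above,
# expressed as the minimum confidence the bot needs to answer on its own.
CONFIDENCE_THRESHOLD = 0.80


@dataclass
class BotReply:
    text: str
    confidence: float  # assumed: a score in [0, 1] supplied by the model


def route(reply: BotReply, threshold: float = CONFIDENCE_THRESHOLD) -> str:
    """Return "bot" if the reply is confident enough, otherwise "human"."""
    if not 0.0 <= reply.confidence <= 1.0:
        raise ValueError("confidence must be between 0 and 1")
    return "bot" if reply.confidence >= threshold else "human"


print(route(BotReply("Your order shipped yesterday.", 0.95)))
print(route(BotReply("You may be owed a refund.", 0.40)))
```

Whether this helps depends entirely on how well the confidence score tracks actual correctness, which is the point being argued in the replies.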
AI solutions still handle the 80% of “bottom of the barrel” menial tasks perfectly well.
It won’t know that it doesn’t know. In its current state, AI seems to have almost no sense of what is right and wrong, or any way to validate that, even when you tell it that it is wrong. Maybe there are systems that can, but I am not aware of them.
Current AI chatbots assign a “confidence level” to every piece of output. That signals perfectly well when and where they should look for more information… but humans have been pushing them to “output something, anything”, instead of excusing themselves for not knowing something, or running some additional processes to look for the missing information.
As of this year, Copilot has been running web searches to compensate for its lack of information, and Gemini both runs web searches and iteratively self-checks its own answers in order to refine them (see “drafts”). It also seems like Gemini might be learning from humanity’s reactions to its wrong answers.
I feel like customer support is one place where AI may actually be used going forward, because companies don’t really care if their customers get support. The only wrinkle is if companies get held to the promises the AI makes (there’s that Air Canada incident from last year where the AI offered a refund and the company tried to walk it back).
I’ve had this discussion come up in meetings recently.
CustomGPT is like $500/month for 5000 queries… that limitation and price (if you have a reasonable number of customers) kind of just means you are better off hiring one employee. I’m not going to ping them for pricing on their enterprise plan beyond that, as it’s going to cost as much as an employee anyway.
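For a rough sense of the numbers, here is a small sketch using only the $500 and 5000-query figures quoted above. The function names are hypothetical, and the idea that extra volume is covered by stacking whole additional plans is an assumption, since the actual enterprise pricing is unknown:

```python
import math

PLAN_PRICE_USD = 500  # per month, figure quoted in the comment above
PLAN_QUERIES = 5000   # queries included per month, figure quoted above


def cost_per_query(price: float = PLAN_PRICE_USD, queries: int = PLAN_QUERIES) -> float:
    """Price of a single query if the plan is fully used."""
    return price / queries


def monthly_cost(volume: int, price: int = PLAN_PRICE_USD, queries: int = PLAN_QUERIES) -> int:
    """Monthly bill for `volume` queries.

    Assumption: volume beyond one plan is covered by buying whole extra
    plans at the same price; real enterprise pricing may differ.
    """
    plans = math.ceil(volume / queries)
    return plans * price


print(cost_per_query())     # 0.1, i.e. 10 cents per query
print(monthly_cost(20000))  # 2000, for a hypothetical 20,000 queries per month
```

Plugging in your own monthly query volume and what a support employee costs you is what settles the comparison; those two numbers are not given in this thread.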