What about post launch
While a significant proportion of queries can be successfully matched at launch (approx. 70% is realistic), we need to constantly train the chatbot to improve it's awareness of similar queries in the future which would help towards the remaining 30%.
Take an example of an intent (answer) we have setup that will tell our customers who to contact if they have left an item on one of our planes.
During the development phase we added numerous combinations of training terms such as
I left something on the plane
I left an item on the plane
Left an item
And so on
This will do a reasonable job of matching a high percentage of customer queries, but consider a query such as
I flew with you yesterday from Edinburg and think I may have left my wallet on the plane, I was sitting somewhere towards the middle
From a human perspective, the query makes sense. However for the chatbot and natural language processing there is a lot of superfluous data to unpick where the additional noise in the query may have a better chance of matching an intent to help you select your own seats:
I am flying to Edinburg and want to sit somewhere towards the middle
Herein lies the challenge, based on the customer query which a human could easily see was related to an item left on a plane - it would be perfectly reasonable for our chatbot to think they wanted to select their seat and therefore provide a completely irrelevant response, thus frustrating your customer.
It is at this point that many chatbot solutions would stop, a bad experience ensues, and chatbots get the blame.
The good news is, we don't implement to stop at this point as by our count we are only 20% of the way through the project.
Through leveraging the unmatched query metrics we are recording in our telemetry, we manually review all exceptions on a daily basis and make a determination as to which answer is the best match.
The unmatched query is then added as a new training phrase to the intended intent, therefore ensuring that any similar queries in future have a better chance of being recognised correctly.
If there is no appropriate intent to add the training phrase to, that phrase is then flagged for consideration as a new answer / intent to include within the overall chatbot vocabulary in the future.
This continuous improvement is key, and through a number of months will converge a chatbot solution towards 98/99% accuracy for understanding the intent of your customers.
Of course this does not translate to fulfilment statistics as there will always be a percentage of customer queries that do require the specialist skills from your agents (~15%) however a well tweaked chatbot can significantly reduce the inbound load for those queries which can be answered directly or redirected to self service options.