How to evaluate Chatbot output.

I have developed a GPT-2 based chatbot in German language. I am wondering if there is any metric other then perplexity. to measure the output quality. Would be nice if you can provide me a GitHub link or example.

submitted by /u/Old-Responsibility61
[link] [comments]


Posted

in

by

Tags: