What prompts would you use to test how uncensored an “uncensored” LLM truly is?
I built an unfiltered chatbot that I believe won’t reject any prompt, no matter what, but I want to put it to the test in case there are prompts I missed.
I want to see if you guys can come up with a prompt that it would reject.
submitted by /u/John_Lins