Hello!
This is very nice work!
I would like to know how to run the automatic safety evaluations. According to the paper, you use a safety critique LLM for these evaluations. Will you release the safety critique LLM in the future? If not, are there other methods for automatic safety evaluation?