How are evaluation results generated for existing multilingual benchmarks that consist of queries only?
#2
by haidequanbu · opened
Thank you to the authors for your contribution to the open-source community.
While reading the paper, I noticed that PolyGuard is trained on prompt-response pairs. Could you clarify how evaluation is conducted on test sets that consist of prompts only, such as MultiJ and XSafety?