Allow reporting in Rank Assitant Replies mode #3454

POBIX · 2023-03-07T23:38:00Z

POBIX
Mar 7, 2023

There should be a way to report harmful assistant replies. Especially when they're synthetic, I'd like to be able to take more drastic action than ranking them worst.

olliestanley · 2023-03-08T12:48:51Z

olliestanley
Mar 8, 2023
Collaborator

I think the current workflow/plan for reward model training would not be able to use the labels, we only want rankings for the synthetic data as it stands. We don't want to ask site users to do extra work if it won't be utilised. If someone from the ML team wants to correct me feel free though

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow reporting in Rank Assitant Replies mode #3454

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Allow reporting in Rank Assitant Replies mode #3454

POBIX Mar 7, 2023

Replies: 1 comment

olliestanley Mar 8, 2023 Collaborator

POBIX
Mar 7, 2023

olliestanley
Mar 8, 2023
Collaborator