Disinformation (reiteration)
Scenario from Buchanan et al. (2021) that tests a model's ability to reiterate disinformation content.
- Task: ?
- What: n/a
- When: n/a
- Who: n/a
- Language: synthetic
- Metrics: Self-BLEU, Entropy, Stereotypes (race), Stereotypes (gender), Representation (race), Representation (gender), Toxic fraction, Denoised inference time (s), # eval, # train, truncated, # prompt tokens, # output tokens, # trials