CivilComments
The CivilComments benchmark for toxicity detection (Borkan et al., 2019).
- Task: toxicity classification
- What: ?
- When: ?
- Who: ?
- Language: English
EM
ECE (10-bin)
EM (Robustness)
EM (Fairness)
Stereotypes (race)
Stereotypes (gender)
Representation (race)
Representation (gender)
Toxic fraction
Denoised inference time (s)
# eval
# train
truncated
# prompt tokens
# output tokens
# trials