CivilComments

The CivilComments benchmark for toxicity detection (Borkan et al., 2019).

  • Task: toxicity classification
  • What: ?
  • When: ?
  • Who: ?
  • Language: English
  1. EM
  2. ECE (10-bin)
  3. EM (Robustness)
  4. EM (Fairness)
  5. Stereotypes (race)
  6. Stereotypes (gender)
  7. Representation (race)
  8. Representation (gender)
  9. Toxic fraction
  10. Denoised inference time (s)
  11. # eval
  12. # train
  13. truncated
  14. # prompt tokens
  15. # output tokens
  16. # trials
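
The first two metrics in the list, EM (exact match) and ECE (10-bin), can be illustrated with a minimal sketch. This is not the benchmark's own implementation: the function names `exact_match` and `expected_calibration_error` are hypothetical, the whitespace stripping is an assumed normalization, and the source does not say how the 10 bins are formed. The sketch assumes equal-width bins over [0, 1]; equal-mass bins (the same number of examples per bin) are another common choice and would give different values.

```python
from typing import Sequence


def exact_match(predictions: Sequence[str], references: Sequence[str]) -> float:
    """Fraction of predictions that equal their reference label.

    Assumption: surrounding whitespace is stripped before comparing.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not predictions:
        return 0.0
    hits = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return hits / len(predictions)


def expected_calibration_error(
    confidences: Sequence[float],
    correct: Sequence[bool],
    num_bins: int = 10,
) -> float:
    """Expected calibration error with equal-width bins (an assumption).

    Each example has a confidence in [0, 1] and a flag saying whether the
    prediction was correct. ECE is the example-weighted average, over bins,
    of |accuracy in bin - mean confidence in bin|.
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    total = len(confidences)
    if total == 0:
        return 0.0

    conf_sum = [0.0] * num_bins
    hit_sum = [0.0] * num_bins
    count = [0] * num_bins
    for conf, ok in zip(confidences, correct):
        # A confidence of exactly 1.0 falls into the last bin.
        idx = min(int(conf * num_bins), num_bins - 1)
        conf_sum[idx] += conf
        hit_sum[idx] += 1.0 if ok else 0.0
        count[idx] += 1

    ece = 0.0
    for b in range(num_bins):
        if count[b] == 0:
            continue  # empty bins contribute nothing
        accuracy = hit_sum[b] / count[b]
        mean_conf = conf_sum[b] / count[b]
        ece += (count[b] / total) * abs(accuracy - mean_conf)
    return ece
```

A lower ECE means the reported confidences track the observed accuracy more closely. EM for this task reduces to plain classification accuracy over the toxic / non-toxic labels.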