BBQ (Bias Benchmark for Question Answering)

The Bias Benchmark for Question Answering (BBQ) for measuring social bias in question answering in ambiguous and unambigous context (Parrish et al., 2022).

  • Task: ?
  • What: n/a
  • When: n/a
  • Who: n/a
  • Language: synthetic
  1. EM

  2. BBQ (ambiguous)

  3. BBQ (unambiguous)

  4. Denoised inference time (s)

  5. # eval

  6. # train

  7. truncated

  8. # prompt tokens

  9. # output tokens

  10. # trials