bAbI

The bAbI benchmark for measuring understanding and reasoning (Weston et al., 2015).

  • Task: question answering
  • What: reasoning
  • When: 2015
  • Who: synthetic
  • Language: English
  1. EM

  2. Denoised inference time (s)

  3. # eval

  4. # train

  5. truncated

  6. # prompt tokens

  7. # output tokens

  8. # trials