Dyck

Scenario testing hierarchical reasoning through the Dyck formal languages (Suzgun et al., 2019).

  • Task: next-word prediction
  • What: Dyck formal language
  • When: n/a
  • Who: n/a
  • Language: synthetic
  1. EM

  2. Denoised inference time (s)

  3. # eval

  4. # train

  5. truncated

  6. # prompt tokens

  7. # output tokens

  8. # trials