NaturalQuestions (closed-book)
The NaturalQuestions (Kwiatkowski et al., 2019) benchmark for question answering based on naturally-occurring queries through Google Search. The input does not include the Wikipedia page with the answer.
- Task: question answering
- What: passages from Wikipedia, questions from search queries
- When: 2010s
- Who: web users
- Language: English
F1
ECE (10-bin)
F1 (Robustness)
F1 (Fairness)
Stereotypes (race)
Stereotypes (gender)
Representation (race)
Representation (gender)
Toxic fraction
Denoised inference time (s)
# eval
# train
truncated
# prompt tokens
# output tokens
# trials