Vary number of in-context examples

Vary the number of in-context training examples.

  1. Mean win rate

  2. NaturalQuestions (open-book) - F1

  3. CNN/DailyMail - ROUGE-2

  4. IMDB - EM

  5. CivilComments - EM