mirror of
https://github.com/anthropics/claude-cookbooks.git
synced 2025-10-06 01:00:28 +03:00
Update building_evals.ipynb
multipe -> multiple
This commit is contained in:
committed by
GitHub
parent
126b05a907
commit
7367e82f40
@@ -374,7 +374,7 @@
|
||||
"Now you know about different grading design patterns for evals, and are ready to start building your own. As you do, here are a few guiding pieces of wisdom to get you started.\n",
|
||||
"- Make your evals specific to your task whenever possible, and try to have the distribution in your eval represent ~ the real life distribution of questions and question difficulties.\n",
|
||||
"- The only way to know if a model-based grader can do a good job grading your task is to try. Try it out and read some samples to see if your task is a good candidate.\n",
|
||||
"- Often all that lies between you and an automatable eval is clever design. Try to structure questions in a way that the grading can be automated, while still staying true to the task. Reformatting questions into multipe choice is a common tactic here.\n",
|
||||
"- Often all that lies between you and an automatable eval is clever design. Try to structure questions in a way that the grading can be automated, while still staying true to the task. Reformatting questions into multiple choice is a common tactic here.\n",
|
||||
"- In general, your preference should be for higher volume and lower quality of questions over very low volume with high quality."
|
||||
]
|
||||
}
|
||||
|
||||
Reference in New Issue
Block a user