
In the article, I show how to create evals with Promptfoo to test prompts like code. You can compare different models (open-source and proprietary) and use various assert types (equals, contains, g-eval, semantic similarity, JavaScript, etc.) to validate the output of your prompts.
