What is the proper way to prompt JudgeLM for single answer + reference free evaluation? The paper talks about this, but I don't see any explicit examples in the appendix nor in the hosted demo