Skip to content

Choose best answer based on perplexity with parallel generation? #3533

Answered by KerfuffleV2
Mihaiii asked this question in Q&A
Discussion options

You must be logged in to vote

but I was told that perplexity calculation is based on an external text source

Right, you feed it wikitext and (from what I know) perplexity is based on how accurately it predicts what is actually in that text. When you're just generating stuff though, you don't have a reference to compare the token it predict with. So you can't say if it got the right answer or not.

@Mihaiii

I'm thinking of parallel generation of 3-4 answers and calculating the perplexity of all of them

So you probably can't really do this. However you can possibly look at the code for the mirostat samplers. They calculate a surprise value for the token they picked and also keep track of mu which gets updated based o…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@KerfuffleV2
Comment options

Answer selected by Mihaiii
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants