When I run the API server, calling the API raises this error:
exllamav2-openai-server/server.py", line 156, in _gen_single_token
token, prob, eos = ExLlamaV2Sampler.sample(logits, gen_settings, self.sequence_ids[:1, :], random.random(), self.tokenizer, prefix_token)
ValueError: too many values to unpack (expected 3)
I fixed it by updating server.py, although I also had to hack some of the exllamav2 source code to make the two work together.
vim server.py
#token, prob, eos = ExLlamaV2Sampler.sample(logits, gen_settings, self.sequence_ids[:1, :], random.random(), self.tokenizer, prefix_token)
result = ExLlamaV2Sampler.sample(logits, gen_settings, self.sequence_ids[:1, :], random.random(), self.tokenizer, None)
token, _, _, prob, eos = result
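
If server.py has to run against both older and newer exllamav2 builds, a length check before unpacking may be safer than hard-coding either shape. The sketch below is only an illustration: `unpack_sample_result` is a hypothetical helper, not part of exllamav2 or this server, and it assumes the two shapes seen in this thread, a 3-tuple `(token, prob, eos)` and a 5-tuple with `token` first and `prob, eos` last. The meaning of the two middle values is not confirmed here, so they are discarded.

```python
def unpack_sample_result(result):
    """Return (token, prob, eos) from a sampler result of length 3 or 5.

    Hypothetical helper: the 5-tuple layout is assumed from the fix above
    (token, _, _, prob, eos) and is not checked against exllamav2 itself.
    """
    if len(result) == 3:
        # Older shape, as the original server.py expected.
        token, prob, eos = result
    elif len(result) == 5:
        # Newer shape; the two middle values are ignored.
        token, _, _, prob, eos = result
    else:
        # Fail loudly on any other shape rather than guess.
        raise ValueError(f"unexpected sampler result length: {len(result)}")
    return token, prob, eos
```

In `_gen_single_token` this would be used as `token, prob, eos = unpack_sample_result(ExLlamaV2Sampler.sample(...))`, with the arguments left exactly as in the fix above.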