generated from amazon-archives/__template_Apache-2.0
-
Notifications
You must be signed in to change notification settings - Fork 28
Open
Description
I would request the organizers to look into the matter that the baseline predictions are not compatible with other pretrained models.
For instance, the condition shown in the snippet below
elif token.startswith(start_token_pattern) and token != '▁́':
is not applicable to other models as only xlm-roberta is producing tokens in such a way that they have an '_' (underscore) at the beginning while other models are not.
It is present here in the code:
multiconer-baseline/utils/reader_utils.py
Line 92 in a44705c
| elif token.startswith(start_token_pattern) and token != '▁́': |
Also, running the prediction module on other models requires us to add a separate parameter of encoder_model so that it can run on those models, otherwise gives this error.

Metadata
Metadata
Assignees
Labels
No labels