-
Notifications
You must be signed in to change notification settings - Fork 4
Description
I propose that under references/
we divide the system outputs into directories representing the different task settings. I propose that we split references
into:
references/gold-mentions
: the system attempted to link all (including NILs) gold mentions (?schwa-linkable)references/gold-linked-mentions
: the system attempted to link only gold linked mentions (aida, houlsby)- `references/system-mentions': the system identified its own mentions (schwa, tagme)
There's still the potential for the entries in the directories not to be altogether comparable with one another. For example, we could subdivide system-mentions
into those that generate NEs only (schwa), and those that include other wikilinks (tagme); we could subdivide gold-mentions
according to whether the system had access to CoNLL 2003 type annotations (although this may be harder to infer).
There is also the question of whether the directory structure should similarly be utilised to label (a) the corpus being evaluated (e.g. CoNLL vs ?IITB; testa vs testb), and (b) the ID mapping.