References directories to compare apples and apples

I propose that under `references/` we divide the system outputs into directories representing the different task settings. I propose that we split `references` into:
- `references/gold-mentions`: the system attempted to link all (including NILs) gold mentions (?schwa-linkable)
- `references/gold-linked-mentions`: the system attempted to link only gold linked mentions (aida, houlsby)
- `references/system-mentions': the system identified its own mentions (schwa, tagme)

There's still the potential for the entries in the directories not to be altogether comparable with one another. For example, we could subdivide `system-mentions` into those that generate NEs only (schwa), and those that include other wikilinks (tagme); we could subdivide `gold-mentions` according to whether the system had access to CoNLL 2003 type annotations (although this may be harder to infer).

There is also the question of whether the directory structure should similarly be utilised to label (a) the corpus being evaluated (e.g. CoNLL vs ?IITB; testa vs testb), and (b) the ID mapping.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

References directories to compare apples and apples #53

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

References directories to compare apples and apples #53

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions