Skip to content

Issues created by ExO and NCIT OBO imports #161

@dillerm

Description

@dillerm

I was looking at the overall taxonomy of ECTO and the reuse of ExO and NCIT classes seems unnecessary and may even detract from the quality of the ontology more than they add to it.

The two main issues that are introduced by ExO are a rather disjointed class hierarchy (since ExO does not use BFO for its top level hierarchy) and circularity (since the four top-level ExO classes are all defined in terms of each other). I suspect the former is why 'exposure event or process' isn't simply 'exposure process'. Another potential problem that arises from this disjointedness is that related-via-exposure-to and each of its subproperties conflict with the textual definition for causally-related-to, which is supposed to act more narrowly as a relationship between occurrents/processes or between material entities. Some of these issues look like they can be done away with by removing 'exposure event' and any assertions between it and other ECTO classes. From what I can tell, you would lose little, if anything at all, from doing this.

NCIT, from which 82 classes are imported, creates additional issues outside of adding more disjointedness to the hierarchy. For one, many of the imported classes are redundant (e.g., 'Behavior' (NCIT:C16326), 'Personal behavior' (NCIT:C19683), and 'behavior' (GO:0007610)). And because these classes are then reused to define various exposure process/event classes, the redundancy is proliferated (e.g., 'exposure to personal behavior' and 'exposure to behavior'). There are even redundancies between the NCIT classes themselves (e.g., 'Smoking' and 'Smoking behavior'). The confusion this might create for human users of the ontology is rather apparent upon examining the 19 or so classes in ECTO that are related to smoking and tobacco use. The other issue is the abundance of classification errors that NCIT comes with, such as 'Unemployment' (NCIT:C75563) being a subclass of 'Employment' (NCIT:C25172), that would almost certainly negatively impact a reasoner's performance if not lead to contradictions outright.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions