Skip to content

Exceptions for v2 to v3 conversion #291

@lcreteig

Description

@lcreteig

I noticed that the py-ard implementation for converting v2 to v3 typings is not yet complete:

py-ard/pyard/db.py

Lines 586 to 600 in d43a035

# TODO: Create mapping table using both the allele list history and
# deleted alleles as reference.
# Temporary Example
v2_to_v3_example = {
"A*0104": "A*01:04:01:01N",
"A*0105N": "A*01:04:01:01N",
"A*0111": "A*01:11N",
"A*01123": "A*01:123N",
"A*0115": "A*01:15N",
"A*0116": "A*01:16N",
"A*01160": "A*01:160N",
"A*01162": "A*01:162N",
"A*01178": "A*01:178N",
"A*01179": "A*01:179N",
"DRB5*02ZB": "DRB5*02:UTV",

I was wondering if this is still on the roadmap and/or if I can contribute anything to it.

The heuristic conversion in ard._predict_v3() does not work in all cases, because there's a bunch of exceptions. For example, if I apply it to the Current Name column of the IPD-IMGT/HLA pre-2010 nomenclature file, I get a different result than in the Name as of April 2010 column in 1,045 out of 4,826 cases.

I've made a mapping table such as in the linked snippet above for my own use case, based on the following files:

Can you think of any other exceptions that should be included? In that case I'd really appreciate your feedback.

I'd be happy to share the mapping table, or the code that creates it (I have this in R now, but could probably translate it to Python) if that's of any use to you.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions