Exceptions for v2 to v3 conversion

I noticed that the py-ard implementation for converting v2 to v3 typings is not yet complete:

https://github.com/nmdp-bioinformatics/py-ard/blob/d43a035361cc90cef9bf0c98475ec90396b06a4f/pyard/db.py#L586-L600

I was wondering if this is still on the roadmap and/or if I can contribute anything to it.

The heuristic conversion in `ard._predict_v3()` does not work in all cases, because there's a bunch of exceptions. For example, if I apply it to the `Current Name` column of the [IPD-IMGT/HLA pre-2010 nomenclature file](https://github.com/ANHIG/IMGTHLA/blob/Latest/Nomenclature_2009.txt), I get a different result than in the `Name as of April 2010` column in 1,045 out of 4,826 cases.

I've made a mapping table such as in the linked snippet above for my own use case, based on the following files:
- The [IPD-IMGT/HLA pre-2010 nomenclature file](https://github.com/ANHIG/IMGTHLA/blob/Latest/Nomenclature_2009.txt)
- The [IPD-IMGT/HLA deleted alleles file](https://github.com/ANHIG/IMGTHLA/blob/Latest/Deleted_alleles.txt). If a v2 allele has been deleted, its correct v3 equivalent is not included in the nomenclature file (but is `"None"`), so I've pulled these from the `Description` column in the deleted alleles file.
- The obsolete allele-specific codes and obsolete DPB1-specific codes from https://bioinformatics.bethematchclinical.org/hla-resources/allele-codes/allele-code-nomenclature/ (Excel files linked under points 4 and 5)

Can you think of any other exceptions that should be included? In that case I'd really appreciate your feedback. 

I'd be happy to share the mapping table, or the code that creates it (I have this in R now, but could probably translate it to Python) if that's of any use to you. 


	# TODO: Create mapping table using both the allele list history and
	# deleted alleles as reference.
	# Temporary Example
	v2_to_v3_example = {
	"A0104": "A01:04:01:01N",
	"A0105N": "A01:04:01:01N",
	"A0111": "A01:11N",
	"A01123": "A01:123N",
	"A0115": "A01:15N",
	"A0116": "A01:16N",
	"A01160": "A01:160N",
	"A01162": "A01:162N",
	"A01178": "A01:178N",
	"A01179": "A01:179N",
	"DRB502ZB": "DRB502:UTV",

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Exceptions for v2 to v3 conversion #291

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Exceptions for v2 to v3 conversion #291

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions