Replies: 3 comments 8 replies
-
Writing mzIdentML is unfortunately somewhat complicated. If you decide to go that route, you can look at my implementation here: https://github.com/lazear/sage/tree/mzid/crates/mzidentml/src
-
I totally agree about the problems with XML in mzIdentML, and more modern (binary) implementations are probably more useful. My reasoning for adopting the mzid model, e.g. for PSMs, is that it would cover all possibilities. I know that each search engine or downstream processing tool has its own ideas about naming and about what is actually mandatory, but you can still put all of it into the mzid representation. That is definitely what I like most about the PSI formats, meaning mzid and mzML, NOT mzTab, which lets you model final results at best, and those cannot be used as input. Also, having these implementations (or structs) will probably make it easy to support binary, more future-proof formats later. But right now, mzid would be the best full-featured interchange format for all search engine results, or actually any identification results in proteomics.
-
I think two distinct and interesting topics are being raised here:
We could maybe start by answering the first point, because it is more pragmatic and has fewer consequences (despite the amount of work required). Regarding the second point, there are a lot of considerations, and IMO it will take time to reach a consensus. If we decide to answer the first question positively, we can then adopt an mzid PSM representation within the mzid module. This would give a viable solution to people wanting to deal with PSM-level data. And if we later find a strong rationale for a common PSM representation, this would give us a starting point.
-
I would like to propose implementing an mzid reader and possibly also a writer.
One other thing that is urgently needed for this is to define a PSM struct, which can then be used by all implementations handling identifications. Here, I would propose starting with what mzid can offer. We discussed this quite a lot while developing the format, and I assume that almost everything you can say about a PSM is modeled in it. How we implement those is another matter, but having all the attributes and params defined in mzid would be helpful.
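To make the proposal a bit more concrete, here is a minimal sketch of what such a PSM struct could look like in Rust. The field names follow the attributes of mzIdentML's `SpectrumIdentificationItem` (plus `spectrumID` from its parent `SpectrumIdentificationResult`), and the `Param` enum mirrors `cvParam` and `userParam`. The type names `Psm` and `Param` and the helper methods `cv_value` and `ppm_error` are hypothetical and only for illustration; they are not taken from any existing crate.

```rust
/// A controlled-vocabulary or user-defined parameter, mirroring the
/// mzIdentML <cvParam> and <userParam> elements.
#[derive(Debug, Clone, PartialEq)]
pub enum Param {
    Cv {
        cv_ref: String,
        accession: String,
        name: String,
        value: Option<String>,
    },
    User {
        name: String,
        value: Option<String>,
    },
}

/// Hypothetical PSM struct. Fields follow the attributes of mzIdentML's
/// <SpectrumIdentificationItem>; `spectrum_id` comes from the parent
/// <SpectrumIdentificationResult>. Optional attributes are `Option`s.
#[derive(Debug, Clone, PartialEq)]
pub struct Psm {
    pub id: String,
    pub spectrum_id: String,
    pub peptide_ref: Option<String>,
    pub charge_state: i32,
    pub experimental_mass_to_charge: f64,
    pub calculated_mass_to_charge: Option<f64>,
    pub rank: u32,
    pub pass_threshold: bool,
    /// Search-engine-specific scores and annotations live here.
    pub params: Vec<Param>,
}

impl Psm {
    /// Return the value of the first cvParam with the given accession
    /// that carries a value, if any.
    pub fn cv_value(&self, accession: &str) -> Option<&str> {
        self.params.iter().find_map(|p| match p {
            Param::Cv { accession: acc, value, .. } if acc.as_str() == accession => {
                value.as_deref()
            }
            _ => None,
        })
    }

    /// Relative m/z error in ppm, available only when the calculated
    /// m/z is present.
    pub fn ppm_error(&self) -> Option<f64> {
        self.calculated_mass_to_charge
            .map(|calc| (self.experimental_mass_to_charge - calc) / calc * 1e6)
    }
}
```

Keeping engine-specific scores in `params` rather than as dedicated fields is one possible design choice: the struct stays the same for every search engine, at the cost of string lookups when a particular score is needed.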