Description
Author affiliations are available in raw arXiv API feeds, but are not exposed by this package's Result objects.
Steps to reproduce
The problem is apparent for any result set:
- There's no mention of affiliations in this package's documentation or in the source code.
- (Result)._raw.arxiv_affiliation is often defined, but it's a single string: the affiliation of one author among several.
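To make the gap concrete, here is a minimal sketch that reads per-author affiliations straight from an Atom entry using only the standard library. The entry is a hand-written sample, not a real paper, and the helper name affiliations_by_author is hypothetical. It assumes the arXiv Atom layout in which each author element may carry zero or more affiliation elements in the http://arxiv.org/schemas/atom namespace.

```python
import xml.etree.ElementTree as ET

# Hand-written sample entry for illustration only; names and institutes are made up.
SAMPLE_ENTRY = """\
<entry xmlns="http://www.w3.org/2005/Atom"
       xmlns:arxiv="http://arxiv.org/schemas/atom">
  <title>Sample title</title>
  <author>
    <name>Author One</name>
    <arxiv:affiliation>Institute A</arxiv:affiliation>
  </author>
  <author>
    <name>Author Two</name>
    <arxiv:affiliation>Institute B</arxiv:affiliation>
    <arxiv:affiliation>Institute C</arxiv:affiliation>
  </author>
  <author>
    <name>Author Three</name>
  </author>
</entry>
"""

# Namespace prefixes used in the queries below.
NS = {
    "atom": "http://www.w3.org/2005/Atom",
    "arxiv": "http://arxiv.org/schemas/atom",
}


def affiliations_by_author(entry_xml):
    """Return a list of (author name, [affiliations]) pairs in feed order."""
    entry = ET.fromstring(entry_xml)
    authors = []
    for author in entry.findall("atom:author", NS):
        name = author.findtext("atom:name", default="", namespaces=NS)
        affiliations = [
            (el.text or "").strip()
            for el in author.findall("arxiv:affiliation", NS)
        ]
        authors.append((name, affiliations))
    return authors


for name, affiliations in affiliations_by_author(SAMPLE_ENTRY):
    print(name, affiliations)
```

The raw feed keeps affiliations attached to each author, whereas the single arxiv_affiliation string described above belongs to only one of them.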
Expected behavior
Author affiliations should be exposed by the Result.Author class.
Versions
- python version: *
- arxiv.py version: >= 1.0.0
Additional context
This is a long-open issue in feedparser, perhaps open since 2015: kurtmckee/feedparser#24. There's a detailed breakdown of the interaction with arXiv results here: kurtmckee/feedparser#145 (comment). I suspect arXiv will release their JSON API, and this client library will be rewritten to use it, before this feedparser bug is resolved.
This client library could expose the single author affiliation extracted by feedparser, but this has negative impacts:
- It may misleadingly suggest that a certain author or institution led the publication in question, which sucks from an ethical perspective.
- Which affiliation is extracted may depend on the order of the authors, which arXiv may not guarantee. The extracted affiliation of a paper may vary.
- The affiliation may not apply to all of the authors for a paper; exposing it is misleading.
If the single author affiliation is useful in your application, despite the noted downsides, access it with (Result)._raw.get('arxiv_affiliation').
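A minimal sketch of that workaround, wrapped so it degrades gracefully when the key is absent. The helper name single_affiliation is hypothetical, and the SimpleNamespace objects below are stand-ins for real Result objects so that the snippet runs without network access. Note that _raw is a private attribute and may change between releases.

```python
from types import SimpleNamespace


def single_affiliation(result):
    """Return the one affiliation feedparser kept for this result, or None.

    Assumes `result` has a private `_raw` mapping, as described above.
    The returned string describes a single author, not the whole paper.
    """
    raw = getattr(result, "_raw", None)
    if raw is None:
        return None
    return raw.get("arxiv_affiliation")


# Stand-ins for Result objects; real ones would come from an arxiv.py search.
with_affiliation = SimpleNamespace(_raw={"arxiv_affiliation": "Institute A"})
without_affiliation = SimpleNamespace(_raw={})

print(single_affiliation(with_affiliation))
print(single_affiliation(without_affiliation))
```

Returning None rather than raising keeps callers from depending on a field that many results lack.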