Skip to content

Shift format registry downloads and normalisation to use Siegfried #15

@anjackson

Description

@anjackson

The current Python implementation is fine, if messy. But for format registries, it's essentially implementing the same thing as Siegfried's roy tool. Rather than keeping this separate tool updated, it could be merged with roy and perhaps modify roy so that it can output the full normalized registry contents as YAML/JSON. This might be quite a lot of work though, and will need to be in Go rather than Python, so probably a long-term goal.

Some of the steps appear to be:

  • add an option to roy inspect so it emits the whole normalised dataset as YAML or similar.
  • add support for all known format registries to roy (FFW, GitHub Linguist, TRiD, ???).
  • modify the wikidata.sig build so the Archiveamatica extensions can be omitted (like -pronom)
  • modify the digipres.org and sentinel systems to run roy to gather the data and aggregate that instead.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions