discover the link from wfs:getcapabilities to metadata-link-content #2

@pvgenuchten

Description

A WFS GetCapabilities document typically contains a number of links to metadata content. When a crawler accesses a new GetCapabilities document via the WFS proxy, it will discover metadata through those links; that metadata may or may not already be present in a coupled catalogue, if one exists. A process should be able to fetch that metadata and make it crawlable for the search engine (by converting it to schema.org).
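As a rough illustration of the discovery step, the sketch below lists the metadata links per feature type in a capabilities document. It assumes the links are carried in MetadataURL elements, either as an xlink:href attribute (WFS 2.0 style) or as element text (WFS 1.1 style); the function name metadata_links is hypothetical and not part of any existing proxy code.

```python
import xml.etree.ElementTree as ET

# Attribute name of xlink:href in ElementTree's {namespace}local notation.
XLINK_HREF = "{http://www.w3.org/1999/xlink}href"


def local_name(tag):
    """Strip the '{namespace}' prefix so WFS 1.1 and 2.0 documents match alike."""
    return tag.rsplit("}", 1)[-1]


def metadata_links(capabilities_xml):
    """Return (feature type name, metadata URL) pairs found in the document.

    Assumption: each FeatureType element has a Name child and zero or more
    MetadataURL children.
    """
    root = ET.fromstring(capabilities_xml)
    links = []
    for element in root.iter():
        if local_name(element.tag) != "FeatureType":
            continue
        name = None
        hrefs = []
        for child in element:
            child_name = local_name(child.tag)
            if child_name == "Name":
                name = (child.text or "").strip()
            elif child_name == "MetadataURL":
                # WFS 2.0 uses xlink:href; WFS 1.1 puts the URL in the text.
                href = child.get(XLINK_HREF) or (child.text or "").strip()
                if href:
                    hrefs.append(href)
        links.extend((name, href) for href in hrefs)
    return links
```

Matching on local names rather than a fixed namespace is a design choice made here so that one function can handle both WFS versions; a stricter implementation might validate the namespace as well.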

Two approaches:

  • A CSW proxy may convert the iso19139 document to schema.org on the fly and expose it to the crawler at a URL of the form http://{csw-proxy}?url={full-csw-request-as-get}.
  • A process verifies whether the CSW document is in the coupled catalogue. If so, it exposes that document to the crawler at a URL of the form http://{catalogue}/dataset/{namespace}/{uuid}. If the catalogue does not contain the document, the process triggers a harvest of that document into the catalogue. (Open question: how will the catalogue later synchronise that resource?)
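
The two approaches could be sketched roughly as follows. All function names here are hypothetical, the catalogue lookup and the harvest trigger are passed in as callables because the issue does not specify how either would work, and returning None while a harvest is pending is an assumption of this sketch.

```python
from urllib.parse import quote, urlencode


def proxy_url(csw_proxy, csw_request):
    """Approach 1: wrap the full CSW GET request in the proxy's 'url' parameter."""
    # urlencode percent-encodes the nested URL so its own '?' and '&' survive.
    return f"{csw_proxy}?{urlencode({'url': csw_request})}"


def catalogue_url(catalogue, namespace, uuid):
    """Approach 2: address the record in the coupled catalogue."""
    return f"{catalogue}/dataset/{quote(namespace, safe='')}/{quote(uuid, safe='')}"


def resolve(catalogue, namespace, uuid, csw_request, in_catalogue, trigger_harvest):
    """Return the catalogue URL if the record is present, else request a harvest.

    in_catalogue(uuid) -> bool and trigger_harvest(csw_request) are
    hypothetical hooks supplied by the caller.
    """
    if in_catalogue(uuid):
        return catalogue_url(catalogue, namespace, uuid)
    trigger_harvest(csw_request)
    return None  # assumption: nothing is exposed until the harvest completes
```

Percent-encoding the nested request matters in the first approach, since otherwise the parameters of the CSW request would be read as parameters of the proxy itself.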
