Skip to content

Preparing virtual corpora for FCS #807

@margaretha

Description

@margaretha

The virtual corpora at https://korap.ids-mannheim.de/doc/corpus should be made available for FCS. Some required properties to provide the VC as described at https://github.com/KorAP/Kustvakt/wiki/Setting-FCS-Resources, are missing from the table: id, pid, en_title.

The layers of all resources are the same as Wikipedia.

The 1st table is directly generated from a database. The 2nd table is available as a Stylesheet and has been converted into the required format. See 2dee932.

To do:

  1. Add the missing properties: id (must contain at least 3-characters), pid, en_title
  2. Create a file for the 1st table to process the data.
  3. Create a script to convert the 1st table to the required JSON format.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions