Skip to content

Committee report parsing #4

@wfdd

Description

@wfdd

There's two challenges here:

  • Parsing all of the various incarnations of the attendee table
  • Pairing reports with biils

The former simply requires work. The latter, well... Given that reports don't carry bill identifiers, we're gonna have to look up bills by their title on the date of the plenary a report was posted. This might pose an issue with renumbered amendments, and perennial bills and regulations, which are inconsistently assigned the year of the latest resubmission.

Bill titles will have to be normalised to replace Latin glyphs within Greek lexemes, which are a by-product of Parliament's shitty CMS—presumably.

We're also gonna need to find a good way to generate reproducible UIDs for committee reports.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions