Skip to content

Triage OSM tag errors #25

@newsch

Description

@newsch

There are around 600 wikipedia/wikidata tags in the full planet dump that cannot be parsed by our current setup.
See #19 and #23 for more details.

They should all be values that don't conform to the expected format.
Some of these can be fixed on OSM by us, we can leave notes on others.

  • Add error for titles greater than 255 bytes of UTF-8
  • Add error for langs greater than some amount - check standard
  • Add a subcommand to dump the errors to disk in a structured way (started in Add osm tag file parsing #23).
  • Categorize them by solution
  • Add any new issues for parsing problems
  • Contribute fixes/notes to OSM

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions