Skip to content

Latin script: segmenter should support word segmentation #175

@Kerollmops

Description

@Kerollmops

We should be able to support splitting words by methods other than the text casing. Libraries like instant-segment exist to do that.

  • redneckbossryan -> redneck, boss and ryan can be extracted
  • massachusetsinstutitute -> massachusetts, institute

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions