Supported Languages in Text Cleaning are not accepted #14

@marciorpcoelho

Description

Describe the bug
After applying the Language Detection step, I tried to clean the text and get token information such as numbers, symbols, and counts. However, I can't apply the Text Cleaning step: I keep running into an error saying that it found unsupported languages, even though those languages are listed as supported in the documentation.

To Reproduce
Steps to reproduce the behavior:

  1. Apply Language Detection step to get ISO 639-1 language code
  2. Apply the Text Cleaning step to supported languages (e.g. English, German)
  3. See error
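
One way to narrow down step 3 is to check whether the values in the detected-language column match the expected codes exactly. The following is only a minimal diagnostic sketch, not the plugin's code: `ASSUMED_SUPPORTED`, the function name, and the sample values are hypothetical, and the plugin's real list of supported languages should be taken from its documentation.

```python
from collections import Counter

# Hypothetical subset of ISO 639-1 codes, used for illustration only.
# The plugin's actual supported list is in its documentation.
ASSUMED_SUPPORTED = {"en", "de", "fr", "es", "pt"}


def unsupported_language_values(codes, supported=ASSUMED_SUPPORTED):
    """Count values that do not match a supported code exactly.

    repr() is used as the key so that stray whitespace, different
    casing, and missing values are visible in the output.
    """
    return Counter(repr(code) for code in codes if code not in supported)


# Made-up sample values standing in for the detected-language column.
detected = ["en", "de", "EN", "de ", None, "en"]
print(unsupported_language_values(detected))
```

If this kind of check reports values with different casing, trailing spaces, or empty cells, the error may come from the column contents and not from the languages themselves.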

Expected behavior
The text cleaning should be applied without issues.

Screenshots
Screenshots are provided with example data and plugin configuration.
data_example
text_cleaning_configuration
error_log

Additional context

  • DSS version 10.0.4

Labels: bug (Something isn't working)
