This repository was archived by the owner on Feb 28, 2021. It is now read-only.

Parallelize censoring #9

@rominf

I think Dask is a good fit for this: it has a clean API and the same code can run on a single machine or on a cluster.

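The easiest and most effective way to parallelize is to tokenize the text first and then map the censoring step over the resulting words, since each word can be processed independently.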
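As a rough illustration of the idea, here is a minimal sketch using `dask.bag`. The names `BANNED`, `tokenize`, `censor_word`, and `censor_text` are hypothetical stand-ins, not this project's actual API, and the regex tokenizer is an assumption made only to keep the example self-contained.

```python
import re

import dask.bag as db

# Hypothetical word list, for illustration only.
BANNED = {"darn", "heck"}


def tokenize(text):
    """Split text into alternating word and non-word tokens, losing nothing."""
    return re.findall(r"\w+|\W+", text)


def censor_word(token):
    """Mask a token with asterisks if it is a banned word; otherwise keep it."""
    return "*" * len(token) if token.lower() in BANNED else token


def censor_text(text, npartitions=4, scheduler="processes"):
    """Censor each token independently in a Dask bag, then rejoin in order."""
    tokens = tokenize(text)
    if not tokens:
        return ""
    # Each partition is processed as a separate task; order is preserved.
    bag = db.from_sequence(tokens, npartitions=npartitions)
    return "".join(bag.map(censor_word).compute(scheduler=scheduler))


if __name__ == "__main__":
    print(censor_text("What the heck, darn it!"))
```

The `scheduler` argument is passed straight to `compute`, so the same function can run with `"synchronous"` for debugging, `"threads"`, or `"processes"`. With a `dask.distributed` client active, the work could go to a cluster instead. For very short texts the scheduling overhead will likely outweigh any gain, so this would mostly pay off on large inputs.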

Labels: enhancement (New feature or request), low-priority (Something not very important)
