Skip to content

There is a question about using haystack in Chinese #4585

Answered by ZanSara
mc112611 asked this question in Questions
Discussion options

You must be logged in to vote

Hey @mc112611, unfortunately Haystack is not well equipped for Chinese text processing right now. The best idea would be to implement your own PreProcessor to properly handle Chinese text. As long as you subclass BaseComponent, you will be able to use it in a Pipeline like any other Haystack node. Here is some information on how to do it: https://docs.haystack.deepset.ai/docs/custom_nodes. If later you find out that your custom PreProcessor works well and you want to contribute, we would be really happy to accept a PR for this!

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@mc112611
Comment options

@mc112611
Comment options

Answer selected by mc112611
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants