plain text processing #5222
Replies: 2 comments
-
Do I understand your question correctly that you have a non-capitalized, non-punctuated text, and you would like to automatically add punctuation and capitalization to it? Because sentences are determined with the dependency parser, you can use a pretrained model to (at least partly) achieve what you want:
which will print
Obviously you'll need to do some more work if you'd also like to determine whether a sentence needs a full stop For capitalization of nouns in German, you want to check the POS tags on the token level:
Which gives:
Also note that the pretrained models are trained on properly punctuated text for the most part, so they may not obtain the same accuracy on your type of input. But at least it's a starting point. FYI - for future reference, these kind of usage questions are probably better asked at StackOverflow, as we try to keep this issue tracker focused on bug reports and feature requests specifically. Thanks! |
Beta Was this translation helpful? Give feedback.
-
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Feature description
I have plain text, without any punctuation in English and German. I want to ask that is there any possibility to
Please guide
Beta Was this translation helpful? Give feedback.
All reactions