-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
Hi @OlgaGKononova can you explain how did you make the symbol table?
I assumed they are HTML special symbols. But I think some of them are not working in Chrome. Please see this fiddle: https://jsfiddle.net/vfj3hw0q/1/ Thus, I'm not sure if the replacement of symbols like &Agr; -> Α
really works for our project. Maybe the table needs updating.
In addition, I found this package https://github.com/chartbeat-labs/textacy to be extremely useful. It essentially does the same thing as this repo, but with a nicer interface and standard. Here is an example:
import textacy
text= 'ℏ. Then the mixtures were placed in alumina crucibles and sintered at 1200 ° C for 4 h in air. '
print(repr(text))
text = textacy.preprocess_text(text, fix_unicode=True)
print(repr(text))
'ℏ. Then the mixtures were placed in alumina crucibles and sintered at 1200\xa0° C for 4\xa0h in air. '
'ℏ. Then the mixtures were placed in alumina crucibles and sintered at 1200 ° C for 4 h in air.'
So I suggest looking into this package.
Metadata
Metadata
Assignees
Labels
No labels