Add sequence to extracted text from PDF/image #473
Unanswered
Pandurangbhor
asked this question in
Q&A
Replies: 1 comment
-
From what I understand, you want to replace the space between each word with an arrow. Since general PDF text extraction returns a single string, you can simply use: text = text.replace(' ', ' -> ') If the text is formatted as sentences, you can also separate them by splitting at the sentence-ending symbol: text = text.split('.') (note that this returns a list of strings rather than a single string). I hope this is useful to you.
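The two string operations suggested in this reply can be combined as in the minimal sketch below. The function names and the sample text are made up for illustration, and the sketch uses split() followed by join() instead of replace() so that repeated spaces do not produce doubled arrows. It orders words only by their position in the extracted string, so it assumes the extraction order already matches the order you want.

```python
from typing import List


def join_words_with_arrows(text: str) -> str:
    """Join whitespace-separated words with ' -> '.

    split() with no argument collapses runs of whitespace, which avoids
    the doubled arrows that replace(' ', ' -> ') gives on double spaces.
    """
    return " -> ".join(text.split())


def split_sentences(text: str) -> List[str]:
    """Split on '.' and drop empty or whitespace-only fragments."""
    return [part.strip() for part in text.split(".") if part.strip()]


# Hypothetical sample standing in for text extracted from a PDF.
sample = "Cut the sheet. Drill the holes.  Deburr edges."

sentences = split_sentences(sample)
sequences = [join_words_with_arrows(sentence) for sentence in sentences]

for sequence in sequences:
    print(sequence)
```

Splitting on '.' is a rough heuristic: it will also break at decimal points and abbreviations, which are common in engineering drawings, so the separator may need adjusting for real files.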
-
I am able to get all the text and sentences from a PDF engineering drawing file, but I am unable to get the sequence. For example, the first word has an arrow pointing towards the second word, so I want the words in the sequence given by the arrows. Kindly suggest a solution.