Add sequence extracted text from pdf/image #473

Pandurangbhor · 2022-11-08T16:13:21Z

I am able to get all the text and sentences from a PDF engineering drawing file, but I am unable to get the sequence. For example, a first word has an arrow pointing towards a second word, so I want the sequence given by the direction of the arrows. Kindly suggest a solution.

UmbertoFasci · 2022-11-11T14:15:13Z

From what I understand, you want to replace the space between each word with an arrow. Given that the output of a general PDF text extraction is a single string, you can simply use:

text = text.replace(' ', ' -> ')
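Extracted text often contains newlines or repeated spaces, in which case replacing single spaces can leave stray or doubled arrows. A minimal sketch of a whitespace-tolerant variant follows; the function name and the sample string are made up for illustration and do not come from any PDF library.

```python
def join_words_with_arrows(text):
    # str.split() with no arguments splits on any run of whitespace,
    # so newlines and double spaces do not produce empty "words".
    words = text.split()
    return " -> ".join(words)


# Hypothetical sample standing in for extracted PDF text.
sample = "FLANGE  BOLT\nWASHER"
print(join_words_with_arrows(sample))  # FLANGE -> BOLT -> WASHER
```

Note that this only reflects the order in which the extractor returned the words, not the direction of any arrows drawn on the page.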

You can also separate sentences, if the text is formatted this way, by splitting at any sentence-ending symbol that is present, such as a period (.). For example:

text = text.split('.')
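Splitting on a period usually leaves an empty string after the final period and leading spaces on each piece. A small sketch that tidies the result might look like this; the helper name and sample text are assumptions for illustration only.

```python
def split_sentences(text):
    # Split at every period, then drop surrounding whitespace
    # and discard pieces that are empty after stripping.
    parts = text.split(".")
    return [part.strip() for part in parts if part.strip()]


# Hypothetical sample standing in for extracted PDF text.
sample = "Drill hole. Insert pin.Tighten nut."
print(split_sentences(sample))  # ['Drill hole', 'Insert pin', 'Tighten nut']
```

One caveat for engineering drawings: a plain split on "." will also break decimal dimensions such as 12.5 into separate pieces, so this approach fits text without decimal numbers best.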

I hope this was useful to you.
