-
I am trying to extract links from this particular PDF file(attached). But when I use page.get_links() it shows an empty list. I don't understand why. Here is the sample code: import pymupdf
doc = pymupdf.open('faster_rcnn.pdf')
page = doc[1]
print(page.get_links()) |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
The page indeed contains no links! |
Beta Was this translation helpful? Give feedback.
-
Okay. It seems so. I think that I will need to use regex and get_text option. |
Beta Was this translation helpful? Give feedback.
The page indeed contains no links!
There exists text that looks like a link, but this is just text.
PDF viewers usually still react to this type of text format as if a link had been technically defined. This causes your confusion.