Skip to content

Commit 91bd772

Browse files
committed
Added a test pdf file and made minor changes
1 parent 98d64ca commit 91bd772

File tree

2 files changed

+10
-5
lines changed

2 files changed

+10
-5
lines changed

AUTOMATION/pdfToText.py

Lines changed: 10 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,15 @@
1+
import argparse
12
import pdfminer.high_level
23

3-
# Extract with Pdfminer.six Module
4-
def With_PdfMiner():
5-
with open('test.pdf','rb') as fh:
6-
doc = pdfminer.high_level.extract_text(fh)
4+
# Extract text with Pdfminer.six Module
5+
def With_PdfMiner(pdf):
6+
with open(pdf,'rb') as file_handle:
7+
doc = pdfminer.high_level.extract_text(file_handle)
78
print(doc)
89

910
if __name__ == '__main__':
10-
With_PdfMiner()
11+
parser = argparse.ArgumentParser()
12+
parser.add_argument("file", help = "PDF file from which we extract text")
13+
args = parser.parse_args()
14+
# print()
15+
With_PdfMiner(args.file)

AUTOMATION/test.pdf

7.76 KB
Binary file not shown.

0 commit comments

Comments
 (0)