Commit 86c36bb

Add files via upload
1 parent 7b000fa commit 86c36bb

1 file changed: +16 -0 lines changed

PDFtoExcel.py

Lines changed: 16 additions & 0 deletions
@@ -0,0 +1,16 @@
+import pdfplumber
+import pandas as pd
+
+print("Place the PDF file in the same folder as this program first.\nThe output file is data.xlsx, saved in the same folder as this program.")
+path = input("Enter the name of the PDF file to convert (remember to include the .pdf extension): ")
+
+with pdfplumber.open(path) as pdf:
+    totalPages = len(pdf.pages)
+    df = pd.DataFrame()
+    for pageNumber in range(totalPages):
+        page = pdf.pages[pageNumber]
+        table = page.extract_table()
+        dfPage = pd.DataFrame(table)
+        df = pd.concat([df, dfPage], ignore_index = True)
+        #print(dfPage)
+df.to_excel('data.xlsx', index = False)
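
The script above concatenates one DataFrame per page. As a possible variation (not part of this commit), the rows could be gathered first and the DataFrame built once, with pages where `extract_table()` finds no table skipped explicitly. The sketch below assumes that `extract_table()` returns `None` on such pages and that an Excel writer such as openpyxl is installed; the names `collect_table_rows` and `pdf_to_excel` are hypothetical helpers introduced only for illustration.

```python
def collect_table_rows(pages):
    """Gather table rows from every page, skipping pages with no detected table.

    `pages` is any iterable of objects exposing extract_table(),
    for example the `pdf.pages` list of an opened pdfplumber PDF.
    """
    rows = []
    for page in pages:
        table = page.extract_table()  # assumed to be None when no table is found
        if table is None:
            continue
        rows.extend(table)  # each table is a list of rows (lists of cell values)
    return rows


def pdf_to_excel(pdf_path, xlsx_path="data.xlsx"):
    """Hypothetical wrapper: extract all page tables and write them to one sheet."""
    # Imported here so collect_table_rows can be used without these packages.
    import pdfplumber
    import pandas as pd

    with pdfplumber.open(pdf_path) as pdf:
        rows = collect_table_rows(pdf.pages)
    # Build the DataFrame once instead of concatenating page by page.
    pd.DataFrame(rows).to_excel(xlsx_path, index=False)
```

Calling `pdf_to_excel("input.pdf")` would mirror the behaviour of the committed script for a file named `input.pdf` (a placeholder name), writing `data.xlsx` to the current folder.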
