[Bug] java.lang.OutOfMemoryError: Java heap space when parsing documents #521

@psydok

Description

The problem reported in #489 still applies in full to versions 2.3 and 2.3.2 (Docker image). It reproduces completely with the same file from the previous issue.

Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
        at org.apache.fontbox.cmap.CMap.readCode(CMap.java:165)
        at org.apache.pdfbox.pdmodel.font.PDType0Font.readCode(PDType0Font.java:553)
        at org.apache.pdfbox.contentstream.PDFStreamEngine.showText(PDFStreamEngine.java:690)
        at org.apache.pdfbox.contentstream.PDFStreamEngine.showTextStrings(PDFStreamEngine.java:633)
        at org.apache.pdfbox.contentstream.operator.text.ShowTextAdjusted.process(ShowTextAdjusted.java:53)
        at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:849)
        at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:495)
        at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
        at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:142)
        at org.apache.pdfbox.text.LegacyPDFStreamEngine.processPage(LegacyPDFStreamEngine.java:146)
        at org.apache.pdfbox.text.PDFMarkedContentExtractor.processPage(PDFMarkedContentExtractor.java:41)
        at model.Document.parseTags(Document.java:317)
        at model.Document.load(Document.java:82)
        at DedocTableExtractor.extract(DedocTableExtractor.java:148)
        at DedocTableExtractor.run(DedocTableExtractor.java:118)
        at DedocTableExtractor.main(DedocTableExtractor.java:72)

Labels: enhancement (New feature or request)
