Automatically remove brand logos and unwanted slides from PDFs
Updated Jul 14, 2025 - Python
Distributed GCS-to-GCS multilingual PDF processing service built for horizontal scaling and concurrency. It can be deployed with Docker Compose for high-volume processing.
PDFTextStripper is a lightweight Python utility that removes only the text layer from PDF files — while preserving images, colors, vector drawings, and overall structure.
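As a rough illustration of what removing only the text layer can involve, the sketch below drops text objects from an already-decoded PDF content stream. In PDF content streams, text is drawn between the `BT` and `ET` operators, while images and vector paths use other operators. This is not PDFTextStripper's actual implementation: the function name `strip_text_objects` and the sample stream are hypothetical, and the regex is a simplification that assumes `BT` and `ET` do not appear inside string literals or inline image data.

```python
import re

# Text objects in a decoded content stream are bracketed by the BT
# (begin text) and ET (end text) operators. Assumption: neither token
# appears inside a string literal or inline image data.
_TEXT_OBJECT = re.compile(rb"\bBT\b.*?\bET\b", re.DOTALL)


def strip_text_objects(content_stream: bytes) -> bytes:
    """Return the content stream with every BT ... ET text object removed.

    Hypothetical helper for illustration only. Image (Do) and path
    (re, f, ...) operators outside text objects are left untouched.
    """
    return _TEXT_OBJECT.sub(b"", content_stream)


if __name__ == "__main__":
    # Hypothetical stream: one image, one line of text, one filled rectangle.
    sample = (
        b"q /Im1 Do Q\n"
        b"BT /F1 12 Tf 72 720 Td (Hello) Tj ET\n"
        b"0 0 1 rg 10 10 100 50 re f"
    )
    print(strip_text_objects(sample).decode("ascii"))
```

A complete tool would also have to parse the file, decompress each page's content streams (for example those using FlateDecode), and write the modified streams back, which is usually left to a PDF library.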