PDF Translator

A powerful web application that translates PDF documents to various languages while preserving the original layout, formatting, and background colors.

Features

Exact Layout Preservation: Maintains the original PDF layout including text positioning, images, and graphics
Background Color Detection: Preserves the background color of each text block in the translated document
Font Style Retention: Maintains bold, italic, and other text formatting from the original document
Real-time Progress Updates: Provides WebSocket-based progress tracking during translation
Multiple Language Support: Translates to any language supported by OpenAI's models
Web Interface: User-friendly interface for uploading and translating PDFs
Intelligent Text Wrapping: Handles cases where translated text is longer than the original

⭐ Support This Project

If you find this project useful, please consider giving it a star on GitHub! Your support helps make this project better.

🤝 Contributing

Contributions are welcome and greatly appreciated! Here's how you can contribute:

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Feel free to check the Issues page for any open tasks or report bugs.

Translation Examples

Original Document

Translated Document

Notice how the application preserves:

The exact layout and positioning of all elements
Background colors of text blocks
Font styles and formatting
All images and graphical elements

How It Works

The PDF Translator follows these steps:

Text Extraction: Extracts text blocks from the original PDF while preserving their positions, font styles, sizes, and page numbers
Background Color Detection: Analyzes each text block area to identify its background color
Translation: Sends the extracted text to OpenAI's API for translation
PDF Recreation: Creates a new PDF by:
- Copying the original PDF pages exactly
- Covering original text with rectangles matching the detected background color
- Adding translated text in the same positions with matching font styles
Progress Tracking: Provides real-time updates throughout the process via WebSocket communication

Installation

Prerequisites

Python 3.8+
OpenAI API key

Setup

Clone this repository:

git clone https://github.com/Codehash001/gpt-pdf-translator.git
cd gpt-pdf-translator

Install the required dependencies:

pip install -r requirements.txt

Create a .env file in the project directory with your OpenAI API key:

OPENAI_API_KEY=your_openai_api_key_here

Usage

Web Interface

Start the web server:

python main.py

Open your browser and navigate to http://localhost:8000
Upload a PDF file, select the target language, and click "Translate"
Monitor the real-time progress updates during translation
Download the translated PDF when complete

API Endpoints

The application provides the following API endpoints:

POST /translate-pdf: Upload and translate a PDF file
GET /download/{filename}: Download a translated PDF file
WebSocket /ws/{task_id}: Connect to receive real-time progress updates

Technical Implementation

Background Color Detection

The application detects the background color of each text block by:

Rendering a small area around the text block as an image
Analyzing the color distribution to find the most common color
Using that color when covering the original text before adding the translation

This ensures that colored backgrounds, highlighted text, and other design elements are maintained in the translated document.

Text Wrapping

When translated text is longer than the original (common in many language pairs), the application:

Calculates the available space in the original text block
Estimates how many characters can fit per line
Applies intelligent word wrapping to keep the text within the original boundaries
Adjusts line spacing as needed to accommodate the text

Limitations

Very complex PDF layouts with mixed text directions might require manual adjustment
PDFs with custom fonts will use standard substitutes in the translated version
Documents with text embedded in images require separate OCR processing (not included)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

OpenAI for providing the translation API
PyMuPDF (fitz) for PDF processing capabilities
FastAPI for the web framework

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
converted		converted
example		example
templates		templates
uploads		uploads
.env.example		.env.example
.gitignore		.gitignore
EXAMPLES.md		EXAMPLES.md
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PDF Translator

Features

⭐ Support This Project

🤝 Contributing

Translation Examples

Original Document

Translated Document

How It Works

Installation

Prerequisites

Setup

Usage

Web Interface

API Endpoints

Technical Implementation

Background Color Detection

Text Wrapping

Limitations

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Codehash001/gpt-pdf-translator

Folders and files

Latest commit

History

Repository files navigation

PDF Translator

Features

⭐ Support This Project

🤝 Contributing

Translation Examples

Original Document

Translated Document

How It Works

Installation

Prerequisites

Setup

Usage

Web Interface

API Endpoints

Technical Implementation

Background Color Detection

Text Wrapping

Limitations

Contributing

License

Acknowledgments

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages