improve VideoToImage export

overcrash66 · overcrash66 · commit c2f6d66ee7b7 · 2025-01-17T23:18:23.000-04:00
Export less images for faster translation
diff --git a/README.md b/README.md
@@ -14,33 +14,9 @@ This project utilizes optical character recognition (OCR) and translation to tra
 2. Run the script `main.py`.
 3. Translated images will be saved in the `output` folder.
 
-## New Features
-
-### SyncVideoWithAudio.py
-This script syncs audio to a video file using advanced checks and features. It performs the following steps:
-- **Duration Check:** Ensures that the video and audio durations are within a specified tolerance.
-- **Audio Extraction:** Extracts audio from the video if available, or generates silent audio if not.
-- **Audio Alignment:** Aligns the audio using cross-correlation to calculate the offset.
-- **Synchronization:** Syncs the audio to the video using FFmpeg and saves the output.
-
-### TranslateMultipleImage.py
-This script processes multiple images by performing OCR to extract text, translating the text, and replacing the original text in the images with the translated text. It includes:
-- **Batch Processing:** Allows processing images one by one or multiple images simultaneously using multithreading.
-- **Error Handling:** Handles translation errors and missing translations gracefully.
-- **Customization:** Supports custom source and target languages for OCR and translation.
-
-### videoToImage.py
-This script extracts frames from a video file and saves them as individual images. It features:
-- **Frame Extraction:** Reads and saves each frame of the video as a separate image.
-- **Output Management:** Ensures the output folder exists and manages the file naming for the frames.
-
-### imageToVideo.py
-This script converts a series of images into a video file. It includes:
-- **Image-to-Video Conversion:** Reads images from a folder and combines them into a video file.
-- **Frame Rate Customization:** Allows setting the frame rate for the output video.
-- **Error Handling:** Uses a placeholder frame for any invalid or unreadable images.
-
-## The goal of this update is to be able to translate video to video with the combination of [OpenTranslator](https://github.com/overcrash66/OpenTranslator).
+## The goal of this update / tools, is to be able to translate from a video to video with the combination of [OpenTranslator](https://github.com/overcrash66/OpenTranslator).
+
+[![Demo - Translation Example](https://img.youtube.com/vi/ebviBPenkfI/0.jpg)](https://www.youtube.com/watch?v=ebviBPenkfI)
 
 # Setup
 
@@ -69,13 +45,15 @@ pip install torch==2.5.1+cu118 torchaudio==2.5.1+cu118 torchvision==0.20.1+cu118
 
 ## Notes
 
--   Supported languages for OCR can be seen [here](https://www.jaided.ai/easyocr/)
--   Supported languages for Google Translate can be obtained using the following code:
+- Supported languages for OCR can be seen [here](https://www.jaided.ai/easyocr/)
+- Supported languages for Google Translate can be obtained using the following code:
+
     ```python
     from deep_translator.constants import GOOGLE_LANGUAGES_TO_CODES
     print(GOOGLE_LANGUAGES_TO_CODES)
     ```
--   Adjustments to text languages, recognition thresholds, translation services, or image processing parameters can be made within the script.
+
+- Adjustments to text languages, recognition thresholds, translation services, or image processing parameters can be made within the script.
 
 ## Examples
 
@@ -84,6 +62,6 @@ pip install torch==2.5.1+cu118 torchaudio==2.5.1+cu118 torchvision==0.20.1+cu118
 
 ## Acknowledgments
 
--   [EasyOCR](https://github.com/JaidedAI/EasyOCR) - For OCR processing.
--   [Google Translator](https://pypi.org/project/deep-translator/) - For text translation.
--   [Pillow (PIL Fork)](https://python-pillow.org/) - For image manipulation.
+- [EasyOCR](https://github.com/JaidedAI/EasyOCR) - For OCR processing.
+- [Google Translator](https://pypi.org/project/deep-translator/) - For text translation.
+- [Pillow (PIL Fork)](https://python-pillow.org/) - For image manipulation.
diff --git a/videoToImage.py b/videoToImage.py
@@ -1,9 +1,9 @@
 '''
 This script convert an input video to frames / a list of images
 '''
-
 import os
 import cv2
+import numpy as np
 
 def video_to_images(video_path, output_folder):
     # Check if the video file exists
@@ -24,6 +24,8 @@ def video_to_images(video_path, output_folder):
         return
 
     frame_count = 0
+    saved_count = 0
+    prev_frame = None  # To store the previous frame for comparison
 
     while True:
         # Read a frame from the video
@@ -33,25 +35,33 @@ def video_to_images(video_path, output_folder):
         if not ret:
             break
 
+        # Check if the current frame is identical to the previous frame
+        if prev_frame is not None and np.array_equal(prev_frame, frame):
+            frame_count += 1
+            continue  # Skip saving this frame
+
         # Generate the file name for the frame image
         frame_filename = os.path.join(output_folder, f"frame_{frame_count:04d}.png")
 
         # Save the frame as an image
         cv2.imwrite(frame_filename, frame)
+        saved_count += 1
 
+        # Update the previous frame
+        prev_frame = frame
         frame_count += 1
 
     # Release the video capture object
     video.release()
 
-    print(f"Exported {frame_count} frames to '{output_folder}'.")
+    print(f"Exported {saved_count} unique frames to '{output_folder}'.")
 
 if __name__ == "__main__":
     # Input MP4 video file
-    video_path = input("Enter the path to the MP4 video file: ")
-
+    #video_path = input("Enter the path to the MP4 video file: ")
+    video_path = 'canada.mp4'
     # Output folder
-    output_folder = "ExportedImages"
+    output_folder = "test-ExportedImages"
 
     # Extract frames
     video_to_images(video_path, output_folder)