Vidsnap(IN DEVELOPMENT)

Smart Lecture Notes Generator

A CLI-based tool for Windows that processes local lecture videos, detects slide changes, and generates structured notes. Built in Python with modular components, and designed for future GUI integration and advanced features like OCR and summarization.

Features

Phase 1 (Completed)
- Detect slide transitions in video using SSIM-based frame comparison (OpenCV).
- Extract unique slide images and compile them into a single PDF report (img2pdf / PyPDF2).
- Modular code structure with robust CLI argument parsing and logging.
Phase 2 (In Progress)
- OCR text extraction from slides using Tesseract (pytesseract).
- Export enriched Markdown notes with slide images and extracted text.
Phase 3 (In Progress)
- Automated extractive summarization of slide content using Gensim TextRank.

Tech Stack

Language: Python 3
Video & Image Processing: OpenCV, scikit‑image
PDF Generation: img2pdf, PyPDF2
OCR: Tesseract OCR, pytesseract
Summarization: Gensim
CLI: argparse, logging

Prerequisites

Python 3.7 or higher
Tesseract OCR installed and added to PATH (for Phase 2 features)
FFmpeg (optional, for high-performance frame extraction)

Usage

Phase 1: Slide Extraction & PDF Report

python main.py --input "path/to/lecture.mp4" --output "slides.pdf"

--input: Path to local video file.
--output: Path to generated PDF containing slides.

Phase 2: OCR & Markdown Export (Coming Soon)

python main.py --input "lecture.mp4" --ocr --markdown --output-dir "notes/"

--ocr: Enable Tesseract OCR on extracted slides.
--markdown: Generate a Markdown file with images and extracted text.

Phase 3: Summarization (Coming Soon)

python main.py --input "lecture.mp4" --ocr --summarize --output-dir "notes/"

--summarize: Add extractive summary of slide text to output.

Project Structure

SmartLectureNotes/
├── slide_detector.py      # Detects slide changes
├── pdf_generator.py       # Generates PDF from slide images
├── ocr_processor.py       # Performs OCR on slide images
├── summarizer.py          # Summarizes text using Gensim
├── main.py                # CLI entry point
├── requirements.txt       # Python dependencies
└── README.md              # Project overview and usage

Roadmap

Phase 2: Complete OCR integration and Markdown export.
Phase 3: Implement summarization and improve output formatting.
GUI Integration: Build a desktop GUI using PyQt or Tkinter wrapping the CLI core.
Packaging: Bundle as a standalone Windows executable via PyInstaller.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
combined.py		combined.py
full.py		full.py
full_hash.py		full_hash.py
jpgtopdf.py		jpgtopdf.py
jpgtopdf_withdetection.py		jpgtopdf_withdetection.py
slides_output.pdf		slides_output.pdf
ssim.py		ssim.py
start.py		start.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Vidsnap(IN DEVELOPMENT)

Smart Lecture Notes Generator

Features

Tech Stack

Prerequisites

Usage

Phase 1: Slide Extraction & PDF Report

Phase 2: OCR & Markdown Export (Coming Soon)

Phase 3: Summarization (Coming Soon)

Project Structure

Roadmap

About

Uh oh!

Releases

Packages

Languages

cunyame/Vidsnap

Folders and files

Latest commit

History

Repository files navigation

Vidsnap(IN DEVELOPMENT)

Smart Lecture Notes Generator

Features

Tech Stack

Prerequisites

Usage

Phase 1: Slide Extraction & PDF Report

Phase 2: OCR & Markdown Export (Coming Soon)

Phase 3: Summarization (Coming Soon)

Project Structure

Roadmap

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages