Skip to content

1fge/matricula-online-scraper

Repository files navigation

Matricula Online Scraper

Matricula Online is an online resource that contains various records used for genealogical research. This program can download any collection of scanned images from the site while preserving relevant information about the document and individual scans.

Installation

Make sure you are using Python >=3.6

To Install Files:

git clone https://github.com/1fge/matricula-online-scraper

To Install Required Modules:

pip install -r requirements.txt

Usage

After installing the dependencies, the scraper can be used directly from the command line.

To Download a Single Archive:

python main.py -o ./images -u https://data.matricula-online.eu/en/deutschland/akmb/militaerkirchenbuecher/0001

To Download a List of Archive URLs from a File:

python main.py -o ./images -t ./urls.txt

To Download a Range of Pages from an Archive:

python main.py -o ./images -r 10 -u https://data.matricula-online.eu/en/deutschland/akmb/militaerkirchenbuecher/0001

If you run into any problems, feel free to create an issue. Furthermore, your contribution is encouraged, so feel free to make a pull request if you think something can be improved. Enjoy!

About

A library used to download scanned images from Matricula Online

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages