Skip to content
Change the repository type filter

All

    Repositories list

    • Core libraries by the PRImA Research Lab
      HTML
      141663Updated Jul 30, 2024Jul 30, 2024
    • NAME-XML

      Public
      XML schemas for named entities and relations
      0300Updated Dec 2, 2023Dec 2, 2023
    • Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.
      HTML
      93582Updated May 25, 2023May 25, 2023
    • Text-related functionality (text comparison / evaluation, filtering, export etc.)
      Java
      0010Updated May 31, 2022May 31, 2022
    • PAGE-XML

      Public
      PAGE XML format collection for document image page content and more
      XSLT
      767101Updated Jul 7, 2021Jul 7, 2021
    • Web-based page layout editor created for EMOP (Early Modern OCR Project).
      Java
      51110Updated May 21, 2021May 21, 2021
    • Web-based viewer and editor for PAGE XML
      HTML
      1800Updated May 21, 2021May 21, 2021
    • Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as well as ALTO XML, FineReader XML, and HOCR
      HTML
      42470Updated Jan 30, 2021Jan 30, 2021
    • Java command line tool to convert PAGE XML files with layout and text content to PDF
      HTML
      21040Updated Apr 27, 2020Apr 27, 2020
    • PAGE Metadata Scanner is a command line tool that scans a single PAGE XML file (document layout and text content) and outputs its properties in CSV format.
      HTML
      2300Updated Nov 12, 2019Nov 12, 2019
    • root

      Public
      Some general stuff concerning PRImA tools
      0000Updated Sep 20, 2019Sep 20, 2019
    • Tool to call Google Cloud Vision OCR and save the result as PAGE XML
      Java
      0200Updated Sep 15, 2019Sep 15, 2019
    • Partial source code of PRImA Layout Evaluation Tool
      C++
      0200Updated Sep 6, 2019Sep 6, 2019
    • Semantic labelling - Ontology, search and matching algorithms, workflow tools
      Java
      4911Updated Oct 18, 2018Oct 18, 2018
    • Image processing functions used by PRImA tools
      C++
      1400Updated Oct 3, 2017Oct 3, 2017
    • Library with user interface elements and client-server communication classes based on Google Web Toolkit (GWT) that can be used for crowdsourcing applications.
      HTML
      31410Updated Oct 3, 2017Oct 3, 2017