Impresso Datalab Notebooks

About

The Impresso project develops application interfaces to facilate historical transmedia research through:

the Impresso Web App, a user interface for content exploration and visualisation.
the Impresso Datalab, a suite of tools for data exploration and analysis.

Specifically, the Impresso Datalab enables custom analyses of the Impresso corpus, and the semantic indexation of external document collections also with Impresso models. We offer access to the Impresso corpus, data and models via the Impresso Public API, a dedicated Python library, and HuggingFace. For more information, be sure to visit the Datalab website.

Impresso Public API: The software component that provides third-party access to the Impresso backend.
Impresso Python Library: The preferred method for users to interact with the Impresso Public API.
Impresso Models: A collection of models trained to annotate the Impresso Corpus, made publicly available to facilitate the annotation of external documents, enabling comparison and analysis of semantic enrichments. Impresso models can be accessed through the Impresso Hugging Face organisation and via annotation services offered through the API.

Before getting started, check out how to create an account and obtain an API token on the Impresso Datalab website.

Notebooks

Getting Started

The notebooks in the starter folder will help you get started with the Impresso Public API and Python library:

Explore and Visualise your Impresso data

The notebooks in the explore-vis folder help you build complementary views on your Impresso data:

Annotate your Documents with Impresso Models

The notebooks in the annotate folder demonstrate how to use Impresso models, either from the Hugging Face hub or through the Impresso API. These notebooks guide you in annotating your documents to produce annotations that are compatible with those in the Impresso corpus.

About Impresso

Impresso project

Impresso - Media Monitoring of the Past is an interdisciplinary research project that aims to develop and consolidate tools for processing and exploring large collections of media archives across modalities, time, languages and national borders. The first project (2017-2021) was funded by the Swiss National Science Foundation under grant No. CRSII5_173719 and the second project (2023-2027) by the SNSF under grant No. CRSII5_213585 and the Luxembourg National Research Fund under grant No. 17498891.

Copyright

License

This program is provided as open source under the GNU Affero General Public License v3 or later.

Name		Name	Last commit message	Last commit date
Latest commit History 274 Commits
.github/workflows		.github/workflows
annotate		annotate
explore-vis		explore-vis
starter		starter
workshop_resources		workshop_resources
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
LICENSE		LICENSE
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Impresso Datalab Notebooks

About

Contents

Notebooks

Getting Started

Explore and Visualise your Impresso data

Annotate your Documents with Impresso Models

About Impresso

Impresso project

Copyright

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 12

Uh oh!

Languages

License

impresso/impresso-datalab-notebooks

Folders and files

Latest commit

History

Repository files navigation

Impresso Datalab Notebooks

About

Contents

Notebooks

Getting Started

Explore and Visualise your Impresso data

Annotate your Documents with Impresso Models

About Impresso

Impresso project

Copyright

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 12

Uh oh!

Languages

Packages