LLM4Humanities is a Python-based toolkit and web app for assisting qualitative researchers in annotating textual data using Large Language Models (LLMs).
It provides an end-to-end workflow that combines manual coding, automated classification, and evaluation metrics, helping you decide whether LLMs can reliably support your annotation tasks.
- No coding skills required when using the web app
- Supports manual and automatic annotation of qualitative data
- Built-in evaluation metrics to compare human and model agreement
- Designed for multi-class and binary classification
- Compatible with OpenAI, Anthropic, Gemini and Azure API keys
Use the Manual Annotator web app to label a subset of your dataset if you need to. This serves as a reference to evaluate model performance.
You can choose between:

- **Web App (LLM4Humanities)**:
  - Configure prompts and LLM providers
  - Run classification scenarios on your labeled subset
  - Measure agreement between model and human coders

- **Step-by-Step Notebooks (Google Colab)**:
  - Guides you through the same workflow in code cells
  - Requires minimal coding knowledge; each step is clearly explained
  - Ideal if you prefer more control, customization, or wish to inspect intermediate results
  - Example datasets are provided
Several evaluation metrics are provided (in the app and in the notebooks) so you can make your own informed choice; a small usage sketch follows the list:

- **Cohen's Kappa**: Measures agreement between the LLM and human annotators.
- **Krippendorff's Alpha**: Suitable for ordinal or nominal labels; confidence intervals are computed via bootstrapping.
- **ALT-Test (arXiv:2501.10970)**: A robust non-inferiority test comparing the model to each annotator in turn. Requires at least 3 human annotators.
- **Classification Metrics**: Per-class breakdown of true/false positives, recall, and error rates.
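For orientation, here is a minimal sketch of how the first two metrics can be computed on a labeled subset outside the toolkit. It is illustrative only: the file name and column names (`human_label`, `llm_label`) are placeholders, and it assumes the `pandas`, `scikit-learn`, and `krippendorff` packages are installed.

```python
# Illustrative sketch only (not the toolkit's own evaluation code).
import pandas as pd
from sklearn.metrics import cohen_kappa_score
import krippendorff

df = pd.read_csv("my_labeled_subset.csv")  # placeholder file name

# Cohen's Kappa: agreement between the LLM and one human annotator
kappa = cohen_kappa_score(df["human_label"], df["llm_label"])
print(f"Cohen's Kappa (LLM vs. human): {kappa:.3f}")

# Krippendorff's Alpha: one row of reliability data per coder
reliability_data = [df["human_label"].tolist(), df["llm_label"].tolist()]
alpha = krippendorff.alpha(
    reliability_data=reliability_data,
    level_of_measurement="nominal",
)
print(f"Krippendorff's Alpha: {alpha:.3f}")
```

The bootstrapped confidence intervals and the ALT-Test involve more machinery and are best run through the app or the notebooks.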
Your dataset should be in CSV or Excel format, with:
- One row per entry to classify
- One or more textual columns that will be shown to the model
- At least one unique identifier column
- To evaluate the LLM classification, at least one annotation column containing human labels (see the example below)
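For illustration, the snippet below builds a tiny dataset with that structure and saves it as CSV; the column names (`id`, `text`, `annotator_1`) are arbitrary examples, not required names.

```python
# Illustrative example of the expected dataset structure.
import pandas as pd

df = pd.DataFrame(
    {
        "id": [1, 2, 3],  # unique identifier column
        "text": [  # textual column shown to the model
            "I found the exercise confusing at first.",
            "The feedback helped me correct my mistake.",
            "I am not sure what the question was asking.",
        ],
        "annotator_1": [0, 1, 0],  # human labels, needed to evaluate the LLM
    }
)
df.to_csv("my_dataset.csv", index=False)
```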
Click the link below to run the manual annotator:
Click the link below to run the web app:
Click the badge below to run the notebooks directly in Google Colab:
Binary classification notebook:
Multiclass classification notebook:
Complex classification notebook:
If you prefer to run the analysis directly on your machine, follow these installation steps.
- Clone the repository:

```bash
git clone https://github.com/OlivierLClerc/qualitative_analysis_project.git
cd qualitative_analysis_project
```
- Create a virtual environment:

```bash
conda create -n qualitative_analysis python=3.10
conda activate qualitative_analysis
```
- Install the required packages:

```bash
pip install -r requirements.txt
```
- Set up your API credentials: copy or rename `.env.example` to `.env`, then populate it with your LLM credentials (OpenAI, Azure, or Together keys and endpoints).
Example:

```bash
# OpenAI
OPENAI_API_KEY=your_openai_key_here

# Azure environment variables
AZURE_API_KEY=your_azure_api_key_here
AZURE_OPENAI_ENDPOINT=your_azure_endpoint_here
AZURE_API_VERSION=your_azure_api_version_here

# Together AI Configuration
TOGETHERAI_API_KEY=your_togetherai_api_key_here
```
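If you want to check that your credentials are picked up, a minimal sketch like the following can help; it assumes the `python-dotenv` package is available and that you run it from the project root where `.env` lives.

```python
# Minimal sanity check for .env credentials (illustrative only).
import os

from dotenv import load_dotenv

load_dotenv()  # reads key=value pairs from .env into the environment

for var in ("OPENAI_API_KEY", "AZURE_API_KEY", "TOGETHERAI_API_KEY"):
    status = "set" if os.getenv(var) else "missing"
    print(f"{var}: {status}")
```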
- Run the apps or notebooks.

To run the Manual Annotator, use:

```bash
streamlit run manual_annotator_stream.py
```

To run the LLM4Humanities app, use:

```bash
streamlit run app.py
```
```
qualitative_analysis_project/
├── codebook/
│ ├── binary_codebook.txt
│ └── multiclass_codebook.txt
├── data/
│ ├── binary_user_case/
│ ├── complex_user_case/
│ │ ├── complex_data.json
│ │ └── complex_data.xlsx
│ ├── multiclass_sample.csv
│ ├── multiclass_user_case/
│ │ └── multiclass_data.csv
│ └── outputs/
├── manual_annotator/
│ ├── __init__.py
│ ├── annotation_filter.py
│ ├── annotator_setup.py
│ ├── app_core.py
│ ├── codebook_upload.py
│ ├── column_selection.py
│ ├── data_download.py
│ ├── data_upload.py
│ ├── label_definition.py
│ └── row_annotation.py
├── notebooks/
│ ├── notebook_binary_colab.ipynb
│ ├── notebook_multiclass_colab.ipynb
│ └── notebook_sequential_binary_colab.ipynb
├── qualitative_analysis/
│ ├── __init__.py
│ ├── alt_test.py
│ ├── config.py
│ ├── cost_estimation.py
│ ├── data_processing.py
│ ├── evaluation.py
│ ├── logging.py
│ ├── model_interaction.py
│ ├── notebooks_functions.py
│ ├── parsing.py
│ ├── prompt_construction.py
│ └── prompt_engineering.py
├── streamlit_app/
│ ├── __init__.py
│ ├── analysis.py
│ ├── app_core.py
│ ├── codebook_management.py
│ ├── column_selection.py
│ ├── data_upload.py
│ ├── evaluation.py
│ ├── field_selection.py
│ ├── llm_configuration.py
│ └── session_management.py
├── .env.example
├── .gitignore
├── .pre-commit-config.yaml
├── app.py
├── manual_annotator_stream.py
├── README.md
└── requirements.txt
```
- `app.py` – The main entry point for the LLM4Humanities app. It imports the modularized app from the `streamlit_app` package.
- `manual_annotator_stream.py` – The main entry point for the Manual Annotator app. It imports the `ManualAnnotatorApp` class and sets up the Streamlit interface.
- `manual_annotator/` – Contains the modules for the Manual Annotator app.
- `streamlit_app/` – Contains the modules for the LLM4Humanities app.
- `notebooks/` – Contains Jupyter notebooks demonstrating user cases for:
  - Binary classification
  - Multiclass classification
  - Complex classification
- `data/` – Holds sample data files for the different classification scenarios, organized into user case directories, plus an `outputs/` subfolder where processed results can be saved.
- `qualitative_analysis/` – The main Python package containing modules used by the apps and the notebooks for:
  - Configuration management
  - Data processing
  - Model interaction
  - Prompt construction and engineering
  - Evaluation metrics
  - Cost estimation
  - Logging
- `codebook/` – Contains text files defining classification rules or codebooks for the user cases.
- `.env.example` – Template for environment variables needed for API credentials.
- `.pre-commit-config.yaml` – Configuration for pre-commit hooks (linting, formatting, etc.).
- `.gitignore` – Specifies files to be ignored by Git.
- `requirements.txt` – Lists all Python dependencies required for the project.