Ontology Matching with Knowledge Retrieval and Large Language Models (ISWC 2025)
We provide the minimum environment requirements needed to run our project; exact package versions may differ slightly depending on how each system resolves dependencies automatically.
Should you be interested in reproducing a specific method, please look up the corresponding requirements file and install the listed packages accordingly.
pip install --upgrade pip
pip install -r requirements.txt
Currently, our method features six datasets (Mouse-Human, NCIT-DOID, Nell-DBpedia, YAGO-Wikidata, ENVO-SWEET, MI-MatOnto). Please place the downloaded datasets under `experiments/dataset/`.
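For reference, a possible layout after downloading is sketched below; the per-dataset folder names are illustrative and may differ from the archives you obtain.

```
experiments/dataset/
├── mouse_human/
├── ncit_doid/
├── nell_dbpedia/
├── yago_wikidata/
├── envo_sweet/
└── mi_matonto/
```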
Our paper features models served through TogetherAI's API, so please supply your TogetherAI access token in the `.env` file.
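A minimal `.env` could look like the sketch below; the variable name is an assumption, so check the code for the exact key it reads.

```
# TogetherAI access token (variable name assumed; adjust to what the code expects)
TOGETHER_API_KEY=your_togetherai_token_here
```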
We supply a sample script, `scripts/run_envo_sweet.sh`, that runs an experiment on the ENVO-SWEET track with Llama-3.3-70B. The script saves results dynamically, so the result files are automatically updated with the newest predicted pairs.
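For example, from the repository root:

```bash
bash scripts/run_envo_sweet.sh
```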
The final results can be found under two folders. For example, after running the ENVO-SWEET experiment, the accepted alignments are under `results/baseline/envo_sweet` and the mappings that require expert review are under `reviews/baseline/envo_sweet`.
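Once the run finishes, you can inspect both folders, for instance:

```bash
ls results/baseline/envo_sweet   # accepted alignments
ls reviews/baseline/envo_sweet   # mappings flagged for expert review
```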
Should you want to add a new evaluation, consider adding an `experiments/configs/method/<llm>` folder for the LLM you want to evaluate, and supply the corresponding `.jsonl` files for the datasets and configurations you want to evaluate on. For a new dataset, add a configuration file `experiments/configs/dataset/<dataset>.json` that points to the dataset you wish to add.
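As a rough sketch, a dataset configuration might look like the following; the field names and file paths here are hypothetical, so mirror the structure of the existing files under `experiments/configs/dataset/` rather than copying this verbatim.

```json
{
  "name": "<dataset>",
  "source_ontology": "experiments/datasets/OAEI/<track>/<dataset>/source.owl",
  "target_ontology": "experiments/datasets/OAEI/<track>/<dataset>/target.owl",
  "reference_alignment": "experiments/datasets/OAEI/<track>/<dataset>/reference.tsv"
}
```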
The OAEI datasets will be stored at `experiments/datasets/OAEI/<track>/<dataset>`. If you want to run the experiments multiple times, we advise caching the query results and providing a path to the cache in `experiments/configs/dictionary.json`.
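A minimal `experiments/configs/dictionary.json` could simply point to a cache file; the structure below is an assumption, so adapt it to whatever the loader expects.

```json
{
  "cache_path": "experiments/cache/query_results.json"
}
```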
KROMA is currently undergoing maintenance to resolve dependency issues and ensure a seamless release. The final version will be available soon.
Should you refer to this work or find our codebase useful, please consider citing it as:
@inproceedings{nguyen_2025_kroma,
title={KROMA: Knowledge Retrieval Ontology Matching using Large Language Models},
author={Lam Nguyen and Erika Barcelos and Roger French and Yinghui Wu},
booktitle={Proceedings of the 24th International Semantic Web Conference Research Track},
year={2025},
}