NLPmatchPublic

This is a repository showing code, data and methodology used to match Europages products with the best related Ecoinvent product. There are 2 subfolders present, containing code and data:

Code:

_00_main.py: The main python script which ignites the process and runs all .py scripts in subsequent order.
_01_DPR_similarities.py: Reads the finetuned DPR model containing embeddings of products, and selects for each Europages product the top 5 candidates for best fitting Ecoinvent product based on a similarity score.
_02_fewshot_val.py: For each of the top 5 candidates for a Europages product, the Ecoinvent product matches are being validated by a finetuned GPT model, giving a positive or negative validation result.
_04_matched_full.py: After GPT validation, the matches are further filtered on Activity, Main Activity and Geography to make sure that the matches fit the best observation in Ecoinvent.
_05_sector_resolution.py: For companies of which no product was assigned en Ecoinvent product, the best fitting tilt_subsector still has to be found. This is done with a GPT prompt.

Data:

Input: This folder contains all datasets that serve as input for the matching process, but might also be used in other processes within the tilt pipeline.
Intermediate: Contains intermediate datasets that are created in the matching process, but are not part of the end result and are therefore stored in an intermediate folder.
Output: Contains output files that are result of the matching process, and are used in the next part of the tilt data pipeline.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
matching		matching
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NLPmatchPublic

About

Uh oh!

Releases

Packages

Languages

2DegreesInvesting/NLPmatchPublic

Folders and files

Latest commit

History

Repository files navigation

NLPmatchPublic

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages