Privacy-preserving record linkage for Tufts CTSI and collaborators
- Ensure that you have Python with pip installed.
- Clone the repo:
git clone https://github.com/TuftsCTSI/pprl - Open the new directory:
cd pprl - Run the setup script:
./setup.sh - Verify your setup by running the test suite:
./runtests.sh
-
Add the patient identifiers CSV file to
user_files. -
Add the secret file to
user_files. -
Update
create_CLKs.yml -
Run
./create_CLKs.py. The linkages CSV will be added touser_files. -
Distribute the linkages.
-
Delete any sensitive files from
user_files.
-
Add the patient identifiers CSV files to
user_files. -
Update
match_CLKs.yml -
Run
./match_CLKs.py -
Distribute the matches.
-
Delete any sensitive files from
user_files.