Skip to content

AMCeScience/active-learning-pub

Repository files navigation

Active learning miner

Developed and tested on Windows 10 inside a venv environment using Python 3.7.7 and pip 19.2.3

Setup

Setup your environment by installing the requirements using pip.

pip install -r requirements.txt
  1. Copy the config.example file to config.py.
cp config.example config.py
  1. Run fetch_raw.py to retrieve the data for the PubMed articles defined in the qrel files from the 2017 CLEF eHealth Lab.
python fetch_raw.py
  1. Run insert_docs.py to insert the fetched article data from PubMed into a local database.
python insert_docs.py
  1. Run fetch_validity.py to check the database against the original qrel files.
python fetch_validity.py
  1. Run clean_docs.py to preprocess the articles and store them as a feature matrix.
python clean_docs.py

Experiments

  1. Run run_experiments.py to determine the performance for baseline (using all data) and selected datasets.
python run_experiments.py
  1. Run result_analysis.py to create plots and significance tests.
python result_analysis.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages