-
Notifications
You must be signed in to change notification settings - Fork 21
ctakes smoking status
This is version 1.2 of the cTAKES smoking status annotator. The pipeline has been tested on flat text files and CDA documents. The sample provided uses a bar (|) delimited file with multiple records (and patients) per file. The dictionary lookup annotator is limited to smoking status dictionaries provided in the '/resources/ss/data/*.dictionary' files.
The smoking status pipeline processes patient records into five pre-determined categories - past smoker (P), current smoker (C), smoker (S), non-smoker (N), and unknown (U). The definition of smoking status was adapted from I2B2 Natural Language Processing Challenges for Clinical Records.
- PAST SMOKER (P): A patient whose record asserts either that they are a past smoker or that they were a smoker a year or more ago but who have not smoked for at least one year.
- CURRENT SMOKER (C): A patient whose record asserts that they are a current smoker (or that they smoked without indicating that they stopped more than a year ago) or that they were a smoker within the past year.
- SMOKER (S): A patient who is either a CURRENT or a PAST smoker but, whose medical record does not provide enough information to classify the patient as either a CURRENT or a PAST smoker.
- NON-SMOKER (N): A patient whose record indicates that they have never smoked.
- UNKNOWN (U): The patient's record does not mention anything about smoking.
Uses SVM for smoking status classification.
Source class: PcsClassifier
Source package: org.apache.ctakes.smokingstatus.ae
Parent class: org.apache.uima.fit.component.JCasAnnotator_ImplBase
| Parameter | Description | Class | Required | Default |
|---|---|---|---|---|
| KeyWordsPath | Path to file containing key words. | String | Yes | |
| ModelPath | Path to file containing the model. | String | Yes | |
| StopWordsPath | Path to file containing stop words. | String | Yes | |
| CaseSensitive | yes/no for case sensitivity. | String | No | yes |
- Piper File Submitter
- UMLS Package Fetcher
- Dictionary Creator
- Simple Pipeline Fabricator
- Pipeline Installation Facility
- ctakes-pbj module
- Getting started with PBJ
- Python pbj-component
- Python pbj-pipeline
- Python pbj-scripts
- Python pbj-tools
- pbj-user-pipeline
- examples
- ctakes-assertion
- ctakes-assertion-zoner
- ctakes-chunker
- ctakes-clinical-pipeline
- ctakes-constituency-parser
- ctakes-context-tokenizer
- ctakes-core
- ctakes-coreference
- ctakes-dependency-parser
- ctakes-dictionary-lookup
- ctakes-dictionary-lookup-fast
- ctakes-distribution
- ctakes-dockhand
- ctakes-drug-ner
- ctakes-examples
- ctakes-fhir
- ctakes-gui
- ctakes-lvg
- ctakes-mastif-zoner
- ctakes-ne-contexts
- ctakes-pbj
- ctakes-pos-tagger
- ctakes-preprocessor
- ctakes-regression-test
- ctakes-relation-extractor
- ctakes-side-effect
- ctakes-smoking-status
- ctakes-template-filler
- ctakes-temporal
- ctakes-tiny-rest
- ctakes-type-system
- ctakes-utils
- ctakes-web-rest
- ctakes-ytex
- ctakes-ytex-uima
- ctakes-ytex-web