Skip to content

mindds/GPT-CI-Staging

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 

Repository files navigation

GPT-CI-Staging

This repository provides code for the study titled "A GPT-4o-powered framework for identifying cognitive impairment stages in electronic health records" which uses GPT-4o to automate the extraction and interpretation of cognitive information from electronic health records (EHRs). The framework was evaluated across two key clinical tasks: Cognitive Impairment (CI) stage classification and Clinical Dementia Rating (CDR) scoring.

Study Overview

In this study, we introduce a GPT-4o-powered framework for automating cognitive assessment from unstructured clinical notes. Our evaluation used two real-world datasets:

  1. CI Stage Classification
    We applied the framework to classify patients as Cognitively Unimpaired (CU), Mild Cognitive Impairment (MCI), or Dementia using a dataset of 1,002 Medicare fee-for-service patients from the Mass General Brigham (MGB) Healthcare Accountable Care Organization (ACO).
    GPT-4o’s performance was compared with several other frameworks to assess its language understanding capabilities and potential in clinical settings. To benchmark GPT-4o’s performance and assess its potential in clinical settings, we compared it with three alternative deep learning frameworks:

    1. USE Framework: Sentence-level keyword filtering, Universal Sentence Encoder (USE) embeddings, Recursive Feature Elimination (RFE), and XGBoost classifier.
    2. DementiaBERT Framework: Sentence-level keyword filtering, DementiaBERT embeddings (fine-tuned on dementia-related clinical language), and XGBoost classifier.
    3. Hybrid Framework: GPT-4o-generated summaries of chunked notes, followed by embedding via DementiaBERT and XGBoost classification.
  2. CDR Scoring
    We further evaluated GPT-4o on the task of assigning global Clinical Dementia Rating (CDR) scores using specialist notes from patients who visited the MGB memory clinic.

Beyond performance evaluation, we explored the design of an interactive AI agent that integrates the GPT-4o-powered framework to enable real-time interaction and decision support for cognitive diagnoses.

Repository Structure

  • ci_staging/
    Contains the full pipeline for CI stage classification, including GPT inference, evaluation and comparison across different frameworks.

    ├─ use_model/: USE framework: keyword-filtered sentences + USE + RFE + XGBoost

    ├─ dementia_bert/: DementiaBERT framework and Hybrid framework

  • cdr_scoring/
    Contains the pipeline for CDR score assignment, from preprocessing to prompting of GPT and downstream results analysis.

⚠️ Notes

  • No protected health information (PHI) is included in this repository. All code is shared for reproducibility and academic use.

Citation

Please cite this repository if you use the codes or models in your research:

@article{leng2025gptci,
  title     = {A GPT-4o-powered framework for identifying cognitive impairment stages in electronic health records},
  author    = {Leng, Yu and He, Yingnan and Amini, Samad and Magdamo, Colin and Paschalidis, Ioannis and Mukerji, Shibani S. and Moura, Lidia M. V. R. and Westover, M. Brandon and Vranceanu, Ana-Maria and Ritchie, Christine S. and Blacker, Deborah and Dickson, John R. and Das, Sudeshna},
  journal   = {npj Digital Medicine},
  volume    = {8},
  number    = {1},
  pages     = {401},
  year      = {2025},
  publisher = {Nature Publishing Group},
  doi       = {10.1038/s41746-025-01834-5},
  pmid      = {40610683},
  pmcid     = {PMC12229571}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •