Skip to content

Code/analysis/simulations for robust and efficient causal inference from electronic health record based (observational) cohort studies with missing study eligibility criteria.

Notifications You must be signed in to change notification settings

lbenz730/semiparametric_missing_elig

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Robust Causal Inference for Point Exposures with Missing Eligibility Criteria

Benz, L., Mukherjee, R., Wang, R., Arterburn, D., Fischer, H., Lee, C., Shortreed, S.M., Haneuse, S., and Levis, A.W. "Robust Causal Inference for EHR-based Studies of Point Exposures with Missingness in Eligibility Criteria" Under Review (Pre-Print)

R Scripts (scripts/)

  • helpers.R: File of helper functions

Simulations (simulations/aligned_t0)

Folder of scripts for a setting where time zero ($t_0$) is aligned for all subjects so we only need to consider eligibility, missingness, etc. at a single time per subject, and matching is not needed (insofar as it is a mechanism for establishing time zero). Contains an implementation of $\widehat\theta_\text{EIF}, \widehat\theta_\text{IF}$ and $\widetilde\theta_\text{IF}$, $\widehat\theta_\text{CC}$ and $\widehat\theta_\text{IWOR}$

  • estimators.R: Implementation of estimators
  • generate_data.R: Functions to generate simulated data
  • inform_sims.R: Exploratory analysis to guide range of models for consideration in simulated datasets
  • run_simulation.R: Main simulation wrapper.
  • pnp.R: $\mu_0$ calibration between parametric and non-parametric models (Generates Figure S1)
  • specify_inputs.R: Script which specifies simulation parameters and estimators for consideration
  • latex_tables.R: Generates all tables for describing simulation results and parameters.

Data (data/)

Folder of scripts used to clean and process EHR data for use in data application

  • rygb_vsg_data_prep.R: Prep analysis dataset(s) for data application of the aligned time zero case (RYGB vs. VSG).
  • surgical_px_cleaning.R Clean some chart review for surgical procedure types

Analysis (analysis/aligned_t0)

Folder of scripts used for data application analysis

  • diabetes_figure.R: Plot of diabetes figure showing frequency of certain measurements for select surgical patients (Generates Figure 1)
  • elig_figures.R: Plot eligibility distributions (Generates Figure 2 and S2)
  • plot_nuisances.R: Plotting code for distributions of nuisance functions (Generates Figures 3, S3, and S4)
  • plot_results.R: Plot point estimates and 95% confidence intervals (Generates Figure 4)

Weight Change Analysis

  • fit_CC_outcome_regression_estimator.R: Naive ATT analysis ( $\hat\theta_\text{CC}$) for weight change outcome
  • fit_iwor_estimator.R: IWOR ATT analysis (with $\hat\theta_\text{IWOR}$) for weight change outcome
  • fit_IF_estimator.R: IF ATT analysis (with $\hat\theta_\text{IF}$) for weight change outcome
  • fit_EIF_estimator.R: EIF ATT analysis (with $\hat\theta_\text{EIF}$) for weight change outcome

T2DM Remission Analysis

  • fit_CC_outcome_remission: Naive ATT analysis ( $\hat\theta_\text{CC}$) for diabetes remission outcome
  • fit_iwor_estimato_remission.R: IWOR ATT analysis (with $\hat\theta_\text{IWOR}$) for diabetes remission outcome
  • fit_IF_estimator_remission.R: IF ATT analysis (with $\hat\theta_\text{IF}$) for diabetes remission outcome
  • fit_EIF_estimator_remission.R: EIF ATT analysis (with $\hat\theta_\text{EIF}$) for diabetes remission outcome

Data (data/)

  • Simulation inputs + results
  • Data application results

Figures (figures/)

Figures saved out from various analyses

Figures (tables/)

Tables saved out from various analyses

Jobs (jobs/)

.sh files for batch jobs on the cluster

  • aligned_t0_sims_loop.sh: SBATCH job file for running simulations for comnination of estimator/simulation parameters
  • run_aligned_t0_loop.sh: Wrapper for fully 2-D job array for aligned_t0_sims_loop.sh .
  • data_application.sh: Wrapper for submitting all the jobs for the data application, in the application/ sub-directory.

About

Code/analysis/simulations for robust and efficient causal inference from electronic health record based (observational) cohort studies with missing study eligibility criteria.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published