CIMEHR - Gaussian Clinically Informative Visiting and Observation
Processes in Electronic Health Record (EHR) Data
Fits semiparametric joint models for longitudinal
electronic health record (EHR) data that addresses two-stage
hierarchical missingness mechanism. The first stage is the
visiting process, and the second stage is the observation
process. The core CIMEHR method (Clinical Informative
Missingness for Electronic Health Records) uses a three-stage
procedure: partial likelihood with log-normal frailty for visit
intensity, probit regression with shared latent factor-linked
random effects for observation, and weighted least squares with
risk-set centering for the outcome. These three stages are
connected through a shared latent factor that induces
dependence across all three processes. A data simulator and
implementations of common benchmark methods (linear mixed
models, multiple imputation, and others) are included for
comparative studies. Detailed methods are described in Yang,
Shi, and Mukherjee (2026) <doi:10.48550/arXiv.2602.15374>.