TY - JOUR
T1 - A fast Monte Carlo expectation-maximization algorithm for estimation in latent class model analysis with an application to assess diagnostic accuracy for cervical neoplasia in women with atypical glandular cells
AU - Kang, Le
AU - Carter, Randy
AU - Darcy, Kathleen
AU - Kauderer, James
AU - Liao, Shu Yuan
N1 - Funding Information:
This study was supported by National Cancer Institute grants to the Gynecologic Oncology Group (GOG) Administrative Office and the GOG Tissue Bank (CA 27469), and the GOG Statistical and Data Center (CA 37517). Le Kang was supported in part by an appointment to the Research Participation Program at the Center for Devices and Radiological Health administered by the Oak Ridge Institute for Science and Education through an interagency agreement between the US Department of Energy and the US Food and Drug Administration. The authors would like to thank the anonymous referees for helpful discussions and comments that greatly improved the article. A supplementary R code to implement the fast MCEM algorithm described in this article and the real data example with patient ID information removed are available upon request.
PY - 2013/12
Y1 - 2013/12
N2 - In this article, we use a latent class model (LCM) with prevalence modeled as a function of covariates to assess diagnostic test accuracy in situations where the true disease status is not observed, but observations on three or more conditionally independent diagnostic tests are available. A fast Monte Carlo expectation-maximization (MCEM) algorithm with binary (disease) diagnostic data is implemented to estimate parameters of interest; namely, sensitivity, specificity, and prevalence of the disease as a function of covariates. To obtain standard errors for confidence interval construction of estimated parameters, the missing information principle is applied to adjust information matrix estimates. We compare the adjusted information matrix-based standard error estimates with the bootstrap standard error estimates both obtained using the fast MCEM algorithm through an extensive Monte Carlo study. Simulation demonstrates that the adjusted information matrix approach estimates the standard error similarly with the bootstrap methods under certain scenarios. The bootstrap percentile intervals have satisfactory coverage probabilities. We then apply the LCM analysis to a real data set of 122 subjects from a Gynecologic Oncology Group study of significant cervical lesion diagnosis in women with atypical glandular cells of undetermined significance to compare the diagnostic accuracy of a histology-based evaluation, a carbonic anhydrase-IX biomarker-based test and a human papillomavirus DNA test.
AB - In this article, we use a latent class model (LCM) with prevalence modeled as a function of covariates to assess diagnostic test accuracy in situations where the true disease status is not observed, but observations on three or more conditionally independent diagnostic tests are available. A fast Monte Carlo expectation-maximization (MCEM) algorithm with binary (disease) diagnostic data is implemented to estimate parameters of interest; namely, sensitivity, specificity, and prevalence of the disease as a function of covariates. To obtain standard errors for confidence interval construction of estimated parameters, the missing information principle is applied to adjust information matrix estimates. We compare the adjusted information matrix-based standard error estimates with the bootstrap standard error estimates both obtained using the fast MCEM algorithm through an extensive Monte Carlo study. Simulation demonstrates that the adjusted information matrix approach estimates the standard error similarly with the bootstrap methods under certain scenarios. The bootstrap percentile intervals have satisfactory coverage probabilities. We then apply the LCM analysis to a real data set of 122 subjects from a Gynecologic Oncology Group study of significant cervical lesion diagnosis in women with atypical glandular cells of undetermined significance to compare the diagnostic accuracy of a histology-based evaluation, a carbonic anhydrase-IX biomarker-based test and a human papillomavirus DNA test.
KW - MCEM estimation
KW - adjusted information matrix
KW - bootstrap standard errors
KW - diagnostic accuracy
KW - imperfect gold standard
KW - latent class model
UR - http://www.scopus.com/inward/record.url?scp=84885374698&partnerID=8YFLogxK
U2 - 10.1080/02664763.2013.825704
DO - 10.1080/02664763.2013.825704
M3 - Article
AN - SCOPUS:84885374698
SN - 0266-4763
VL - 40
SP - 2699
EP - 2719
JO - Journal of Applied Statistics
JF - Journal of Applied Statistics
IS - 12
ER -