TY - JOUR
T1 - Combining markers with and without the limit of detection
AU - Dong, Ting
AU - Liu, Catherine Chunling
AU - Petricoin, Emanuel F.
AU - Tang, Liansheng Larry
PY - 2014/4/15
Y1 - 2014/4/15
N2 - In this paper, we consider the combination of markers with and without the limit of detection (LOD). LOD is often encountered when measuring proteomic markers. Because of the limited detecting ability of an equipment or instrument, it is difficult to measure markers at a relatively low level. Suppose that after some monotonic transformation, the marker values approximately follow multivariate normal distributions. We propose to estimate distribution parameters while taking the LOD into account, and then combine markers using the results from the linear discriminant analysis. Our simulation results show that the ROC curve parameter estimates generated from the proposed method are much closer to the truth than simply using the linear discriminant analysis to combine markers without considering the LOD. In addition, we propose a procedure to select and combine a subset of markers when many candidate markers are available. The procedure based on the correlation among markers is different from a common understanding that a subset of the most accurate markers should be selected for the combination. The simulation studies show that the accuracy of a combined marker can be largely impacted by the correlation of marker measurements. Our methods are applied to a protein pathway dataset to combine proteomic biomarkers to distinguish cancer patients from non-cancer patients.
AB - In this paper, we consider the combination of markers with and without the limit of detection (LOD). LOD is often encountered when measuring proteomic markers. Because of the limited detecting ability of an equipment or instrument, it is difficult to measure markers at a relatively low level. Suppose that after some monotonic transformation, the marker values approximately follow multivariate normal distributions. We propose to estimate distribution parameters while taking the LOD into account, and then combine markers using the results from the linear discriminant analysis. Our simulation results show that the ROC curve parameter estimates generated from the proposed method are much closer to the truth than simply using the linear discriminant analysis to combine markers without considering the LOD. In addition, we propose a procedure to select and combine a subset of markers when many candidate markers are available. The procedure based on the correlation among markers is different from a common understanding that a subset of the most accurate markers should be selected for the combination. The simulation studies show that the accuracy of a combined marker can be largely impacted by the correlation of marker measurements. Our methods are applied to a protein pathway dataset to combine proteomic biomarkers to distinguish cancer patients from non-cancer patients.
KW - Diagnostic accuracy
KW - Limit of detection
KW - Linear discriminant analysis
KW - ROC curve
UR - http://www.scopus.com/inward/record.url?scp=84896707416&partnerID=8YFLogxK
U2 - 10.1002/sim.6027
DO - 10.1002/sim.6027
M3 - Article
C2 - 24132938
AN - SCOPUS:84896707416
SN - 0277-6715
VL - 33
SP - 1307
EP - 1320
JO - Statistics in Medicine
JF - Statistics in Medicine
IS - 8
ER -