TY - JOUR
T1 - Combining viral genetics and statistical modeling to improve HIV-1 time-of-infection estimation towards enhanced vaccine efficacy assessment
AU - Rossenkhan, Raabya
AU - Rolland, Morgane
AU - Labuschagne, Jan P.L.
AU - Ferreira, Roux Cil
AU - Magaret, Craig A.
AU - Carpp, Lindsay N.
AU - Matsen, Frederick A.
AU - Huang, Yunda
AU - Rudnicki, Erika E.
AU - Zhang, Yuanyuan
AU - Ndabambi, Nonkululeko
AU - Logan, Murray
AU - Holzman, Ted
AU - Abrahams, Melissa Rose
AU - Anthony, Colin
AU - Tovanabutra, Sodsai
AU - Warth, Christopher
AU - Botha, Gordon
AU - Matten, David
AU - Nitayaphan, Sorachai
AU - Kibuuka, Hannah
AU - Sawe, Fred K.
AU - Chopera, Denis
AU - Eller, Leigh Anne
AU - Travers, Simon
AU - Robb, Merlin L.
AU - Williamson, Carolyn
AU - Gilbert, Peter B.
AU - Edlefsen, Paul T.
N1 - Publisher Copyright:
© 2019 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2019/7/1
Y1 - 2019/7/1
N2 - Knowledge of the time of HIV-1 infection and the multiplicity of viruses that establish HIV-1 infection is crucial for the in-depth analysis of clinical prevention efficacy trial outcomes. Better estimation methods would improve the ability to characterize immunological and genetic sequence correlates of efficacy within preventive efficacy trials of HIV-1 vaccines and monoclonal antibodies. We developed new methods for infection timing and multiplicity estimation using maximum likelihood estimators that shift and scale (calibrate) estimates by fitting true infection times and founder virus multiplicities to a linear regression model with independent variables defined by data on HIV-1 sequences, viral load, diagnostics, and sequence alignment statistics. Using Poisson models of measured mutation counts and phylogenetic trees, we analyzed longitudinal HIV-1 sequence data together with diagnostic and viral load data from the RV217 and CAPRISA 002 acute HIV-1 infection cohort studies. We used leave-one-out cross validation to evaluate the prediction error of these calibrated estimators versus that of existing estimators and found that both infection time and founder multiplicity can be estimated with improved accuracy and precision by calibration. Calibration considerably improved all estimators of time since HIV-1 infection, in terms of reducing bias to near zero and reducing root mean squared error (RMSE) to 5–10 days for sequences collected 1–2 months after infection. The calibration of multiplicity assessments yielded strong improvements with accurate predictions (ROC-AUC above 0.85) in all cases. These results have not yet been validated on external data, and the best-fitting models are likely to be less robust than simpler models to variation in sequencing conditions. For all evaluated models, these results demonstrate the value of calibration for improved estimation of founder multiplicity and of time since HIV-1 infection.
AB - Knowledge of the time of HIV-1 infection and the multiplicity of viruses that establish HIV-1 infection is crucial for the in-depth analysis of clinical prevention efficacy trial outcomes. Better estimation methods would improve the ability to characterize immunological and genetic sequence correlates of efficacy within preventive efficacy trials of HIV-1 vaccines and monoclonal antibodies. We developed new methods for infection timing and multiplicity estimation using maximum likelihood estimators that shift and scale (calibrate) estimates by fitting true infection times and founder virus multiplicities to a linear regression model with independent variables defined by data on HIV-1 sequences, viral load, diagnostics, and sequence alignment statistics. Using Poisson models of measured mutation counts and phylogenetic trees, we analyzed longitudinal HIV-1 sequence data together with diagnostic and viral load data from the RV217 and CAPRISA 002 acute HIV-1 infection cohort studies. We used leave-one-out cross validation to evaluate the prediction error of these calibrated estimators versus that of existing estimators and found that both infection time and founder multiplicity can be estimated with improved accuracy and precision by calibration. Calibration considerably improved all estimators of time since HIV-1 infection, in terms of reducing bias to near zero and reducing root mean squared error (RMSE) to 5–10 days for sequences collected 1–2 months after infection. The calibration of multiplicity assessments yielded strong improvements with accurate predictions (ROC-AUC above 0.85) in all cases. These results have not yet been validated on external data, and the best-fitting models are likely to be less robust than simpler models to variation in sequencing conditions. For all evaluated models, these results demonstrate the value of calibration for improved estimation of founder multiplicity and of time since HIV-1 infection.
KW - Acute and early HIV-1 infection
KW - Founder multiplicity
KW - HIV-1
KW - HIV-1 primary infection
KW - Infection time
KW - Leave-one-out-cross-validation (LOOCV)
KW - Sequence analysis
KW - Vaccine efficacy assessment
UR - http://www.scopus.com/inward/record.url?scp=85068656662&partnerID=8YFLogxK
U2 - 10.3390/v11070607
DO - 10.3390/v11070607
M3 - Article
C2 - 31277299
AN - SCOPUS:85068656662
SN - 1999-4915
VL - 11
JO - Viruses
JF - Viruses
IS - 7
M1 - 607
ER -