TY - JOUR
T1 - Factors influencing estimates of HIV-1 infection timing using BEAST
AU - the RV217 Study Team
AU - Dearlove, Bethany
AU - Tovanabutra, Sodsai
AU - Owen, Christopher L.
AU - Lewitus, Eric
AU - LiID, Yifan
AU - Sanders-Buell, Eric
AU - Bose, Meera
AU - O'Sullivan, Anne Marie
AU - Kijak, Gustavo
AU - Miller, Shana
AU - Poltavee, Kultida
AU - Lee, Jenica
AU - Bonar, Lydia
AU - Harbolick, Elizabeth
AU - Ahani, Bahar
AU - Pham, Phuc
AU - Kibuuka, Hannah
AU - Maganga, Lucas
AU - Nitayaphan, Sorachai
AU - Sawe, Fred K.
AU - Kim, Jerome H.
AU - Eller, Leigh Anne
AU - Vasan, Sandhya
AU - Gramzinski, Robert
AU - Michael, Nelson L.
AU - Robb, Merlin L.
AU - Rolland, Morgane
AU - Ananworanich, Jintanat
AU - Peel, Sheila
AU - Jagodozinski, Linda
AU - Malia, Jennifer
AU - Manak, Mark
AU - Milazzo, Mark
AU - Li, Qun
AU - Schech, Steve
AU - Spitz, Julie Dorsey
AU - Dawson, Peter
AU - Sekiziyivu, Prossy
AU - Kiweewa, Francis
AU - Millard, Monica
AU - Shaffer, Doug N.
AU - Kosgei, Josphat
AU - Oundo, Joseph
AU - Ntinginya, Nyanda
AU - Lueer, Cornelia
AU - Kisinda, Abisai
AU - Kroidl, Inge
AU - Hoelscher, Michael
AU - Kroidl, Arne
AU - O'Connell, Robert J.
N1 - Publisher Copyright:
© 2021 Public Library of Science. All rights reserved.
PY - 2021/2/1
Y1 - 2021/2/1
N2 - While large datasets of HIV-1 sequences are increasingly being generated, many studies rely on a single gene or fragment of the genome and few comparative studies across genes have been done. We performed genome-based and gene-specific Bayesian phylogenetic analyses to investigate how certain factors impact estimates of the infection dates in an acute HIV-1 infection cohort, RV217. In this cohort, HIV-1 diagnosis corresponded to the first RNA positive test and occurred a median of four days after the last negative test, allowing us to compare timing estimates using BEAST to a narrow window of infection. We analyzed HIV-1 sequences sampled one week, one month and six months after HIV-1 diagnosis in 39 individuals. We found that shared diversity and temporal signal was limited in acute infection, and insufficient to allow timing inferences in the shortest HIV-1 genes, thus dated phylogenies were primarily analyzed for env, gag, pol and near full-length genomes. There was no one best-fitting model across participants and genes, though relaxed molecular clocks (73% of best-fitting models) and the Bayesian skyline (49%) tended to be favored. For infections with single founders, the infection date was estimated to be around one week pre-diagnosis for env (IQR: 3-9 days) and gag (IQR: 5-9 days), whilst the genome placed it at a median of 10 days (IQR: 4-19). Multiply-founded infections proved problematic to date. Our ability to compare timing inferences to precise estimates of HIV-1 infection (within a week) highlights that molecular dating methods can be applied to withinhost datasets from early infection. Nonetheless, our results also suggest caution when using uniform clock and population models or short genes with limited information content.
AB - While large datasets of HIV-1 sequences are increasingly being generated, many studies rely on a single gene or fragment of the genome and few comparative studies across genes have been done. We performed genome-based and gene-specific Bayesian phylogenetic analyses to investigate how certain factors impact estimates of the infection dates in an acute HIV-1 infection cohort, RV217. In this cohort, HIV-1 diagnosis corresponded to the first RNA positive test and occurred a median of four days after the last negative test, allowing us to compare timing estimates using BEAST to a narrow window of infection. We analyzed HIV-1 sequences sampled one week, one month and six months after HIV-1 diagnosis in 39 individuals. We found that shared diversity and temporal signal was limited in acute infection, and insufficient to allow timing inferences in the shortest HIV-1 genes, thus dated phylogenies were primarily analyzed for env, gag, pol and near full-length genomes. There was no one best-fitting model across participants and genes, though relaxed molecular clocks (73% of best-fitting models) and the Bayesian skyline (49%) tended to be favored. For infections with single founders, the infection date was estimated to be around one week pre-diagnosis for env (IQR: 3-9 days) and gag (IQR: 5-9 days), whilst the genome placed it at a median of 10 days (IQR: 4-19). Multiply-founded infections proved problematic to date. Our ability to compare timing inferences to precise estimates of HIV-1 infection (within a week) highlights that molecular dating methods can be applied to withinhost datasets from early infection. Nonetheless, our results also suggest caution when using uniform clock and population models or short genes with limited information content.
UR - http://www.scopus.com/inward/record.url?scp=85101390524&partnerID=8YFLogxK
U2 - 10.1371/JOURNAL.PCBI.1008537
DO - 10.1371/JOURNAL.PCBI.1008537
M3 - Article
C2 - 33524022
AN - SCOPUS:85101390524
SN - 1553-734X
VL - 17
JO - PLoS Computational Biology
JF - PLoS Computational Biology
IS - 2
M1 - e1008537
ER -