QColors: An algorithm for conservative viral quasispecies reconstruction from short and non-contiguous next generation sequencing reads

Huang, Austin; Kantor, Rami; DeLong, Allison; Schreier, Leeann; Istrail, Sorin

doi:10.3233/ISB-2012-0454

QColors: An algorithm for conservative viral quasispecies reconstruction from short and non-contiguous next generation sequencing reads

Issue title: Workshop on Computational Advances in Molecular Epidemiology, Atlanta, GA, USA, November 12, 2011

Article type: Research Article

Authors: Huang, Austin^{; ;} | Kantor, Rami^; | DeLong, Allison | Schreier, Leeann | Istrail, Sorin^;

Affiliations: Division of Infectious Disease, Brown University, Providence, RI, USA | Center for Computational Molecular Biology, Brown University, Providence, RI, USA | Center for Statistical Sciences, Brown University, Providence, RI, USA | Department of Computer Science, Brown University, Providence, RI, USA

Note: [] Corresponding author: Austin Huang, Division of Infectious Disease, Computer Science Department, Brown University, Box 1910, Providence, RI, 02912, USA. Tel.: +1 401 863 7719; Fax: +1 401 863 7657; E-mail: [email protected].

Abstract: Next generation sequencing technologies have recently been applied to characterize mutational spectra of the heterogeneous population of viral genotypes (known as a quasispecies) within HIV-infected patients. Such information is clinically relevant because minority genetic subpopulations of HIV within patients enable viral escape from selection pressures such as the immune response and antiretroviral therapy. However, methods for quasispecies sequence reconstruction from next generation sequencing reads are not yet widely used and remains an emerging area of research. Furthermore, the majority of research methodology in HIV has focused on 454 sequencing, while many next-generation sequencing platforms used in practice are limited to shorter read lengths relative to 454 sequencing. Little work has been done in determining how best to address the read length limitations of other platforms. The approach described here incorporates graph representations of both read differences and read overlap to conservatively determine the regions of the sequence with sufficient variability to separate quasispecies sequences. Within these tractable regions of quasispecies inference, we use constraint programming to solve for an optimal quasispecies subsequence determination via vertex coloring of the conflict graph, a representation which also lends itself to data with non-contiguous reads such as paired-end sequencing. We demonstrate the utility of the method by applying it to simulations based on actual intra-patient clonal HIV-1 sequencing data.

DOI: 10.3233/ISB-2012-0454

Journal: In Silico Biology, vol. 11, no. 5-6, pp. 193-201, 2012

Received 16 January 2012

Revision received 19 June 2012

Accepted 6 July 2012

Published: 2012

Price: EUR 27.50

North America

IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA

Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

Europe

IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands

Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]

For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]

Asia

Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China

Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

如果您在出版方面需要帮助或有任何建, 件至: [email protected]

Share this:

North America

Europe

Asia