Re: Homologous Recombination is Very Rare or Absent in Human Influenza A Virus
170 insufficient signal to infer phylogenetic incongruence. Since support for phylogenetic
195 Thus, laboratory error, such as template switching during amplification in a mixed or
221 influenza A virus evolution, the basis of our more frequent observation of mosaicism in A/H3N2
232 anonymous reviewers for helpful suggestions
147
RESULTS AND DISCUSSION
148
Two of the ten human influenza A/H3N2 virus data sets (PB2 and NP) analyzed here
149
contained sequences with statistically significant mosaic structure, as determined by 3SEQ, and
150
with putative recombinant sections that were each sufficiently long (>100 nt) that they could be
151
re-analyzed by phylogenetic recombination detection methods. Three of the remaining eight
152
A/H3N2 data sets (PA, NA, MP) and one of the A/H1N1 data sets (NA) also resulted in 3SEQ p153
values that revealed a strong signal of mosaicism, but in all these cases the inferred breakpoints
154
were either close to the gene segment?s endpoints, or very close to each other, making it
155
impossible to infer a credible phylogeny. The remaining five A/H3N2 data sets (PB1, HA,
156
HA413, NS, NA413 ? the 413-suffix meaning that it is the HA or NA data set containing 413
157
sequences) and seven of the A/H1N1 data sets (PB2, PB1, PA, HA, NP, MP, NS) did not
158
contain any statistically significant mosaic signals that survived a Dunn-?id?k correction in
159
3SEQ. Recombination analysis results are summarized in Table 1 for A/H3N2 and Table 2 for
160
A/H1N1. The two putative recombinant data sets are discussed in more detail below.
161
The H3N2 PB2 data set assembled here contained 912 distinct sequences, one of which
162
? A/New York/11/2003 ? statistically supported a mosaic structure with both mosaic regions
163
longer than 100nt. The two most likely parental sequences, identified as A/Hong Kong/14/1974
164
(major parent) and A/New York/424/1999 (minor parent), revealed a strong mosaic signal
165
(corrected p = 0.013) in relation to A/New York/11/2003. However, while the phylogenies
166
inferred for the minor (positions 202?2189, Figure 1a) and major (positions 1?201 and 2190?
167
2347, Figure 1b) segments revealed topological movement of the putative recombinant
168
sequence relative to the parental sequences, a general lack of phylogenetic resolution, reflected
169
in low levels of bootstrap support (particularly in the major segment), meant that there was
170 insufficient signal to infer phylogenetic incongruence. Since support for phylogenetic
incongruence is necessarily made up of two components ? the phylogenetic
171 relationship among
172
the parents and recombinant on the major and minor trees ? we call the signal ?weak? if one of
173
the components receives only low bootstrap support.
174
For the NP data set of A/H3N2 viruses, a single sequence ? A/Christchurch/14/2004 ?
175
supported a mosaic structure with both candidate recombinant regions longer than 100 nt. The
176
candidate parental sequences identified by 3SEQ were A/Beijing/1/1968 as the major parent
177
and A/New York/153/1999 as the minor parent (clonality among these three isolates is rejected
178
at corrected p = 0.032). The ML tree for the region 98-1454 is presented in Figure 2a while that
179
for regions 1-97 and 1455-1570 is shown in Figure 2b. In these phylogenies, the putative
180
recombinant sequence was clearly more closely related to a different parent in each sequence
181
region. The phylogenies also revealed sequence A/New York/381/2004 as a better candidate
182
for the minor parent than A/New York/153/1999; the mosaic signal when assuming A/New
183
York/381/2004 as the minor parent in the recombination event was still strong (corrected p =
184
0.052). However, as in the PB2 data set, the lack of bootstrap support in the phylogeny inferred
185
for the major segment indicates that there is in reality an insufficiently strong signal for
186
phylogenetic incongruence to conclude that homologous recombination has occurred.
187
For the two candidate recombinants A/New York/11/2003 (PB2) and
188
A/Christchurch/14/2004 (NP), it is also puzzling that the parental sequences were sampled 25
189
and 31 years apart, respectively. Hence, for one of these recombination events to have
190
occurred, a lineage of viruses closely related to an ?archaic? virus (either A/Hong Kong/14/1974
191
or A/Beijing/1/1968) must have circulated until at least 1999 and recombined with A/New
192
York/424/1999 or A/New York/153/1999. Given the rapid rate of influenza A virus mutation
193
through frequent RNA polymerase error, as well as the rapid lineage turnover driven by positive
194
selection on the major antigenic proteins (6, 11, 12, 23), this scenario seems extremely unlikely.
195 Thus, laboratory error, such as template switching during amplification in a mixed or
contaminated sample, is a likely explanation of these apparent homologous
196 recombination
197
events.
198
In sum, our study has revealed that no sequence of human influenza A virus contains a
199
clear signature of phylogenetic incongruence indicative of the action of homologous RNA
200
recombination. Given that more than 10,000 distinct sequences were analyzed, this constitutes
201
strong evidence that homologous recombination plays only a very minor role, if any, in the
202
evolution of human influenza A virus. More generally, the occurrence of phylogenetic
203
incongruence does not in itself constitute conclusive evidence for this process. Specifically,
204
because our analysis is necessarily based on viral consensus sequences rather than the myriad
205
individual viral molecules that characterize any infection, it is equally plausible that the
206
?recombinants? detected here in fact represent cases of mixed infection in individual hosts
207
followed by the amplification and sequencing of different viral molecules, thereby producing
208
laboratory-generated artificial recombinants. Hence, to demonstrate conclusively the
209
occurrence of homologous recombination in influenza A virus it will be necessary either to clone
210
(or plaque purify) and sequence multiple viral genomes from an individual host and demonstrate
211
the presence of the recombinant and both parental genotypes within the sample (1), or to show
212
that recombinant sequences form a distinct circulating lineage, with readily identifiable parents,
213
that is transmitted among multiple individuals in a population (30).
214
Finally, although there were 315 sequences in the data analyzed here that carried a
215
strong mosaic signal as identified by 3SEQ, it was impossible to verify the vast majority of these
216
as recombinants since the putative recombinant regions were too short to infer a credible
217
phylogenetic history. It is therefore possible that homologous recombination, should it occur in
218
influenza A virus, more commonly involves the transfer of very short sections of RNA, a process
219
that would be undetectable by the majority of other methods devised to detect recombination. If
220
homologous recombination of short segments is determined to be a relevant process in
221 influenza A virus evolution, the basis of our more frequent observation of mosaicism in A/H3N2
viruses compared to A/H1N1 viruses will need to be investigated further. However,
222 by far the
223
strongest signal in the influenza A virus sequence data analyzed here is that of strict clonality,
224
supporting most models of influenza virus evolution proposed to date.
225
226
226
ACKNOWLEDGEMENTS
227
The research undertaken in this study was funded in part by Resources for the Future (MFB),
228
NIH/NIGMS grant P50GM071508 (MFB), National Institutes of Health Grant GM28016 (MFB),
229
NIH grant number GM080533-01 (ECH), and the Intramural Research Program of the NIH, and
230
the NIAID (JKT). We thank John Zollweg and Linda Woodard at the Cornell University Center
231
for Advanced Computing for suggesting algorithmic improvements to 3SEQ, as well as two
232 anonymous reviewers for helpful suggestions
Comment