Abstract
Free full text
An appraisal of gene targets for phylogenetic classification of canine distemper virus: Is the hemagglutinin the best candidate?
Abstract
Sequence analysis of the canine distemper virus (CDV) hemagglutinin (H) gene may provide important insights on virus-host interactions and has also been frequently used for CDV phylogenetic classification. Herein, we performed an in silico analysis of CDV complete genomes (CGs) available in GenBank in order to investigate the suitability of H for CDV classification into lineages/genotypes. In addition, we analyzed the other viral genes for their potential use in CDV classification. Initially, we collected 116 CDV CGs from GenBank and compared their phylogenetic classification with that of their respective H nucleotide (nt) and amino acid (aa) sequences. Subsequently, we calculated the geodesic distance between the CG and H phylogenetic trees. These analyses were later performed with other CDV genes. All CDV CGs were also evaluated for possible recombination events. Nucleotide and aa analyses of H misclassified some Vaccine/America 1/Asia 3 lineage sequences compared to CG analysis, finding supported by both Maximum Likelihood (ML) and Bayesian Markov Chain Monte Carlo (B-MCMC) methods. Moreover, aa-based H analysis showed additional disagreements with the classification obtained by CG. The geodesic distance between the H and CG trees was 0.0680. Strong recombination signals were identified in the H gene, including Vaccine/America 1/Asia 3 lineage sequences. In contrast, C and P were the only genes that fully reproduced the CG classification (by ML and/or B-MCMC) and that did not show strong recombination signals. Furthermore, the P phylogenetic tree showed the lowest geodesic distance from the CG tree (0.0369). These findings suggest C and P as potential targets for CDV phylogenetic classification, especially when full genome sequencing is not possible. Finally, since our results were obtained considering the CDV CGs available to date, future analyses performed as more CDV sequences become available will be useful to assess probable issues of H-based phylogeny and to consolidate the suitability of the C and P genes for CDV classification.
1. Introduction
Canine distemper virus (CDV) (Canine morbillivirus) belongs to the genus Morbillivirus, family Paramyxoviridae and is a pathogen distributed worldwide. CDV infection is often associated with severe disease in domestic dogs, but it has also been identified in wild species, such as carnivore and marine mammals, and non-human primates (Martella et al., 2008; ICTV, 2019). CDV infections may compromise several organs and systems, leading to respiratory, dermatological, gastrointestinal and central nervous system (CNS) signs, the latter frequently resulting in neurological sequelae (Rendon-Marin et al., 2019).
The CDV genome is composed by a single-stranded, negative-sense RNA molecule of approximately 15,700 nucleotides (nt) in length. The viral genome contains six genes, which encode eight viral proteins (Sidhu et al., 1993; Nguyen et al., 2016). The nucleocapsid (N) and matrix (M) genes encode proteins with structural functions. The phosphoprotein (P) gene encodes the P protein, which together with the large (L) protein (L gene), is involved in transcription and replication of the viral genome. The P gene also encodes V and C proteins, which participate in pathogenesis, virulence and immune evasion mechanisms. The hemagglutinin (H) and fusion (F) genes encode the glycoproteins located in the viral envelope, with roles in receptor binding and membrane fusion, respectively (Lamb and Parks, 2013).
Glycoprotein H is the main target for neutralizing antibodies and, as such, undergoes strong selective pressure from the host immune response, displaying the highest sequence variability among the CDV genes (Harder and Osterhaus, 1997; Mochizuki et al., 1999). This variability has encouraged the use of H as a target for phylogenetic studies, which, in turn, may lead to classification of CDV into distinct lineages (or genotypes) (Nguyen et al., 2016; Jo et al., 2019; Piewbang et al., 2020). Hence, CDV isolates/strains sharing less than 95% amino acid (aa) identity in the H protein have been classified into distinct lineages (Mochizuki et al., 1999; Loots et al., 2018; Duque-Valencia et al., 2019a). Furthermore, it is suggested that sequences with at least 98% nt identity may be clustered into the same subgenotype (Budaszewski et al., 2014; Costa et al., 2021).
Several CDV lineages have been described based on phylogenetic analysis of the H gene, considering nt or aa sequences. In general, lineages are named according to their region of identification and/or distribution, e.g., America (or North America) 1, Asia 1, Europe 1 (or South America 1), Arctic Like and Caspian (Jo et al., 2019; Duque-Valencia et al., 2020; Piewbang et al., 2020; Costa et al., 2021). When more than one lineage is identified in the same region, they are distinguished by cardinal numbers, e.g., Asia 1 and Asia 2 (Lan et al., 2007). Furthermore, viruses of the same lineage identified in different regions are named after both regions, e.g., Europe 1/South America 1 (Duque-Valencia et al., 2020).
Considering the role of CDV H glycoprotein in viral tropism and interactions with the immune system, nt- and aa-based analyses of this gene/protein may be also useful for studies of antigenic variability, vaccine protection/failure, and for cross-species spillover discussions (Iwatsuki et al., 2000; Martella et al., 2008; Duque-Valencia et al., 2019b). However, regarding the use of H for CDV classification, it is important to highlight that the evolutionary history of a gene may be different from that of the whole genome (Gregory, 2008). Classifications of natural recombinant viruses, for instance, may be biased by the analysis of single genes (Piewbang et al., 2019).
Herein, we performed an in silico analysis of all CDV CGs available in the GenBank database in order to investigate the suitability of H gene/protein for classification of CDV in lineages/genotypes. In addition, we analyzed the other viral genes and discussed which would be the most suitable target for CDV classification.
2. Material and methods
2.1. Study design
To assess the suitability of the H gene for CDV phylogeny, we compared the classification of isolates/strains (GenBank) based on their CG versus H analyses; the later considering both nt and aa sequences. We also compared the topology and stem lengths of the CG vs. H phylogenetic trees by calculating the geodesic distance. These analyses were subsequently performed with other CDV genes to identify the most suitable target for CDV classification. In addition, all CGs available in GenBank were analyzed regarding possible recombination events, which could represent an obstacle towards a correct virus classification.
2.2. Data collection
All CDV CGs available in the GenBank database (National Center of Biotechnology Information, NCBI) were collected on March 31, 2022. The search terms were “Canine morbillivirus” and “Canine distemper virus”, and only sequences identified as complete genome were included in the study. Furthermore, only sequences identified as isolates or strains were analyzed, excluding sequences from viruses generated by recombinant technology, identified as “rescue” or “recombinant”.
2.3. Phylogenetic analyses
The CG sequences were aligned by the Multiple Alignment using Fast Fourier Transformation software (MAFFT, version 7.490) (Katoh et al., 2019), composing the main dataset of the study. After alignment, seven subsets were assembled, one for each CDV gene: N, P, C, M, F, H and L, which included both untranslated regions (UTRs) and coding sequences (CDS). The V gene was not included because it is edited during RNA transcription (Lamb and Parks, 2013), which would make comparisons between aa- and nt-based phylogenetic analyses difficult.
The best analysis model for CG and individual genes was defined by the jModelTest software (Posada, 2008). The phylogenetic analyses were performed using the Molecular Evolutionary Genetics Analysis software (version 10.2.5) (MEGA-X) (Kumar et al., 2018) and the evolutionary history was inferred by the Maximum Likelihood (ML) method, with 1000 bootstrap replicates. Analysis details are described in Table 1.
Table 1
Genomic region | Genomic position (nt)c | Substitution modeld | Gamma shape parameter | Proportion of invariant sites | BICf | Log likelihood | Geodesic distanceg |
---|---|---|---|---|---|---|---|
CGa | 1–15,690 | GTR+G+I | 0.4495 | 0.29 | 211,971.4595 | −105,023.10 | – |
Nb | 56–1738 | GTR+G+I | 0.4190 | 0.31 | 22,063.3249 | −10,178.53 | 0.0572 |
P | 1742–3396 | GTR+G | 0.4366 | NAe | 21,305.5814 | −9781.75 | 0.0369 |
C | 1823–2347 | GTR+G | 0.4270 | NA | 7496.8788 | −3006.31 | 0.0545 |
M | 3400–4846 | GTR+G+I | 0.6255 | 0.25 | 23,619.2061 | −10,972.45 | 0.0949 |
F | 4850–7055 | GTR+G+I | 0.6255 | 0.25 | 38,155.4624 | −18,270.21 | 0.0746 |
H | 7059–9004 | GTR+G+I | 0.6253 | 0.25 | 31,044.1102 | −14,667.81 | 0.0680 |
L | 9008–15,649 | GTR+G+I | 0.3733 | 0.32 | 78,548.1466 | −38,332.97 | 0.0425 |
To support our main findings, we performed an additional analysis by the Bayesian Markov Chain Monte Carlo (B-MCMC) method since this approach has also been used in CDV phylogenetic studies. Herein, CG and gene alignments were converted to NEXUS format and analyzed in the BEAUti/BEAST v.1.10.4 software package, according to the best nucleotide substitution model (as described above), using a strict molecular clock model. Algorithm was run over 10 million generations, with the first 10% being discarded as burn-in. Output results were submitted to the TreeAnotator v.1.10.4 software to generate a maximum clade credibility tree (MCC). At the end, the MCC tree was visualized in the FigTree v.1.4.4 software.
In addition to nt-based phylogenetic analysis, each gene was also analyzed for aa sequence. The best substitution model was defined by MEGA-X software (version 10.2.5) and phylogenetic analysis was performed using the ML method, with 1000 bootstrap replicates (MEGA-X). Analysis details are available in a supplementary file (File S1).
2.4. Lineages identification and phylogenetic comparisons
The clusters identified in CG tree were named according to lineages described in previous phylogenetic studies. We then compared CDV phylogenetic classifications based on CG vs. individual genes; the latter considering both nt and aa sequences. In addition, we also compared the topology and stem length of the CG vs. gene phylogenetic trees by calculating the geodesic distance, using TreeCmp software, version 2.0 (https://eti.pg.edu.pl/TreeCmp/WEB) (Goluch et al., 2020), considering the parameters and options: “Ref-to-all comparison” and “GeoUnrooted (Geodesic Unrooted distance)” (in “Unrooted metrics”), including “Zero weights allowed” and considering the average distance informed in the “Include summary” (both in “Other options”).
2.5. Recombination analysis
All CGs included in the study were evaluated for possible recombination events. The analysis was performed using the Recombination Detection Program (RDP) (version 4.101) (Martin et al., 2015), including the RDP, GENECONV, MaxChi, BootScan and SiScan methods, with default settings and p-value < 0.01.
3. Results
3.1. Phylogenetic classification of CDV complete genomes
Complete CDV genome sequences were initially collected from GenBank (n = 196) and, after the exclusion criteria, we obtained a dataset of 116 CGs. Some of these sequences have received different classifications in CDV phylogeny studies. Thus, to cover all available classifications and to facilitate the contextualization of our results, we assigned compound names to the lineage of these sequences: Africa/Africa 1/Africa 2 and Africa/South Africa/Africa 1.
Herein, CDV CGs were distributed into lineages Asia 1 (39 sequences), Europe 1/South America 1 (12 sequences), Asia 4/Thailand (7 sequences), Africa/Africa 1/Africa 2 (7 sequences), Africa/South Africa/Africa 1 (2 sequences), Arctic Like (4 sequences), Asia 2 (14 sequences), America 2 (6 sequences), Caspian (3 sequences), Vaccine/America 1/Asia 3 (19 sequences; 3 and 16 clustered in group a and b, respectively). Two sequences (MT448054 and KF640687) were grouped into a yet unidentified lineage, termed “Undetermined 1″; and an additional sequence (MH316137), which did not have a defined lineage, was named “Undetermined 2″ (Fig. 1A and File S2). “Undetermined” classifications were based on ML analysis and previous reports of CDV evolution (Jo et al., 2019; Peserico et al., 2019). Furthermore, MT905031, MW876862 and MK037459, which were not previously classified (MT905031 and MW876862) or defined as Asia 5 based on the H gene (MK037459), were classified as Africa/Africa 1/Africa 2 considering their distribution in the ML and B-MCMC analyses (Fig. 1A and File S2). Details (metadata) of the sequences analyzed in the study are available in File S3.
3.2. Lineage classification: complete genome versus individual genes
3.2.1. Nucleotide sequence
The phylogenetic classification of CDV based on the H gene showed disagreement with the CG analysis (both in ML and B-MCMC): in the CG phylogenetic tree, the America 1/Vaccine/Asia 3 sequences were distributed in two clusters (here termed groups a and b), whereas the H analysis grouped them in a single cluster (Fig. 1B and File S2). No disagreement was identified in the analysis of the C gene, i.e., it fully reproduced the CG classification by ML and B-MCMC analyses (Fig. 1C and File S2).
The phylogenetic trees of N, M, F, P and L genes showed at least one discrepancy from the CG by ML (File S4). The sequence MF926599 was identified as Asia 1 by the CG, but could not be classified by the N analysis. The sequences AY649446 and AY443350 (America 2 lineage by CG) were clustered with Vaccine/America 1/Asia 3 (group a) in the M phylogenetic analysis. In addition, MF926599 (identified as Asia 1 by CG) was classified as Vaccine/America 1/Asia 3 (group b) by the M gene. The sequence AY443350 (America 2 by CG) was classified as Vaccine/America 1/Asia 3 (group a) by the F analysis. In the L phylogenetic tree, the America 2 and Vaccine/America 1/Asia 3 (group a) sequences were grouped in the same cluster. The sequence MH316137, classified as Undetermined 2 by the CG, was grouped with the America 2 lineage in the P analysis. These disagreements were also observed when the CG vs. genes comparisons were performed by B-MCMC, except for the P gene, which reproduced integrally the CG classification obtained by Bayesian inference (File S2).
3.2.2. Amino acid sequence
Phylogenetic analysis based on the aa sequence of the H gene also showed some discrepancies with the CG classification. America 1/Vaccine/Asia 3 group a and b sequences were grouped in the same cluster in the H analysis (similar to that observed in the nt analysis). In addition, both America 2 and Undetermined 1 lineages were ungrouped after H phylogenetic analysis (Fig. 2A). The best models for aa analysis defined here are not available in the BEAUti/BEAST v.1.10.4 software package. However, the Bayesian analysis of H performed with the LG+G, the closest model to the LG+G+F (File S1), also showed the distribution of the America 1/Vaccine/Asia 3 groups a and b in the same cluster (data not shown).
For the P and C genes, the aa-based analysis fully reproduced the CG classification (Fig. 2B and and2C).2C). For the M gene, sequences AF164967 and KU666057 (both America 2 by CG) could not be classified in any lineage, and the America 1/Vaccine/Asia 3 (group b) and Caspian lineages were grouped into the same cluster. Regarding the F phylogenetic tree, the sequence AY443350 (America 2 by CG) could not be classified in any lineage, and KF640687 (Undetermined 1 by CG) was clustered with Asia 4/Thailand. Phylogenetic analyses from the aa sequences of the N and L genes showed the same disagreements described for nt. Phylogenetic analyses of the aa sequences of N, M, F and L are showed in File S5.
3.3. Geodesic distance measurement
The geodesic distance between the phylogenetic trees of CG and H was 0.0680. Regarding the P, L, C, N, F and M genes, the geodesic distances from CG were 0.0369, 0.0425, 0.0545, 0.0572, 0.0746 and 0.0949, respectively.
3.4. Recombination analysis
Eighteen CDV CGs were identified as possible recombinants, which were classified as Africa/Africa 1/Africa 2 (five sequences), Asia 1 (four sequences), America 1/Vaccine/Asia 3 group a (three sequences), America 2 (three sequences), Asia 2 (two sequences), Undetermined 1 lineage (one sequence). Regarding individual genes, 16 possible recombinant events were identified for F, 15 for H, 9 for L, 4 for M and 1 for N gene. No recombination events were predicted for P and C genes. Details of recombination findings are described in File S6.
4. Discussion
Hemagglutinin-based phylogenetic analyses may provide important insights on CDV evolution, potential vaccine failure, and virus jumping between host species (Iwatsuki et al., 2000; Duque-Valencia et al., 2019b). Interestingly, H is also the most frequently used gene for CDV phylogenetic classification (Martella et al., 2008; Nguyen et al., 2016; Duque-Valencia et al., 2019b; Piewbang et al., 2020). Herein, we first investigated whether the H gene/protein would be the best target for CDV phylogenetic classification. Subsequently, we explored the other CDV genes for their potential use in CDV classification.
Initially, we compared the phylogenetic classification of 116 CDV CGs available in GenBank with that obtained by analyzing their respective H. The aa and nt sequences of H were analyzed, as both have been used for CDV classification (Bolt et al., 1997; Mochizuki et al., 1999; Espinal et al., 2014; Nguyen et al., 2016). Interestingly, both H analyses (nt and aa) showed the same disagreement when compared to the CG-based phylogeny: the America 1/Vaccine/Asia 3 lineage was divided into groups a and b by the CG analysis, whereas it grouped in the same cluster in the H analyses. This finding was observed in the ML and B-MCMC analyses. Except for the H trees, groups a and b of the America 1/Vaccine/Asia 3 lineage were also separated in the trees from all other CDV genes based on nt or aa sequences, using ML or B-MCMC methods. Furthermore, the aa-based H analysis also showed additional disagreements with the CG, suggesting that the nt analysis may be the best choice when the H gene is used for CDV classification.
The difference between the H and CG trees (as well as the other genes) suggests that the clustering between groups a and b of the America 1/Vaccine/Asia 3 lineage in H is probably related to a recombination event involving this gene. This hypothesis is strongly supported by the recombination analysis of our dataset, in which the Snyder Hill vaccine strain (JN896987) (group b) was likely the major H donor for the three America 1/Vaccine/Asia 3 (group a) sequences (AY466011, AY542312 and AY445077) that clustered with the America 1/Vaccine/Asia 3 (group b). This result is also in line with that reported by Budaszewski et al. (2016), which suggests the Snyder Hill strain as a possible donor of the H gene for these sequences. Interestingly, the sequences AY466011, AY542312 and AY445077 are from viruses identified in free-living raccoons (Lednicky et al., 2004a; 2004b), whose ease circulation may contribute to co-infections, favoring recombination between different CDVs.
In addition, another 12 sequences showed probable recombination events in the H gene. Probable recombinations in CDV H have also been described in other studies (Han et al., 2008; Ke et al., 2015; Piewbang et al., 2019) and warn to the possibility that individual genes and whole organisms may have different evolutionary histories, which may lead to possible misunderstandings in single target-based classifications. Despite these findings, it should be emphasized that, although H seems not to be the most suitable target for phylogenetic classification, its analysis remains useful in studies of virus-host interactions and antigenic variability as well.
Due to the mismatch between H and CG-based classifications, we analyzed the other CDV genes to identify the one that would reproduce the CG classification. The F, L, M and N genes showed at least one CG discordance, both in the nt and aa analyses, observed in the ML or B-MCMC trees. Some CDV sequences also showed possible recombination events in these genes, especially in F - the gene with the highest number of probable recombinations described here.
Interestingly, the N and F genes have been previously used as targets in phylogenetic studies (Castilho et al., 2007; Sarute et al., 2013; Headley et al., 2015; Duque-Valencia et al., 2019a). The fusion protein signal-peptide (Fsp), an internal region of F gene, for example, was suggested as a target for phylogenetic studies due to its high variability, providing an easier differentiation between sequences (Sarute et al., 2013). This degree of variability, however, may be different from the CG, which may lead to discordant classifications when analyzing these two targets. Furthermore, the recombination evidence in the F gene described in our study, as well as those reported by other authors (Budaszewski et al., 2016), suggest careful use of this gene for phylogenetic classification purposes.
On the other hand, we observed that the classification obtained by the C gene analysis fully reproduced the CG results, both in the nt and aa sequences, in the ML or B-MCMC analysis. Furthermore, the P analysis (for aa in ML, or nt in B-MCMC) also reproduced the CG classification. Importantly, these were the only CDV genes that did not show strong recombination signals. Previous reports have described phylogenetic analyses based on P (Lednicky et al., 2004b; Lan et al., 2007), although some recombination events have been proposed for this genomic region (Budaszewski et al., 2016; Yuan et al., 2017). However, in our dataset, all recombination signals described for this gene were identified as possibly false positive recombination events by the RDP software. Overall, these findings indicate the C and P genes as suitable targets for CDV phylogenetic classification, especially when platforms for CG sequencing, considered the more reliable strategy for viral classification (Duque-Valencia et al., 2019b; Piewbang et al., 2019), are not available. Conversely, it is important to emphasize that C and P-based analyses may not allow for deep insights into CDV-host interactions, for which the H gene, as mentioned above, is probably the most suitable target.
In addition to lineage classification and recombination assessments, we calculated the geodesic distance between the phylogenetic tree of individual genes and that of the CG. This metric has been recently used to evaluate similarity between distinct phylogenetic trees considering both their topological differences and edge lengths (de Oliveira et al., 2021, 2022; Merchioratto et al., 2023). Here, the H gene was the fifth furthest from the CG (value 0.0680). In CDV, CG and H may have different nucleotide substitution rates (Piewbang et al., 2019). As the substitution rate is demonstrated by the edge length of the phylogenetic trees (ML), the difference between CG and H was likely reflected in the geodesic distance between these two targets. Interestingly, the geodesic distance between the P and CG trees found in our study was the smallest (0.0369).
Finally, considering the level of agreement between the phylogenetic analysis of the CDV CG with its respective genes, the geodesic distance between CG and genes, and the recombination findings, we observed that the H gene, especially its aa sequence, is not the most suitable target for CDV phylogenetic classification. Furthermore, considering the same criteria as above, we suggest the C and P genes are the best candidates for CDV classification. It is important to note that these results were obtained considering the CDV CGs available in GenBank to date. Therefore, future analyses – performed as more CDV CGs become available – are welcome to verify the limitations of H-based phylogenetic analyses and to consolidate the suitability of the C and P genes for CDV classification.
Funding
This work was supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) - Finance code 001.
Author statement
All authors have contributed to, seen and approved the final and submitted version. We have no conflicts of interest to disclose.
Data statement
The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
ASB thanks to Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) for the scholarship (process 140837/2020-5). JVJSJr was supported by Financiadora de Estudos e Projetos (FINEP) DTI-A-1. EFF (process 301414/2010-6) and RW (process 305867/2018–0) were supported by CNPq research fellowships. Coordenação de Aperfeiçocamento de Pessoal de Nível Superior (CAPES) (Brazil), finance code 001, partially supported the research.
Footnotes
Supplementary material associated with this article can be found, in the online version, at 10.1016/j.virusres.2023.199043.
Appendix. Supplementary materials
File S1: Parameters used in the phylogenetic trees of the CDV amino acid sequences.
File S2: Bayesian phylogenetic tree from the complete CDV genome and genes. Bayesian analyses were performed using the BEAUti/BEAST v.1.10.4 software package. Output results were submitted to the TreeAnotator v.1.10.4 to generate a maximum clade credibility (MCC) tree, which was visualized in the FigTree v.1.4.4 software. Lineages with discordant classification from the CG are in bold and were identified with the lineage color given by the CG. Clusters formed by sequences from different lineages were identified with the name of the two lineages. Sequences that did not cluster in any lineage were identified with “?”.
File S3: Metadata of the sequences analyzed in the study.
File S4: Phylogenetic trees based on nucleotide sequence analysis of CDV genes. One hundred and sixteen CDV complete genomes (CGs) were collected from GenBank. Phylogenetic trees were constructed considering the untranslated (UTR) and coding (CDS) regions of each gene. Lineages with discordant classification from the CG are in bold and were identified with the lineage color given by the CG. Clusters formed by sequences from different lineages were identified with the name of the two lineages. Sequences that did not cluster in any lineage were identified with “?” Analyses were performed with the Maximum Likelihood method and bootstrap of 1000 replicates (Table 1), using the MEGA-X software (version 10.2.5).
File S5: Phylogenetic trees based on analysis of the amino acid sequences of CDV genes. One hundred and sixteen CDV complete genomes (CGs) were obtained from GenBank. Lineages with discordant classification from the CG (in bold) were identified with the lineage color given by the CG. Clusters formed by sequences from different lineages were termed with the name of the two lineages. Sequences that did not cluster in any lineage were identified with “?” Phylogenetic trees were constructed using MEGA-X software (version 10.2.5), Maximum Likelihood method (File S1) and bootstrap of 1000 replicates.
File S6: Data on CDV genomes with strong recombination signals. The 116 CDV complete genomes included in the study were analyzed for possible recombination events by the Recombination Detection Program (version 4.101), using default settings and including RDP, GENECONV, MaxChi, BootScan and SiScan methods, and p-value < 0.01. Data are presented according to the gene in which the recombination signal was identified. The major and minor donors of the genomes are also indicated. *Lineages defined by the CG analysis.
Data availability
Data will be made available on request.
References
- Bolt G., Jensen T.D., Gottschalck E., Arctander P., Appel M.J.G., Buckland R., Blixenkrone M. Genetic diversity of the attachment (H) protein gene of current field isolates of canine distemper virus. J. Gen. Virol. 1997;78:367–372. 10.1099/0022-1317-78-2-367. [Abstract] [CrossRef] [Google Scholar]
- Budaszewski R.F., Streck A.F., Weber M.N., Siqueira F.M., Guedes R.L.M., Canal C.W. Influence of vaccine strains on the evolution of canine distemper virus. Infect. Genet. Evol. 2016;41:262–269. 10.1016/j.meegid.2016.04.014. [Abstract] [CrossRef] [Google Scholar]
- Budaszewski R.F., Pinto L.D., Weber M.N., Caldart E.T., Alves C.D.B.T., Martella V., Ikuta N., Lunge V.R., Canal C.W. Genotyping of canine distemper virus strains circulating in Brazil from 2008 to 2012. Virus Res. 2014;180:76–83. 10.1016/j.virusres.2013.12.024. [Abstract] [CrossRef] [Google Scholar]
- Castilho J.G., Brandão P.E., Carnieli Jr, P., Oliveira R.N., Macedo C.I., Peixoto Z.M.P., Carrieri M.L., Kotait I. Molecular analysis of the N gene of canine distemper virus in dogs in Brazil. Arq. Bras. Med. Vet. Zootec. 2007;59:654–659. 10.1590/S0102-09352007000300016. [CrossRef] [Google Scholar]
- Costa V.G., Saivish M.V., Oliveira P.G., Silva-Júnior A., Moreli M.L., Krüger R.H. First complete genome sequence and molecular characterization of Canine morbillivirus isolated in Central Brazil. Sci. Rep. 2021;11:1–10. 10.1038/s41598-021-92183-2. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- De Oliveira P.S.B., Silva Júnior J.V.J., Weiblen R., Flores E.F. Subtyping bovine viral diarrhea virus (BVDV): which viral gene to choose? Infect. Genet. Evol. 2021;92 10.1016/j.meegid.2021.104891. [Abstract] [CrossRef] [Google Scholar]
- De Oliveira P.S.B., Silva Júnior J.V.J., Weiblen R., Flores E.F. A new (old) bovine viral diarrhea virus 2 subtype: bVDV-2e. Arch. Virol. 2022 10.1007/s00705-022-05565-w. [Abstract] [CrossRef] [Google Scholar]
- Duque-Valencia J., Diaz F.J., Ruiz-Saenz J. Phylogenomic analysis of two co-circulating canine distemper virus lineages in Colombia. Pathogens. 2020;9:26. 10.3390/pathogens9010026. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Duque-Valencia J., Forero-Muñoz N.R., Díaz F.J., Martins E., Barato P., Ruiz-Saenz J. Phylogenetic evidence of the intercontinental circulation of a canine distemper virus lineage in the Americas. Sci. Rep. 2019;9:15747. 10.1038/s41598-019-52345-9. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Duque-Valencia J., Sarute N., Olarte-Castillo X.A., Ruíz-Sáenz J. Evolution and interspecies transmission of canine distemper virus – an outlook of the diverse evolutionary landscapes of a multi-host virus. Viruses. 2019;11:582. 10.3390/v11070582. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Espinal M.A., Díaz F.J., Ruiz-Saenz J. Phylogenetic evidence of a new canine distemper virus lineage among domestic dogs in Colombia, South America. Vet. Microbiol. 2014;172:168–176. 10.1016/j.vetmic.2014.05.019. [Abstract] [CrossRef] [Google Scholar]
- Goluch T., Bogdanowicz D., Giaro K. Visual TreeCmp: comprehensive comparison of phylogenetic trees on the web. Methods Ecol. Evol. 2020;11:494–499. 10.1111/2041-210X.13358. [CrossRef] [Google Scholar]
- Gregory T.R. Understanding evolutionary trees. Evol. Educ. Outreach. 2008;1:121–137. 10.1007/s12052-008-0035-x. [CrossRef] [Google Scholar]
- Han G.Z., Liu X.P., Li X.S. Cross-species recombination in the haemagglutinin gene of canine distemper virus. Virus Res. 2008;136:198–201. 10.1016/j.virusres.2008.04.022. [Abstract] [CrossRef] [Google Scholar]
- Harder T.C., Osterhaus A.D. Canine distemper virus — a morbillivirus in search of new hosts? Trends Microbiol. 1997;5:120–124. 10.1016/S0966-842X(97)01010-X. [Abstract] [CrossRef] [Google Scholar]
- Headley S.A., Santos T.R., Bodnar L., Saut J.P.E., Silva A.P., Alfieri A.F., Medeiros A.A., Soares N.P., Alfieri A.A. Molecular detection and phylogenetic relationship of wild-type strains of canine distemper virus in symptomatic dogs from Uberlândia, Minas Gerais. Arq. Bras. Med. Vet. Zootec. 2015;67:1510–1518. 10.1590/1678-4162-7052. [CrossRef] [Google Scholar]
- International Committee on Taxonomy of Viruses (ICTV). 2019. ICTV Taxonomy history: Canine morbillivirus. https://talk.ictvonline.org/taxonomy/p/taxonomy-history?taxnode_id=202101613.
- Iwatsuki K., Tokiyoshi S., Hirayama N., Nakamura K., Ohashi K., Wakasa C., Mikami T., Kai C. Antigenic differences in the H proteins of canine distemper viruses. Vet. Microbiol. 2000;71:281–286. 10.1016/S0378-1135(99)00172-8. [Abstract] [CrossRef] [Google Scholar]
- Jo W.K., Peters M., Kydyrmanov A., Van de Bildt M.W.G., Kuiken T., Osterhaus A., Ludlow M. The canine morbillivirus strain associated with an epizootic in caspian seals provides new insights into the evolutionary history of this virus. Viruses. 2019;11:1–16. 10.3390/v11100894. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Katoh K., Rozewicki J., Yamada K.D. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2019;20:1160–1166. 10.1093/bib/bbx108. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Ke G.M., Ho C.H., Chiang M.J., Sanno-Duanda B., Chung C.S., Lin M.Y., Shi Y.Y., Yang M.H., Tyan Y.C., Liao P.C., Chu P.Y. Phylodynamic analysis of the canine distemper virus hemagglutinin gene. BMC Vet. Res. 2015;11:164. 10.1186/s12917-015-0491-9. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Kumar S., Stecher G., Li M., Knyaz C., Tamura K. Mega X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018;35:1547–1549. 10.1093/molbev/msy096. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Lamb R.A., Parks G.D., Knipe D.M., Howley P.M. Fields Virology. 6th ed. LWW; Philadelphia, PA, United States: 2013. pp. 957–995. [Google Scholar]
- Lan N.T., Yamaguchi R., Kawabata A., Uchida K., Sugano S., Tateyama S. Comparison of molecular and growth properties for two different canine distemper virus clusters, Asia 1 and 2, in Japan. J. Vet. Med. Sci. 2007;69:739–744. 10.1292/jvms.69.739. [Abstract] [CrossRef] [Google Scholar]
- Lednicky J.A., Dubach J., Kinsel M.J., Meehan T.P., Bocchetta M., Hungerford L.L., Sarich N.A., Witecki K.E., Braid M.D., Pedrack C., Houde C.M. Genetically distant American canine distemper virus lineages have recently caused epizootics with somewhat different characteristics in raccoons living around a large suburban zoo in the USA. Virol. J. 2004;1:2. 10.1186/1743-422X-1-2. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Lednicky J.A., Meehan T.P., Kinsel M.J., Dubach J., Hungerford L.L., Sarich N.A., Witecki K.E., Braid M.D., Pedrak C., Houde C.M. Effective primary isolation of wild-type canine distemper virus in MDCK, MV1 Lu and Vero cells without nucleotide sequence changes within the entire haemagglutinin protein gene and in subgenomic sections of the fusion and phospho protein genes. J. Virol. Methods. 2004;118:147–157. 10.1016/j.jviromet.2004.02.004. [Abstract] [CrossRef] [Google Scholar]
- Loots A.K., Mokgokong P.S., Mitchell E., Venter E.H., Kotze A., Dalton D.L. Phylogenetic analysis of canine distemper virus in South African wildlife. PLoS One. 2018;13 10.1371/journal.pone.0199993. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Martella V., Elia G., Buonavoglia C. Canine distemper virus. Vet. Clin. N. Am. Small Anim. Pract. 2008;38:787–797. 10.1016/j.cvsm.2008.02.007. [Abstract] [CrossRef] [Google Scholar]
- Martin D.P., Murrel B., Golden M., Khoosal A., Muhire B. RDP4: detection and analysis of recombination patterns in virus genomes. Virus Evol. 2015;1:vev003. 10.1093/ve/vev003. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Merchioratto I., de Oliveira P.S.B., Silva Júnior, et al. Phylogeny and amino acid analysis in single and mixed bovine papillomavirus infections in Southern Brazil, 2016-2020. Arch Virol. 2023;168:52. 10.1007/s00705-022-05622-4. [Abstract] [CrossRef] [Google Scholar]
- Mochizuki M., Hashimoto M., Hagiwara S., Yoshida Y., Ishguro S. Genotypes of canine distemper virus determined by analysis of the hemagglutinin genes of recent isolates from dogs in Japan. J. Clin. Microbiol. 1999;37:2936–2942. 10.1128/JCM.37.9.2936-2942.1999. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- National Center for Biotechnology Information – NCBI. https://www.ncbi.nlm.nih.gov/.
- Nguyen D.V., Suzuki J., Minami S., Yonemitsu K., Nagata N., Kuwata R., Shimoda H., Vu C.K., Truong T.Q., Maeda K. Isolation and phylogenetic analysis of canine distemper virus among domestic dogs in Vietnam. J. Vet. Med. Sci. 2016;79:123–127. 10.1292/jvms.16-0394. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Peserico A., Marcacci M., Malatesta D., Di Domenico M., Pratelli A., Mangone I., D'Alterio N., Pizzurro F., Cirone F., Zaccaria G., Cammà C., Lorusso A. Diagnosis and characterization of canine distemper virus through sequencing by MinION nanopore technology. Sci. Rep. 2019;9:1714. 10.1038/s41598-018-37497-4. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Piewbang C., Chansaenroj J., Kongmakee P., Banlunara W., Poovorawa Y., Techangamsuwan S. Genetic adaptations, biases, and evolutionary analysis of canine distemper virus Asia-4 lineage in a fatal outbreak of wild-caught civets in Thailand. Viruses. 2020;12:361. 10.3390/v12040361. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Piewbang C., Radtanakatikanon A., Puenpa J., Poovorawan Y., Techangamsuwan S. Genetic and evolutionary analysis of a new Asia-4 lineage and naturally recombinant canine distemper virus strains from Thailand. Sci. Rep. 2019;9:3198. 10.1038/s41598-019-39413-w. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Posada D. jModelTest: phylogenetic model averaging. Mol. Biol. Evol. 2008;25:1253–1256. 10.1093/molbev/msn083. [Abstract] [CrossRef] [Google Scholar]
- Rendon-Marin S., Budaszewski R.F., Canal C.W., Ruiz-Saenz J. Tropism and molecular pathogenesis of canine distemper virus. Virol. J. 2019;16:30. 10.1186/s12985-019-1136-6. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Sarute N., Calderón M.G., Pérez R., La Torre J., Hernández M., Francia L., Panzera Y. The fusion protein signal-peptide-coding region of canine distemper virus: a useful tool for phylogenetic reconstruction and lineage identification. PLoS One. 2013;8:e63595. 10.1371/journal.pone.0063595. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Sidhu M.S., Husar W., Cook S.D., Dowling P.C., Udem S.A. Canine distemper terminal and intergenic non-protein coding nucleotide sequences: completion of the entire CDV genome sequence. Virology. 1993;193:66–72. 10.1006/viro.1993.1103. [Abstract] [CrossRef] [Google Scholar]
- Yuan C., Liu W., Wang Y., Hou J., Zhang L., Wang G. Homologous recombination is a force in the evolution of canine distemper virus. PLoS One. 2017;12 10.1371/journal.pone.0175416. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
Articles from Virus Research are provided here courtesy of Elsevier
Citations & impact
Impact metrics
Alternative metrics
Discover the attention surrounding your research
https://www.altmetric.com/details/141183175
Article citations
Phylogenetic analysis of papillomaviruses in dogs from southern Brazil: molecular epidemiology and investigation of mixed infections and spillover events.
Braz J Microbiol, 55(2):2025-2033, 07 May 2024
Cited by: 0 articles | PMID: 38710991
Data
Data behind the article
This data has been text mined from the article, or deposited into data resources.
Nucleotide Sequences (Showing 12 of 12)
- (3 citations) ENA - AY443350
- (2 citations) ENA - AY466011
- (2 citations) ENA - AY542312
- (2 citations) ENA - MH316137
- (2 citations) ENA - KF640687
- (2 citations) ENA - AY445077
- (1 citation) ENA - MT448054
- (1 citation) ENA - AY649446
- (1 citation) ENA - KU666057
- (1 citation) ENA - MF926599
- (1 citation) ENA - JN896987
- (1 citation) ENA - AF164967
Show less
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Phylogenetic analysis of the wild-type strains of canine distemper virus circulating in the United States.
Virol J, 15(1):118, 02 Aug 2018
Cited by: 15 articles | PMID: 30068352 | PMCID: PMC6090796
Phylogenetic analysis of haemagglutinin gene deciphering a new genetically distinct lineage of canine distemper virus circulating among domestic dogs in India.
Transbound Emerg Dis, 66(3):1252-1267, 27 Feb 2019
Cited by: 24 articles | PMID: 30725534
Genotypic lineages and restriction fragment length polymorphism of canine distemper virus isolates in Thailand.
Vet Microbiol, 166(1-2):76-83, 14 Jun 2013
Cited by: 19 articles | PMID: 23830775
Evolution and Interspecies Transmission of Canine Distemper Virus-An Outlook of the Diverse Evolutionary Landscapes of a Multi-Host Virus.
Viruses, 11(7):E582, 26 Jun 2019
Cited by: 51 articles | PMID: 31247987 | PMCID: PMC6669529
Review Free full text in Europe PMC
Funding
Funders who supported this work.
Conselho Nacional de Desenvolvimento Científico e Tecnológico (1)
Grant ID: 140837/2020-5
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
Financiadora de Estudos e Projetos (1)
Grant ID: 301414/2010-6