About the Authors:
Minako Hijikata
Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing
Affiliation: Department of Pathophysiology and Host Defense, The Research Institute of Tuberculosis, Japan Anti-Tuberculosis Association, Kiyose, Tokyo, Japan
Naoto Keicho
Roles Conceptualization, Data curation, Formal analysis, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing
* E-mail: [email protected]
Affiliation: The Research Institute of Tuberculosis, Japan Anti-Tuberculosis Association, Kiyose, Tokyo, Japan
ORCID http://orcid.org/0000-0001-8232-9448
Le Van Duc
Roles Funding acquisition, Investigation, Project administration, Resources, Supervision
Affiliation: Da Nang Lung Hospital, Da Nang, Vietnam
Shinji Maeda
Roles Investigation, Methodology, Resources
Affiliation: Hokkaido Pharmaceutical University School of Pharmacy, Sapporo, Hokkaido, Japan
Nguyen Thi Le Hang
Roles Investigation, Project administration, Resources, Supervision
Affiliation: NCGM-BMH Medical Collaboration Center, Hanoi, Vietnam
Ikumi Matsushita
Roles Investigation
Affiliation: Department of Pathophysiology and Host Defense, The Research Institute of Tuberculosis, Japan Anti-Tuberculosis Association, Kiyose, Tokyo, Japan
Seiya Kato
Roles Funding acquisition, Project administration, Supervision
Affiliation: The Research Institute of Tuberculosis, Japan Anti-Tuberculosis Association, Kiyose, Tokyo, Japan
Introduction
The methods of genotyping Mycobacterium tuberculosis complex (MTBC) facilitate many aspects of tuberculosis studies. The MTBC was previously considered genetically monomorphic in nature but the development of genotyping methods that discriminate strains into distinct lineages has demonstrated previously unrecognized diversity. Consequently, the genetic variations of MTBC have been investigated extensively in phylogenetic studies to understand the evolution and spread of MTBC [1, 2]. Based on the regions of difference (RD) classification system using large sequence polymorphisms (LSP), six major global MTBC lineages have been defined (1 Indo-Oceanic, 2, East-Asian including Beijing, 3 East-African-Indian, 4 Euro-American, 5 West Africa or Mycobacterium africanum I, 6 West Africa or M. africanum II) [3], and another phylogenetic lineage of MTBC has recently been described in Ethiopia as lineage 7 [4]. Whole-genome sequencing (WGS) technology using a high-throughput sequencer has enabled us to obtain more complete information about phylogenetic markers, and SNP-based barcode system is now used to classify a number of sublineages [2].
Among the molecular markers traditionally used, clustered regulatory interspaced short palindromic repeats (CRISPR)-based spacer oligonucleotide typing (spoligotyping) [5] and insertion sequence (IS)6110-restriction fragment length polymorphism [6] have often been applied in epidemiologic research. The direct repeat (DR) region of the M. tuberculosis genome consists of 36 base pair (bp) DR copies and 35 to 41 bp spacers. Each pair of DR and the adjacent spacer is called a direct variant repeat (DVR). The original spoligotyping method uses 43 of 68 DVRs in the region and detects the presence or absence of these spacers by a PCR-based dot-blot hybridization technique [5, 7]. Microbead techniques are also applied to the typing [8, 9]. Alignment of the spacers is well conserved, and the genetic diversity in the region is mostly due to deletion of DVR [7]. International databases for the spoligotypes of these 43 spacers have been developed; SpolDB4 contains data from 40,000 MTBC isolates from 120 countries [10], and the publicly available online database SITVITWEB has over 60,000 isolates from 150 countries [11]. Although spoligotypes are not always consistent with phylogenetic groups [12], some of the spoligotype families are relatively unique. For instance, the East African–Indian (EAI) family strains belong to lineage 1 [13], and the Beijing family strains belong to lineage 2. Thus, conventional spoligotyping system is still informative and practically useful in countries or areas where a variety of MTBC lineages and sublineages coexist.
Spoligotype-defined EAI family strains belonging to lineage 1 are prevalent in East Africa, South Asia, and Southeast Asia [14, 15]. Of these, the EAI4-VNM subfamily is known to be one of the typical Vietnamese M. tuberculosis genotypes [10, 11], but the relationship between lineage 1 strains and the EAI4 subfamily has not been fully elucidated. In the present study, we explored these genetic features in central Vietnam and compared classical spoligotyping results with WGS.
Materials and methods
M. tuberculosis strains
Cultured M. tuberculosis isolates were collected from 186 patients with active pulmonary tuberculosis in Da Nang, Vietnam between January 2015 and November 2016. The study was approved by the Ethics Committee for Biomedical Researches, National Hospital of Pediatrics, Vietnam. Written informed consent was collected from all study participants. Genomic DNA was extracted using Isoplant kits (Nippon Gene, Tokyo, Japan). Beijing and non-Beijing strains were discriminated by a single nucleotide variation (SNV) at position 779,615 of H37Rv (AL123456) [16] as described previously [17], and lineage 1 strains were further identified in the non-Beijing group by detecting SNV at position 649,345 [18] using real-time PCR.
Spoligotyping
Spoligotyping was performed according to a standard protocol [5, 19]. Classification of the spoligotype family was based on the international database SITVIT [11]. Spoligotype patterns characterized by the absence of spacers 26, 27, 29 to 32, and 34 and the presence of 33, but not registered in the database, were regarded as EAI4-like strains in the present study.
WGS analysis of M. tuberculosis isolates
A library for WGS analysis was prepared from 200 ng of genomic DNA with the TruSeq Nano DNA LT Sample Preparation Kit (Illumina, San Diego, CA, USA). To improve the amplification of GC-rich sequences of the M. tuberculosis genome, a PCR step in the library preparation was performed using KOD FX Neo (Toyobo, Osaka, Japan). Paired-end (2 × 250 bp or 2 × 300 bp) sequencing was performed using MiSeq (Illumina).
Paired-end fastq files were used for de novo assembly using the Platanus trimming tool-1.0.7 and Platanus_assmbler-1.2.4 [20], and their quality was evaluated using QUAST 4.3 [21]. Contig sequences including the DR region were selected from gap-closed fasta files, and alignment of the spacers was further analyzed using Genetyx-Mac (Genetyx, Tokyo, Japan). Fastq files were also subjected to in silico spoligotyping using the SpolPred [22] or SpoTyping-v2.1-commandLine [23] tool. To identify the presence or absence of 68 spacer sequences in fastq files in silico, a spacer-EX.fasta file with 68 probes, consisting of 25 nucleotides each, was prepared (S1 Fasta file) and served for the standard nucleotide BLAST+ program in the extended SpoTyping tool. The BLAST+ program (ncbi-blast-2.4.0+) was used to further search for nucleotide matches between the 43 original spoligotyping probes and the otherwise unused spacer sequences. The presence or absence of region of difference (RD) 239 was assessed by RD-Analyzer [24] and used to determine lineage 1, and IS6110 insertion sites were identified by ISMapper [25].
Sanger sequencing of DNA fragments hybridized with spacer 26 probe by spoligotyping
After hybridization, a membrane with a weakly positive signal at position 26 was excised, briefly washed, immersed in 1× PCR buffer, and heated at 95°C for 10 min. Eluted DNA was re-amplified by PCR with DRa and DRb primers with additional adapter sequences (underlined) at their 5′ end (R1-Dra, 5′-CTGGAGTTCAGACGTGTGGTTTTGGGTCTGACGAC-3′; R2-DRb, 5′-CTCTTTCCCTACACGACCCGAGAGGGGACGGAAAC-3′). The amplified products were subjected to direct sequencing with primers R1 (5′-CTGGAGTTCAGACGTGT-3′) or R2 (5′-CTCTTTCCCTACACGACC-3′) using the BigDye Terminator v3.1 Cycle Sequencing Kit (ThermoFisher Scientific, Waltham, MA, USA) and a 3500xl Genetic Analyzer (ThermoFisher Scientific).
Sequences of lineage 1 strains downloaded from public databases and phylogenetic analysis
Fastq files of lineage 1 strains reported in Asia and Africa were randomly downloaded from public databases. [26–28]. When their lineage information was unknown, lineage-specific mutations were identified using TB Profiler [29]. As a result, WGS data from 43 lineage 1 strains were obtained and subjected to the in silico spoligotyping methods SpolPred and SpoTyping (S1 Table). In addition, sequencing reads of Da Nang isolates and the downloaded reads were mapped to the M. tuberculosis reference genome H37Rv (AL123456), and SNVs were called using CLC Genomics Workbench 9.5 (QIAGEN, Hilden, Germany), with the following parameters: minimum coverage = 10; minimum central quality (Phred score) = 20; and minimum neighborhood quality = 15. After removal of SNVs in the PE/PPE/PGRS genes, concatenated SNVs were obtained and a phylogenetic tree was constructed using RAxML version 8.2.8 [30], with a maximum likelihood search and 100 rapid bootstrap analyses, and then visualized with FigTree v1.4.3 [https://github.com/rambaut/figtree/releases] using M. canettii (ERR313114) as an outgroup.
Results
Characterization of M. tuberculosis isolates in Da Nang, Vietnam using spoligotyping
Of the 186 Da Nang isolates, 63 (33.9%) Beijing and 123 (66.1%) non-Beijing isolates were identified, and 86 of the 123 isolates belonged to lineage 1. Genomic DNA samples from the 86 Vietnamese lineage 1 isolates were subjected to spoligotyping. Of these, 74 spoligotypes were consistent with or very similar to EAI patterns, which were characterized by the absence of spacers 29 to 32 and 34 and the presence of spacer 33 [10, 31]. The remaining three lacked spacer 33, and nine had larger spacer deletions, showing non-EAI spoligo patterns.
Ambiguous spoligotypes observed in lineage 1 strains
Among those showing EAI spoligo patterns, typical EAI4-VNM strains are characterized by the additional deletion of spacers 26 and 27 [10, 31]. As shown in Fig 1, the hybridization signal of spacer 26 was weak but visible in most of the Da Nang strains. Depending on the assessment of spacer 26, i.e., positive or negative, it was possible to assign two different octal codes and shared international type (SIT) numbers, such as SIT139 in EAI4-VNM and SIT152 (= SIT139 + positive spacer 26) in EAI5 (Table 1). To investigate the possible cause of this ambiguity, we selected 12 isolates that served for WGS analysis.
[Figure omitted. See PDF.]
Fig 1. Membrane hybridization patterns using conventional spoligotyping probes.
DN-038 to DN-059 were representative lineage 1 isolates from Da Nang. BCG and H37Rv strains were used as controls. The density of dots at position 26 (▼) was weak compared to other positive dots in many isolates.
https://doi.org/10.1371/journal.pone.0186800.g001
[Figure omitted. See PDF.]
Table 1. Ambiguous spoligotyping results from lineage 1 strains in Da Nang.
https://doi.org/10.1371/journal.pone.0186800.t001
Genomic analysis using a de novo assembled sequence of the DR region
De novo assemblies resulted in gap-closed fasta files comprising 58–79 contigs of 500 bp or longer (S2 Table). Of these, one or two contig sequences containing the entire DR region were identified and examined for the presence or absence of 68 spacers, consisting of 43 used and 25 not used, though previously reported, for conventional spoligotyping [7]. The alignment of these 68 spacers was identical to that shown in a previous report, and the assembled contigs did not hold any novel spacer sequences (Fig 2A). The spacer 26 sequence in the original description of spoligotyping [5] was found in the contig of the EAI5 isolate and showed a strong signal at spacer position 26 but was not found in other Vietnamese lineage 1 isolates that had weak signals at position 26. Because these isolates carried 13–24 additional spacer sequences not used for conventional typing, we hypothesized that the ambiguous signal observed at spacer position 26 might be because of cross-hybridization with one of the extra spacer sequences co-amplified by PCR. Based on a sequence similarity search, a nucleotide sequence of the 17th spacer, reported by van Embden et al. [7], but not used for the 43-spacer spoligotyping, was 88% (22/25 nucleotides) identical to the spacer 26 typing probe [5] (Fig 2B). The strength of similarity of the spacer 26 typing probe to van Embden’s 17th spacer [7], when assessed by e-value and bit-score of BLAST+, was prominent among all comparisons between the 43 original probes and the otherwise unused spacer sequences (S3 Table). Two EAI4-like isolates (DN-012 and DN-004) that lacked the 17th spacer in the assembled contig (Fig 2A) did not show any ambiguous hybridization signal at position 26 in the conventional typing (Fig 1).
[Figure omitted. See PDF.]
Fig 2. Presence/absence of 68 spacers in de novo contig sequences and probe 26 sequence similarity.
(A) The presence and absence of 68 spacer sequences are shown. Data from DN-038 to DN-059, 12 isolates from Da Nang, were derived from the de novo contig sequences. Data from BCG and H37Rv were obtained from their complete genomic sequences, AM408590.1 and AL123456.2, respectively. The present spacer is shown by a closed square (■). The positions of Kamerbeek’s spacer 26 [5] and van Embden's 17th spacer [7] are marked (▼). (B) The degree of sequence identity between Kamerbeek’s spacer 26 [5] and van Embden's 17th spacer [7]. The oligonucleotide sequence for the spacer 26 probe is boxed in the figure; 22/25 nucleotides (88%) are identical to those of van Embden’s 17th spacer. Nucleotides in the adjacent DR sequences are shown in small grey letters.
https://doi.org/10.1371/journal.pone.0186800.g002
Experimental detection of the DNA sequence cross-hybridized to the spacer 26 probe
In DN-038 and DN-049, DNA fragments hybridized to the membrane of spacer 26 were eluted and re-amplified by PCR. Direct sequencing revealed that the PCR product from DN-049 had the exact nucleotide sequence as the 17th spacer described by van Embden et al. [7] (S1 Fig), whereas the product from DN-038 failed to be assessed because of the presence of mixed bases.
In silico spoligotyping and characteristics of Vietnamese lineage 1 isolates
As shown in S1 Table, in silico spoligotyping of 43 spacers using the short reads of WGS showed no discordance between SpolPred and Spotyping when analytical conditions were optimized. The DN-038 isolate had SNV (position 3,120,278 of H37Rv, AL123456) in the probe region of spacer 28. It was not detected in a setting to identify a complete match. When the Spotyping program was extended to detect the presence or absence of the full set of the 68 spacer sequences reported by van Embden et al. [7], the signal patterns were perfectly matched with those obtained by the de novo assembly (S1 Table and Fig 2). Encouraged by these reproducible results from in silico typing, 43 downloaded sequences of lineage 1 strains were also subjected to extended Spotyping, and the patterns of the 68 spacers were compared with those of the Da Nang strains. As a result, the absence of spacers 26 and 27 by the numbering of Kamerbeek et al. [5] was unique to Vietnamese EAI4 isolates, and Vietnamese isolates showing the SIT405 pattern (ZERO) in conventional typing revealed a large deletion spanning 35 of the 68 spacer regions (S1 Table). As expected, absence of RD239 sequence, a specific marker of lineage 1 strains, was observed in EAI4-VNM, other EAI-like strains, and those showing the SIT405 pattern analyzed in Da Nang (n = 12), as well as in other EAI family strains extracted from the public database (n = 43) (S1 Table).
When the IS6110 sequence was searched in the short reads using ISMapper, at most one IS6110 was detected in the Vietnamese EAI4 and EAI5 isolates, whereas isolates showing the SIT405 pattern did not have any detectable IS6110 sequences (S1 Table). We confirmed that only one IS6110 was detected from a complete genome sequence of a Hanoi strain with the EAI-VNM pattern, recently deposited by our group as accession number AP018033 (data not shown) [32].
In silico spoligotyping of lineage 1 strains from other geographical regions
Of 43 lineage 1 strains from the public database, 38 belonged to one of the following clades: EAI1-SOM, EAI2-Manila, EAI2-Nonthaburi, EAI3-IND, EAI4-VNM, EAI5, or EAI6-BGD. The remaining five were non-EAI strains with deletions of many spacers, as determined by WGS analysis using the Spotyping and SpolPred tools (S1 Table).
Phylogenetic analysis
When a phylogenetic tree was constructed with concatenated SNVs of the Da Nang M. tuberculosis isolates analyzed in the present study and with other lineage 1 strains from the public database, Vietnamese EAI-VNM formed a closely related group, the L1.1.1 clade including L1.1.1.1 by Coll et al. [2], which was discrete from the EAI strains of other geographical regions in Asia and Africa (L1.1.2, L1.1.3, L1.2.1, and L1.2.2). Non-EAI isolates showing the SIT405 pattern and Vietnamese EAI5 were phylogenetically included in the L1.1.1 (EAI4-VNM) group (Fig 3), two non-EAI strains (SIT346) from Pakistan were included in the L1.2.2 (EAI1-SOM) group, and one non-EAI strain (SIT1391) from Taiwan was included in L1.1.3 (EAI6-BGD) group.
[Figure omitted. See PDF.]
Fig 3. Phylogenetic tree of lineage 1 strains comprising the 12 isolates from Da Nang and 43 from the public database.
A phylogenetic tree based on concatenated SNVs from lineage 1 strains (the 12 isolates from Da Nang and 43 from the public database) was constructed using RAxML. M. canettii was used as an outgroup. The tree was visualized using FigTree v1.4.3. A consensus pattern of spoligotype predicted by in silico spoligotyping programs Spotyping [23] and SpolPred [22] is shown on the right side of each isolate. The scale bar indicates the number of substitutions per site. Lineage classification by TB Profiler is also shown.
https://doi.org/10.1371/journal.pone.0186800.g003
Discussion
In the present study, we used spoligotyping and WGS to analyze M. tuberculosis isolates of lineage 1 from Da Nang, Vietnam. EAI4-VNM and the patterns similar to EAI4-VNM, were major spoligotypes that formed a distinct clade, together with Vietnamese EAI5 and non-EAI (SIT405) spoligotype strains, in a phylogenetic tree. They were separated from lineage 1 strains currently observed in other Asian and African regions.
The EAI family in spoligotyping was originally defined by the absence of spacers 29 to 32 and 34 and the presence of spacer 33 [10, 31] and was found to be distributed among countries in east Africa, south Asia, and Southeast Asia [14, 15]. In a classic spoligotyping analysis, Buu et al. reported that EAI family strains were found in 36.3% of 2,207 samples in southern Vietnam [33]. In northern Vietnam, the EAI strains were found in 19.5% of 465 isolates [17]. In southern and northern Vietnam, Beijing strains were also prevalent, whereas in Hue, located at the center of Vietnam, EAI isolates comprised 58.0% of 100 isolates [34]. Da Nang is also located in the center of Vietnam; in the present study, EAI was the most frequent spoligotype (39.8%). Both EAI4-VNM and EAI5 were observed in the above mentioned studies, and we confirmed that EAI strains are still predominant in central Vietnam.
In the present study, we clearly demonstrated that the cross-hybridization between spacer 26 probe and van Embden’s 17th spacer caused ambiguous spoligo patterns, which was initially suspected by the results of WGS analysis using de novo assembly and in silico spoligotyping tools and further corroborated by DNA elution and amplification using Sanger direct sequencing. The presence of van Embden’s 17th spacer is one of the characteristics of EAI strains [7], and the absence of spacers 26 and 27 is a hallmark of EAI4-VNM. Therefore, cross-hybridization occurs predominantly in the Vietnamese strains. Moreover, the percent sequence similarity between the spacer 26 probe and van Embden’s 17th spacer was prominent among all comparisons between 43 spoligotyping probes and 25 other spacer sequences. Previous Vietnamese studies have shown that EAI isolates have irregular spoligo patterns, namely, the absence of spacers 27, 29 to 32, and 34 and the presence of spacer 26 [34, 35], possibly because of cross-hybridization, as we observed herein. In addition, other studies have also noted that the EAI5 spoligotype SIT152 is closely related to the EAI4 spoligotype SIT139, with a difference of a single positive spacer 26 [36, 37]. Honisch et al. found that 10 of 325 M. tuberculosis strains exhibited discordant spoligotyping results between methods and showed that the presence of spacer 26 in membrane-based results was not reproduced in their MALDI-TOF MS-derived spoligo pattern [38], although the cause of the discrepancy in their study was not revealed.
The EAI family belongs to lineage 1, originally defined by SNV and LSP typing with deletion of RD239 [12, 13]; this finding was confirmed by WGS [2]. According to a recent hypothesis, M. tuberculosis lineage 1, starting in Africa, has spread in the southern part of Asia as a result of human migration [27]. Presumably reflecting a long evolutionary history, it is conceivable that the EAI family has been divided into several region-specific subgroups apart from their prototype: EAI5 and EAI1-SOM (Somalia); EAI2-Manila (Philippines); EAI2-Nonthaburi (Thailand); EAI3-IND (India); EAI4-VNM (Vietnam); EAI6-BGD/1 and EAI7-BGD/2 (Bangladesh); and EAI8-MDG (Madagascar) [10]. Typical EAI4-VNM is characterized by the additional deletion of spacers 26 and 27 [10, 31]. Although non-EAI strains showing the SIT405 pattern (ZERO) in Da Nang had a large deletion in the DR region with many spacers missing, SIT405 strains were regarded as lineage 1 by LSP in two previous studies [39, 12]. In our WGS analysis, SIT405 strains in Da Nang were phylogenetically very similar to EAI4-VNM, forming a monophyletic group. Non-EAI strains with a series of spacer deletions from Pakistan and Taiwan belonged to the L1.2.2 (EAI1-SOM) and L1.1.3 (EAI6-BGD) clades, respectively. Extensive spacer deletions, including the SIT405 pattern, may have recently emerged across subgroups of the EAI family.
Our study also suggested that the number of IS6110 sequences was quite different among EAI families. EAI4-VNM strains have only a few IS6110 sequences, whereas EAI2 strains have more than 10. Roychowdhury et al. reported that EAI4 strains have only one IS6110 sequence [40]; complete genome sequencing of an EAI-VNM strain in Hanoi, Vietnam, has also indicated this [32]. Complete genome sequences derived from a variety of lineages can be used as better references for the mapping of WGS and may contribute to more accurate genotyping in the future.
The spoligotyping method is still useful to discriminate lineage 1 strains from other M. tuberculosis families, including Beijing strains, particularly in countries where several MTBC lineages are mixed and spread together. Genomic features, including alignment of all spacers in the DR region, should be investigated to determine the correct genotypes. In addition, WGS data may help to analyze the relationships between apparently distinct spoligo patterns. According to recent reports, host responses to M. tuberculosis are different between ancient MTBC lineages (lineages 1, 5, and 6) and modern MTBC lineages (lineages 2, 3, and 4) [41]. Future progress in genomic research, including lineage 1, will help further the understanding of lineage- or clade-specific phenotypes, diversity of the pathogen, and its interaction with the host.
Supporting information
[Figure omitted. See PDF.]
S1 Table. Summary of isolates included in the analysis.
https://doi.org/10.1371/journal.pone.0186800.s001
(XLSX)
S2 Table. Evaluation of genome assemblies by computing various metrics using QUAST 4.3 with AL123456 as a reference genome.
https://doi.org/10.1371/journal.pone.0186800.s002
(XLSX)
S3 Table. Best nucleotide matches between the 43 original spacer probes and unused spacer sequences, assessed by blastn-short with tabular output format 6.
https://doi.org/10.1371/journal.pone.0186800.s003
(XLSX)
S1 Fig. Electropherogram of bidirectional sequencing of the amplified DNA bound to spacer 26 probe in DN-049 isolate.
The 17th spacer sequence reported by van Embden, et al. [7] was obtained by direct sequencing using the R2 primer, while the complementary sequence of the 17th spacer was obtained using the R1 primer.
https://doi.org/10.1371/journal.pone.0186800.s004
(TIF)
S1 Fasta. Fasta file for identification of the presence or absence of 68 spacer sequences in fastq files in silico using the extended SpoTyping tool.
https://doi.org/10.1371/journal.pone.0186800.s005
(FASTA)
Acknowledgments
The authors would like to thank Akiko Miyabayashi and Keiko Wakabayashi (The Research Institute of Tuberculosis, JATA), Nguyen Thi Kieu Diem and Nguyen Thi Thanh Yen (Da Nang Lung Hospital) for technical assistance, Nguyen Thu Huyen (NCGM-BMH Medical Collaboration Center) for monitoring site implementation, staff of Da Nang Lung Hospital, and relevant district TB centers for participating in this study.
Citation: Hijikata M, Keicho N, Duc LV, Maeda S, Hang NTL, Matsushita I, et al. (2017) Spoligotyping and whole-genome sequencing analysis of lineage 1 strains of Mycobacterium tuberculosis in Da Nang, Vietnam. PLoS ONE12(10): e0186800. https://doi.org/10.1371/journal.pone.0186800
1. Kato-Maeda M, Metcalfe JZ, Flores L. Genotyping of Mycobacterium tuberculosis: application in epidemiologic studies. Future Microbiol. 2011;6(2):203–16. pmid:21366420
2. Coll F, McNerney R, Guerra-Assunção JA, Glynn JR, Perdigão J, Viveiros M, et al. A robust SNP barcode for typing Mycobacterium tuberculosis complex strains. Nat Commun. 2014;5:4812. pmid:25176035
3. Gagneux S, DeRiemer K, Van T, Kato-Maeda M, de Jong BC, et al. Variable host-pathogen compatibility in Mycobacterium tuberculosis. Proc Natl Acad Sci U S A. 2006;103(8):2869–73. pmid:16477032
4. Firdessa R, Berg S, Hailu E, Schelling E, Gumi B, et al. Mycobacterial lineages causing pulmonary and extrapulmonary tuberculosis, Ethiopia. Emerg Infect Dis. 2013;19(3):460–3. pmid:23622814
5. Kamerbeek J, Schouls L, Kolk A, van Agterveld M, van Soolingen D, Kuijper S, et al. Simultaneous detection and strain differentiation of Mycobacterium tuberculosis for diagnosis and epidemiology. J Clin Microbiol. 1997;35(4):907–14. pmid:9157152
6. van Embden JD, Cave MD, Crawford JT, Dale JW, Eisenach KD, Gicquel B, et al. Strain identification of Mycobacterium tuberculosis by DNA fingerprinting: recommendations for a standardized methodology. J Clin Microbiol. 1993;31(2):406–9. pmid:8381814
7. van Embden JD, van Gorkom T, Kremer K, Jansen R, van Der Zeijst BA, Schouls LM. Genetic variation and evolutionary origin of the direct repeat locus of Mycobacterium tuberculosis complex bacteria. J Bacteriol. 2000;182(9):2393–401. pmid:10762237
8. Cowan LS, Diem L, Brake MC, Crawford JT. Transfer of a Mycobacterium tuberculosis genotyping method, Spoligotyping, from a reverse line-blot hybridization, membrane-based assay to the Luminex multianalyte profiling system. J Clin Microbiol. 2004;42(1):474–7. pmid:14715809
9. Zhang J, Abadia E, Refregier G, Tafaj S, Boschiroli ML, Guillard B, et al. Mycobacterium tuberculosis complex CRISPR genotyping: improving efficiency, throughput and discriminative power of 'spoligotyping' with new spacers and a microbead-based hybridization assay. J Med Microbiol. 2010;59(Pt 3):285–94. pmid:19959631
10. Brudey K, Driscoll JR, Rigouts L, Prodinger WM, Gori A, Al-Hajoj SA, et al. Mycobacterium tuberculosis complex genetic diversity: mining the fourth international spoligotyping database (SpolDB4) for classification, population genetics and epidemiology. BMC Microbiol. 2006;6:23. pmid:16519816
11. Demay C, Liens B, Burguière T, Hill V, Couvin D, Millet J, et al. SITVITWEB—a publicly available international multimarker database for studying Mycobacterium tuberculosis genetic diversity and molecular epidemiology. Infect Genet Evol. 2012;12(4):755–66. pmid:22365971
12. Comas I, Homolka S, Niemann S, Gagneux S. Genotyping of genetically monomorphic bacteria: DNA sequencing in Mycobacterium tuberculosis highlights the limitations of current methodologies. PLoS One. 2009;4(11):e7815. pmid:19915672
13. Kato-Maeda M, Gagneux S, Flores LL, Kim EY, Small PM, Desmond EP, et al. Strain classification of Mycobacterium tuberculosis: congruence between large sequence polymorphisms and spoligotypes. Int J Tuberc Lung Dis. 2011;15(1):131–3. pmid:21276309
14. Ismail F, Couvin D, Farakhin I, Abdul Rahman Z, Rastogi N, Suraiya S. Study of Mycobacterium tuberculosis complex genotypic diversity in Malaysia reveals a predominance of ancestral East-African-Indian lineage with a Malaysia-specific signature. PLoS One. 2014;9(12):e114832. pmid:25502956
15. Mbugi EV, Katale BZ, Streicher EM, Keyyu JD, Kendall SL, Dockrell HM, et al. Mapping of Mycobacterium tuberculosis Complex Genetic Diversity Profiles in Tanzania and Other African Countries. PLoS One. 2016;11(5):e0154571. pmid:27149626
16. Nakajima C, Tamaru A, Rahim Z, Poudel A, Maharjan B, Khin Saw Aye, et al. Simple multiplex PCR assay for identification of Beijing family Mycobacterium tuberculosis isolates with a lineage-specific mutation in Rv0679c. J Clin Microbiol. 2013;51(7):2025–32. pmid:23596248
17. Maeda S, Hang NT, Lien LT, Thuong PH, Hung NV, Hoang NP, et al. Mycobacterium tuberculosis strains spreading in Hanoi, Vietnam: Beijing sublineages, genotypes, drug susceptibility patterns, and host factors. Tuberculosis (Edinb). 2014;94(6):649–56.
18. Chuang PC, Chen HY, Jou R. Single-nucleotide polymorphism in the fadD28 gene as a genetic marker for East Asia Lineage Mycobacterium tuberculosis. J Clin Microbiol. 2010;48(11):4245–7. pmid:20826639
19. Molhuizen HO, Bunschoten AE, Schouls LM, van Embden JD. Rapid detection and simultaneous strain differentiation of Mycobacterium tuberculosis complex bacteria by spoligotyping. In: Parish T and Stoker NG, editors. Methods in Molecular Biology, vol. 101: Mycobacteria protocols. 1st ed. New York:Humana press;1998. p. 381–94.
20. Kajitani R, Toshimoto K, Noguchi H, Toyoda A, Ogura Y, Okuno M, et al. Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads. Genome Res. 2014;24(8):1384–95. pmid:24755901
21. Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29(8):1072–5. pmid:23422339
22. Coll F, Mallard K, Preston MD, Bentley S, Parkhill J, McNerney R, et al. SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences. Bioinformatics. 2012;28(22):2991–3. pmid:23014632
23. Xia E, Teo YY, Ong RT. SpoTyping: fast and accurate in silico Mycobacterium spoligotyping from sequence reads. Genome Med. 2016;8(1):19. pmid:26883915
24. Faksri K, Xia E, Tan JH, Teo YY, Ong RT. In silico region of difference (RD) analysis of Mycobacterium tuberculosis complex from sequence reads using RD-Analyzer. BMC Genomics. 2016;17(1):847. pmid:27806686
25. Hawkey J, Hamidian M, Wick RR, Edwards DJ, Billman-Jacobe H, Hall RM, et al. ISMapper: identifying transposase insertion sites in bacterial genomes from short read sequence data. BMC Genomics. 2015;16:667. pmid:26336060
26. Manson AL, Cohen KA, Abeel T, Desjardins CA, Armstrong DT, Barry CE 3rd, et al. Genomic analysis of globally diverse Mycobacterium tuberculosis strains provides insights into the emergence and spread of multidrug resistance. Nat Genet. 2017;49(3):395–402. pmid:28092681
27. Comas I, Coscolla M, Luo T, Borrell S, Holt KE, Kato-Maeda M, et al. Out-of-Africa migration and Neolithic coexpansion of Mycobacterium tuberculosis with modern humans. Nat Genet. 2013;45(10):1176–82. pmid:23995134
28. Coker OO, Chaiprasert A, Ngamphiw C, Tongsima S, Regmi SM, Clark TG, et al. Genetic signatures of Mycobacterium tuberculosis Nonthaburi genotype revealed by whole genome analysis of isolates from tuberculous meningitis patients in Thailand. PeerJ. 2016;4:e1905. pmid:27114869
29. Coll F, McNerney R, Preston MD, Guerra-Assunção JA, Warry A, Hill-Cawthorne G, et al. Rapid determination of anti-tuberculosis drug resistance from whole-genome sequences. Genome Med. 2015;7(1):51. pmid:26019726
30. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3. pmid:24451623
31. Filliol I, Driscoll JR, Van Soolingen D, Kreiswirth BN, Kremer K, Valétudie G, et al. Global distribution of Mycobacterium tuberculosis spoligotypes. Emerg Infect Dis. 2002;8(11):1347–9. pmid:12453368
32. Wada T, Hijikata M, Maeda S, Hang NTL, Thuong PH, Hoang NP, et al. Complete Genome Sequence of a Mycobacterium tuberculosis Strain Belonging to the East African-Indian Family in the Indo-Oceanic Lineage, Isolated in Hanoi, Vietnam. Genome Announc. 2017;5(24). pii: e00509–17. pmid:28619797
33. Buu TN, van Soolingen D, Huyen MN, Lan NT, Quy HT, Tiemersma EW, et al. Increased transmission of Mycobacterium tuberculosis Beijing genotype strains associated with resistance to streptomycin: a population-based study. PLoS One. 2012;7(8):e42323. pmid:22912700
34. Nguyen VA, Bañuls AL, Tran TH, Pham KL, Nguyen TS, Nguyen HV, et al. Mycobacterium tuberculosis lineages and anti-tuberculosis drug resistance in reference hospitals across Viet Nam. BMC Microbiol. 2016;16(1):167. pmid:27464737
35. Nguyen VA, Choisy M, Nguyen DH, Tran TH, Pham KL, Thi Dinh PT, et al. High prevalence of Beijing and EAI4-VNM genotypes among M. tuberculosis isolates in northern Vietnam: sampling effect, rural and urban disparities. PLoS One. 2012;7(9):e45553. pmid:23029091
36. Zhang J, Heng S, Le Moullec S, Refregier G, Gicquel B, Sola C, et al. A first assessment of the genetic diversity of Mycobacterium tuberculosis complex in Cambodia. BMC Infect Dis. 2011;11:42. pmid:21299851
37. Pichat C, Couvin D, Carret G, Frédénucci I, Jacomo V, Carricajo A, et al. Combined Genotypic, Phylogenetic, and Epidemiologic Analyses of Mycobacterium tuberculosis Genetic Diversity in the Rhône Alpes Region, France. PLoS One. 2016;11(4):e0153580. pmid:27128522
38. Honisch C, Mosko M, Arnold C, Gharbia SE, Diel R, Niemann S. Replacing reverse line blot hybridization spoligotyping of the Mycobacterium tuberculosis complex. J Clin Microbiol. 2010;48(5):1520–6. pmid:20200291
39. Duong DA, Nguyen TH, Nguyen TN, Dai VH, Dang TM, Vo SK, et al. Beijing genotype of Mycobacterium tuberculosis is significantly associated with high-level fluoroquinolone resistance in Vietnam. Antimicrob Agents Chemother. 2009;53(11):4835–9. pmid:19721073
40. Roychowdhury T, Mandal S, Bhattacharya A. Analysis of IS6110 insertion sites provide a glimpse into genome evolution of Mycobacterium tuberculosis. Sci Rep. 2015;5:12567. pmid:26215170
41. Comas I, Gagneux S. A role for systems epidemiology in tuberculosis research. Trends Microbiol. 2011;19(10):492–500. pmid:21831640
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2017 Hijikata et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Background
Spacer oligonucleotide typing (spoligotyping), a widely used, classical genotyping method for Mycobacterium tuberculosis complex (MTBC), is a PCR-based dot-blot hybridization technique to detect the genetic diversity of the direct repeat (DR) region. Of the seven major MTBC lineages in the world, lineage 1 (Indo-Oceanic) mostly corresponds to the East African–Indian (EAI) spoligotype family in East Africa and Southeast Asia.
Objectives
We investigated the genomic features of Vietnamese lineage 1 strains, comparing spoligotype patterns using whole-genome sequencing (WGS) data.
Methods
M. tuberculosis strains isolated in Da Nang, Vietnam were subjected to conventional spoligotyping, followed by WGS analysis using a high-throughput sequencer. Vietnamese lineage 1 strains were further analyzed with other lineage 1 strains obtained from a public database.
Results
Indicating a major spoligotype in Da Nang, 86 (46.2%) of the 186 isolates belonged to the EAI family or lineage 1. Although typical EAI4-VNM strains are characterized by the deletion of spacers 26 and 27, 65 (75.6%) showed ambiguous signals on spacer 26. De novo assembly of the entire DR region and in silico spoligotyping analysis suggested the absence of spacer 26, and direct sequencing revealed that the 17th spacer sequence not used for conventional typing, was cross-hybridized to the spacer 26 probe. Vietnamese EAI4-VNM, other EAI-like strains, and those showing a non-EAI pattern lacking many spacers formed a monophyletic group separate from other EAI families in the world.
Conclusion
Information about the alignment of spacers in the entire DR region obtained from WGS data provides a clue for the determination of experimentally ambiguous spoligo patterns. WGS data also helped to analyze the hidden relationships between apparently distinct spoligo patterns.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer