Insights into genomic variations between Mycobacterium tuberculosis compartmentalized in the lungs and blood of HIV-infected individuals in Uganda
Abstract
Introduction: Mycobacterium tuberculosis (MTB), particularly clinical strains present a
genomic diversity that ranges from several single nucleotide polymorphisms (SNPs) to insertions
and deletions (INDELS). This study assessed the genetic variations between sequences of
isolates belonging to the MTB Uganda family, a sub-lineage of the MTB lineage 4, from the
lungs and blood of TB-HIV positive individuals in Uganda with the hypothesis that sequences
of isolates from the blood have unique SNPs and INDELs that characterize MTB survival.
Methods: A case-control study was conducted in-silico. Twenty-four MTB blood and lung
sequences were examined in this study. The sequences were aligned against the H37Rv (RefSeq:
NC_000962.3) reference genome using BWA-MEM. The alignment quality was checked using
IGV. SAMtools was used to sort and index the alignment. FreeBayes was used to identify SNPs
and INDELs. The SNPs and INDELs were annotated using SnpEff. The 'cat' and 'grep' commands
were used to compare and analyze variants between the MTB blood and lung sequences, and then
the functions of the genes in which the variants unique to the blood sequences occurred were
identified.
Results: Comparative analysis of the genetic variation of the samples showed that MTB blood
isolates had 11 virulence genes with distinctive non-synonymous SNPs, which were absent in
MTB lung isolates. The majority of the INDELs were found in non-virulence genes, with the
remainder found in both blood and lung isolate sequences.
Discussion: Overall, the 11 virulence genes identified in this study that had distinctive non synonymous SNPs are implicated in pathways that increase CFUs in the lungs and organs, as well
as lower host survival, increase host tissue damage, and enhance tissue pathology, allowing for
human host persistence. The true role of all SNPs mentioned in this study is difficult to determine.
ix
This supports the theory that MTB-blood isolates have SNPs that let them survive longer than
MTB-pulmonary isolates, allowing these strains to stay in the blood of their hosts.
Conclusions: MTB blood sequences have distinctive SNPs, which could explain the capacity of
MTB blood isolates to persist outside of the lungs (in blood) of HIV-infected individuals