OUP user menu

Conservation of Dcm-mediated cytosine DNA methylation in Escherichia coli

Kevin T. Militello, Robert D. Simon, Mehr Qureshi, Robert Maines, Michelle L. Van Horne, Stacy M. Hennick, Sangeeta K. Jayakar, Sarah Pounder
DOI: http://dx.doi.org/10.1111/j.1574-6968.2011.02482.x 78-85 First published online: 1 March 2012


In Escherichia coli, cytosine DNA methylation is catalyzed by the DNA cytosine methyltransferase (Dcm) protein and occurs at the second cytosine in the sequence 5′CCWGG3′. Although the presence of cytosine DNA methylation was reported over 35 years ago, the biological role of 5-methylcytosine in E. coli remains unclear. To gain insight into the role of cytosine DNA methylation in E. coli, we (1) screened the 72 strains of the ECOR collection and 90 recently isolated environmental samples for the presence of the full-length dcm gene using the polymerase chain reaction; (2) examined the same strains for the presence of 5-methylcytosine at 5′CCWGG3′ sites using a restriction enzyme isoschizomer digestion assay; and (3) quantified the levels of 5-methyl-2′-deoxycytidine in selected strains using liquid chromatography tandem mass spectrometry. Dcm-mediated cytosine DNA methylation is conserved in all 162 strains examined, and the level of 5-methylcytosine ranges from 0.86% to 1.30% of the cytosines. We also demonstrate that Dcm reduces the expression of ribosomal protein genes during stationary phase, and this may explain the highly conserved nature of this DNA modification pathway.

  • Escherichia coli
  • 5-methylcytosine
  • dcm
  • LC MS/MS


DNA bases are modified by postreplicative methylation by enzymes termed DNA methyltransferases. In prokaryotes, the most common modified DNA bases are 6-methyladenine and 5-methylcytosine (5mC). The most recognized role of modified DNA bases is in restriction-modification (R-M) systems (Ishikawa et al., 2010). In each R-M system, there is a restriction endonuclease that cleaves foreign DNA and a site-specific DNA methyltransferase that prevents cleavage of host DNA, and in some cases controls expression of the R-M system (O'Driscoll et al., 2005). However, some DNA methyltransferases are not found in conjunction with a cognate restriction enzyme and are termed solitary DNA methyltransferases. In addition to DNA adenine methyltransferase (Dam), Escherichia coli possesses another solitary DNA methyltransferase termed Dcm for DNA cytosine methyltransferase (Marinus & Lobner-Olesen, 2009).

The presence of Dcm was discovered in 1973 by Marinus & Morris. The dcm gene of E. coli K-12 contains 1419 base pairs, and the predicted protein is 472 amino acids (Bhagwat et al., 1986; Hanck et al., 1989). The protein contains ten conserved motifs and a catalytic cysteine residue that is found in all cytosine-5 DNA methyltransferases (Posfai et al., 1989). The Dcm protein methylates the internal C in the sequence 5′CCWGG3′ where W = A/T (Palmer & Marinus, 1994). 5mC is occasionally spontaneously deaminated in an existing C:G base pair, and a T:G mismatch is formed. The dcm gene is in an operon with the very short patch repair (vsr) gene and is controlled by the same promoter. The vsr gene lies downstream of the dcm gene and the genes overlap by seven codons (Sohail et al., 1990). The Vsr protein is an endonuclease that is necessary to remove the new thymine residue (Hennecke et al., 1991) and thus compensates for the mutagenic potential of 5mC. There is evidence that Dcm itself is required for robust very short patch repair of mismatched bacteriophage heteroplexes (Jones et al., 1987; Lieb, 1987; Zell & Fritz, 1987), but this relationship has not been observed in all reports (Sohail et al., 1990). Nonetheless, the sequence 5′CCWGG3′ is still a mutational hot-spot sequence, because not all mismatches are repaired (Lieb & Bhagwat, 1996).

The biological role of the dcm gene and 5′CCWGG3′ cytosine DNA methylation in E. coli remains unclear. The dcm gene is not essential as mutant, deletion, and knockout strains are viable (Marinus & Morris, 1973; Baba et al., 2008). Interestingly, the dcm gene and cytosine DNA methylation are absent from E. coli B (Doskocil & Sormova, 1965; Fujimoto et al., 1965), a host strain used extensively to study bacteriophages T1–T7 (Daegelen et al., 2009). Genome sequencing of E. coli B (REL606) shows that when compared to E. coli strain K-12 MG1665, it has an IS1-associated 41-kbp deletion from uvrY to hchA that comprises ~0.9% of the genome including the dcm gene and 21 flagellar genes (Studier et al., 2009). The loss of the dcm operon in E. coli B may have been coupled to the loss of the nearby flagellar and chemotaxis genes, as strains that lack flagellar and chemotaxis genes have an advantage during laboratory evolution experiments (Asakura et al., 2011). Nonetheless, several pieces of evidence suggest that Dcm has a role in modulating the activity of the EcoRII R-M system in K-12 strains, which also targets 5′CCWGG3′ sequences. Experiments by Takahashi et al. (2002) indicate that loss of a plasmid containing the EcoRII methyltransferase and restriction enzyme genes is higher in dcm+ cells compared to dcm mutant cells, indicating that Dcm protects the genome against attack by this R-M system. Furthermore, Dcm protects the cell from postsegregational killing due to loss of the EcoRII R-M system (Ohno et al., 2008). Also, dcm partially protects DNA from cleavage during entry into a new host containing the EcoRII restriction enzyme (Hattman et al., 1973). However, it is unclear whether there are roles for Dcm beyond the role in the EcoRII R-M system. Therefore, we determined whether the dcm gene and 5mC were present in E. coli clinical strains and strains isolated from water and animal feces (environmental strains). We also tested the hypothesis that dcm influences the process of transcription, as cytosine-5 DNA methyltransferases often have this property.

Materials and methods

Bacterial strains

The E. coli Reference Collection (ECOR), a set of E. coli strains isolated from a variety of hosts and geographical locations (Ochman & Selander, 1984), was obtained from the ‘Shiga-toxin producing E. coli’ Center (STEC) at Michigan State University. Environmental strains of E. coli were isolated from seven different watersheds of Conesus Lake in New York over a 5-year period (Makarewicz, 2009; Simon & Makarewicz, 2009). Fifty-three strains were collected in the streams draining the watersheds, as well as at the mouth of the stream during all seasons of the year. Twenty-three independent strains were also collected from the Conesus Lake near-shore, focusing on those associated with the green alga Cladophora (Whitman et al., 2003; Byappanahalli et al., 2007). Escherichia coli was isolated on m-ColiBlue24 plates (Millipore®; Grant, 1997), and standard microbial testing was used to confirm the identification. All environmental isolates were positive for growth on lactose with gas formation, glucuronidase activity and the production of indole, while they were negative for growth on citrate and urea (APHA, 1999). Additional bacterial strains used in this study are listed in Table .

View this table:

Additional bacterial strains and plasmids used in this study

JM109Wild-type dcm alleleNew England BioLabs
ER2925dcm-6New England BioLabs
GM204Δ(supD-dcm-flaA)Martin Marinus
BW25113Wild-type dcm alleleBaba et al. (2008)
JW1944-2Δdcm-735::kanBaba et al. (2008)
CP9ExPECRusso et al. (1993)
E234E69EPECLevine et al. (1978)
Popeye-1EHEC, O157:H7Crane et al. (2011)
pDcm-9dcm vsrSohail et al. (1990)
pDcm-21dcm+ vsrSohail et al. (1990)
  • Extraintestinal pathogenic E. coli.

  • Enteropathogenic E. coli.

  • Enterohemorrhagic E. coli.

  • Both plasmids were originally constructed by enzyme-mediated deletion of pDcm-4 which contains the entire dcm-vsr operon.

Isolation and genotype analysis of genomic DNA

Bacteria were propagated in Luria-Bertani broth overnight at 37 °C with shaking at 250 r.p.m. Genomic DNA was isolated from 2-mL cultures of stationary phase cells using a DNeasy Blood and Tissue Kit (Qiagen), and RNase A was added at 200 μg mL−1 during lysis. Typical DNA preparations had A260 nm/A280 nm readings of 1.8–2.1 and were 80–120 ng DNA μL−1. A triplex PCR-based method for chuA, yjaA, and TSPE4.C2 was used to assign environmental isolates of E. coli to phylogenetic groups A, B1, B2, and D (Table ; Clermont et al., 2000). Templates were either isolated genomic DNA or bacteria extracted in boiling TE buffer. Increasing Mg2+ to 3 mM in the PCR generated stronger products compared to 1.5 mM Mg2+.

View this table:

Characteristics of environmental and ECOR strains used in this study

E. coli phylogenomic group

PCR for the Dcm gene

PCR was carried out in 30-μL reactions containing 100 ng of genomic DNA or DNA from bacteria boiled in TE buffer, 0.3 μm of forward primer, 0.15 μM of reverse primer I, 0.15 μM of reverse primer II, 0.2 mM dNTPs, 1.5 mM MgCl2, and 0.75 units of TAQ DNA polymerase (Promega). Primer sequences are listed in Supporting Information, Fig. S1. The reaction conditions were one cycle of 95 °C for 2 min, 32 cycles of 95 °C for 1 min, 55 °C for 1 min, 72 °C for 1.5 min, and a final cycle of 72 °C for 10 min. PCR products were analyzed by agarose gel electrophoresis and ethidium bromide staining.

Restriction enzyme isoschizomer assay

The restriction enzymes BstNI and PspGI were purchased from New England BioLabs. Reactions were carried out using 20 μL volumes that contained 1 μg of genomic DNA and 0.3–0.5 units of enzyme. The DNAs were digested at 60 °C for 2 h, and the products were analyzed by gel electrophoresis on 1% agarose gels and ethidium bromide staining. PspG1 was used at 60 °C even though the optimal working temperature for the enzyme is 75 °C (New England Biolabs) because the DNA degraded at 75 °C (data not shown). Every experiment included DNA isolated from a dcm+ strain as a positive control (JM109 or BW25113) and DNA isolated from a dcm strain as a negative control (ER2925, JW1944-2, or unmethylated phage lambda DNA).

Liquid chromatography tandem mass spectrometry

The levels of 5-methyl-2′-deoxcytidine (5mdC) in DNA samples were determined by liquid chromatography and tandem mass spectrometry (LC MS/MS) at the Roswell Park Pharmacodynamics/Pharmacokinetics Resource (Buffalo, NY) as previously described (Song et al., 2005; Militello et al., 2008). Briefly, isolated E. coli DNA (1 μg) from overnight cultures was digested to nucleosides using sequential treatment with S1 nuclease, snake venom phosphodiesterase, and alkaline phosphatase before separation on a dC18 column. Tandem mass spectrometry was used to detect the molecular ion (242.1 atomic mass units) and product ion (126.3 atomic mass units) for 5mdC. Simultaneously, the molecular ion and product ion for 2′-deoxyguanosine were detected. The ratios of 5mdC to 2′-deoxyguanosine in the experimental samples were compared to a standard curve of the same two nucleosides, to generate percent 5mdC. At least three distinct biological samples (separate cultures) were used for each strain, except for the commercial E. coli B preparation (four technical replicates).

Quantitative PCR

Overnight E. coli cultures were diluted 1 : 100 into fresh LB medium and grown at 37 °C until early logarithmic phase (OD600 nm of ~0.4) and early stationary phase (OD600 nm of ~3.0). Total RNA was isolated using the MasterPure RNA Isolation kit (Epicentre). cDNA was made from 2 to 3 μg of RNA in presence of random primers. qPCR was performed on a Stratagene Mx3000P machine with Stratagene Brilliant Sybr Green qPCR master mix. Primer sequences are found in Fig. S1. Reactions were performed in triplicate and at least two different RNA samples were tested (biological replicates).

Results and discussion

PCR analysis for the presence of dcm loci

A PCR assay was developed to detect the presence of the dcm gene in E. coli. Forty-one E. coli and Shigella full-length dcm DNA sequences were obtained from NCBI (http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi). The sequences were aligned using ClustalX 2.0.10 (www.clustal.org/) and used to construct a N-J tree (Fig. S2). To develop a set of PCR primers for the full-length gene (1419 basepairs), the sequences at the beginning and the end of the alignment were examined. The first 88 nucleotides of all gene sequences were identical, and one forward primer was chosen. While there are three possible reverse primers, reverse primer III is present in only one sequence, and we therefore used a mixture of reverse primers I and II for all experiments. Initial PCRs were optimized using E. coli JM109 DNA (dcm+) as a positive control, and the reactions routinely generated a product of the expected size of 1419 basepairs (Fig. a). The assay was specific, as the dcm PCR product was not observed in reactions without DNA template or with DNA from E. coli GM204, a strain with a deletion of the dcm operon. To confirm that the PCR product truly represented the dcm gene, the PCR DNA from E. coli JM109 was purified and analyzed by DNA sequencing (data not shown).

Figure 1

(a) Detection of the dcm gene via PCR. Escherichia coli genomic DNA from different strains was used as a template for PCR using one forward dcm primer and two reverse dcm primers as described in the Materials and methods. PCR products were analyzed by agarose gel electrophoresis and ethidium bromide straining. The negative sign (−) represents a no DNA control. Escherichia coli GM204 DNA (GM204) contains a deletion of the dcm operon. Escherichia coli JM109 DNA (JM109) contains a wild-type dcm allele. EC068 and EC064 are DNAs from E. coli environmental isolates. (b) Detection of 5-methylcytosine in 5′CCWGG3′ sequences using restriction enzyme isoschizomer pairs. DNA from E. coli ER2925 (dcm-6), JM109 (dcm+), and four environmental strains (EC007-EC010) were left undigested (−), digested with BstNI (B), or digested with PspGI (P). Reactions were analyzed by agarose gel electrophoresis and ethidium bromide staining.

Subsequently, we used the PCR assay to screen the E. coli strains from multiple sources. The ECOR collection was initially screened, as it represents a diverse set of strains with respect to strain characteristics, phylogenetic groups, serotypes, and pathogenicity. Furthermore, new E. coli environmental samples were isolated as described in the materials and methods from a relatively small geographical region (Western New York). These strains included representatives of the four main phylogenetic groups of A, B1, B2, and D (Clermont et al., 2000). All 162 DNAs tested generated an appropriate size PCR product, indicating the presence of the dcm gene or a highly related dcm homolog. The presence of the dcm gene was independent of the source, pathogenicity, or phylogenetic group of the strain (Table S1).

Detection of methylated 5′CCWGG3’ sequences

While all strains tested contained a full-length dcm gene, the PCR assay alone does not prove that each strain contains a functional cytosine DNA methylation and 5mC. Our PCR assay could not rule out dcm mutations that inactivate the enzyme, mutations in regulatory regions that inhibit transcription and translation, or the absence of other molecules required for cytosine DNA methylation. Therefore, a restriction enzyme isoschizomer assay was used to test for methylation of 5′CCWGG3′ sequences. Genomic DNAs were separately digested with the restriction enzymes BstNI and PspGI. Both enzymes cleave the sequence 5′CCWGG3′, but PspGI is blocked by Dcm-mediated cytosine methylation of the second cytosine. The assay was originally optimized with JM109 DNA (dcm+) and ER2925 DNA (dcm). JM109 DNA was resistant to digestion with PspGI, which is consistent with DNA methylation of 5′CCWGG3′ sequences (b). When ER2925 DNA was cut with PspGI, fragments that were heterogeneous in size were observed via gel electrophoresis, indicating ER2925 DNA is sensitive to this enzyme and lacks methylation at 5′CCWGG3′ sites. Titration of mixtures of methylated and unmethylated DNA indicated that the isoschizomer assay could detect partial cytosine DNA methylation down to 10%, but the assay is largely qualitative. DNA samples from all 162 ECOR and environmental strains were resistant to digestion by PspGI. This demonstrates that every strain of E. coli examined in this study has a dcm gene and 5mC in the sequence 5′CCWGG3′. Our data are in contrast to data on the solitary cytosine DNA methyltransferase M.Vch from Vibro cholera, as it was absent in two of 25 strains tested (Banerjee & Chowdhury, 2006). Our experiments cannot determine whether all 5′CCWGG3′ sites are methylated; however, there are reports suggesting the presence of rare, unmethylated 5′CCWGG3′ sites (Ringquist & Smith, 1992; Bormann Chung et al., 2010). Nonetheless, each strain analyzed in our study has a functional cytosine DNA methylation pathway.

Detection of 5mdC in E. coli DNA using LC MS/MS

We were interested in determining the actual levels of 5mC in different strains and used LC MS/MS to detect 5-methyl-2′-deoxycytidine (5mdC) levels in complete DNA digests. The dcm+ laboratory K-12 strains have ~1% 5mdC; JM109 has 0.92% (± 0.02) 5mdC; and BW25113 has 1.02% (± 0.09) 5mdC. The data are consistent with previous results using different methodologies that indicate ~1% of the cytosines of E. coli K-12 strains are methylated (Vanyushin et al., 1968). The level of 5mdC was not above the limit of detection (0.01%) in the dcm knockout strain JW1944-2, indicating that Dcm is the major or only enzyme that produces 5mC in laboratory E. coli strains. We also tested a commercial preparation of E. coli B DNA (Sigma) and did not detect 5mdC. We next tested nine ECOR and environmental isolates in this assay, representing pathogenic and nonpathogenic strains. In each case, 5mdC was detected, indicating that these strains do indeed contain 5mC. The levels of 5mdC ranged from 0.86% to 1.30% of the total cytosine in the DNA (Table ). anova analysis of all strains with 5mdC suggested that there is a statistically significant difference (P < 0.05) between the amounts of 5mdC in all strains tested (P = 0.013). The small differences in levels of 5mdC could be due to small differences in GC content between strains, the lack of methylation of some 5′CCWGG3′ sites in some strains, the presence of 5mC at non-5′CCWGG3′ sites, and/or the presence of another DNA methyltransferase in some strains (e.g. R-M systems).

View this table:

5-methyl-2′-deoxycytidine levels in Escherichia coli strains as determined by LC MS/MS

StrainDescription% 5mdCSD
BW25113K-12 laboratory strain1.020.09
JM109K-12 laboratory strain0.920.02
JW1944-2dcm knockout strainUnder LOD
E. coli BB laboratory strainUnder LOD
EC6001Environmental isolate1.100.05
EC6002Environmental isolate1.030.05
EC6006Environmental isolate1.080.15
EC6026Environmental isolate1.030.12
ECOR11Human urinary tract infection0.920.18
ECOR27Giraffe isolate0.860.03
ECOR47Sheep isolate0.920.11
ECOR52Orangutan isolate1.300.02
Popeye-1Human O157:H7 strain1.130.27
  • The limit of detection (LOD) is 0.01% 5mdC.

Potential roles for Dcm and cytosine DNA methylation

Our data indicate that the dcm gene and cytosine DNA methylation at 5′CCWGG3′ sequences are highly conserved in E. coli, which suggests that cytosine DNA methylation has an important role in E. coli biology. There are reports implicating 5mC in a role in phage lambda recombination, Tn10 insertion, and R-M system maintenance (Korba & Hays, 1982; Lee et al., 1987; Takahashi et al., 2002). Yet, there is no consensus model for dcm function and there is little known regarding the relationship between dcm and E. coli biological processes beyond protection from the EcoRII restriction enzyme. Methylated DNA bases are associated with transcriptional silencing in eukaryotes (Feng et al., 2010). There are reports that some E. coli genes contain Dcm recognition sites within their promoters (Gomez-Eichelmann & Ramirez-Santos, 1993; Palmer & Marinus, 1994). We have extended this observation by analyzing a large number of the promoter sites (1864) in the complete genome of E. coli K-12 MG1655 (Gama-Castro et al., 2011). Promoter sites associated with Sigma 24, 28, 32, 38, 54, and 70 all have examples of 5′CCWGG3′ sequences (a), suggesting that DNA methylation could influence transcription initiation. One hundred and ninety promoters have one 5′CCWGG3′ site, 17 promoters have two 5′CCWGG3′ sites, and two promoters have three 5′CCWGG3′ sites. The distribution of all 5′CCWGG3′ sites in the promoter region relative to the transcription start site is given in b. On the basis of the analysis of the variance to mean ratio (1.53), the distribution of 5′CCWGG3′ locations in promoters is clumped (neither random nor uniform) (P = 0.0018). As expected, there are fewer 5′CCWGG3′ sites associated with the −35 and −10 regions as these regions contain the conserved sequences for sigma factor binding. An analysis of clusters of orthologous genes (COGs) indicates that 5′CCWGG3′ sites are abundant in numerous COG categories and the highest percentages are in transcription, amino acid transport and metabolism, and translation (Table S2).

Figure 2

The number and location of 5′CCWGG3′ sites in Escherichia coli K-12 MG1655 promoters. (a) 5′CCWGG3′ abundance in E. coli promoters. Promoter sequences available in the Regulon database (http://regulondb.ccg.unam.mx/) were downloaded and queried for the number of 5′CCWGG3′ sites. # represents the number of promoters in each category and total is the total number of promoters with 5′CCWGG3′ sequences. (b) Histogram of the frequency of 5′CCWGG3′ sites with respect to the transcription start site. The numbers represent the distance to the transcription start site where the number 1 is the transcription start site. The position refers to the first C in the sequence 5′CCWGG3′.

To determine whether Dcm plays a role in transcription, RNA levels in wild-type bacteria with a plasmid with an inactive dcm truncation (BW25113/pDcm-9), dcm knockout bacteria with a plasmid with an inactive dcm truncation (JW1944-2/pDcm-9), and dcm knockout bacteria with a plasmid containing a functional dcm gene (JW1944-2/pDcm-21) were compared using qPCR. We focused on ribosomal protein gene expression, as a previous report indicated that ribosomal protein S16 gene contains a large number of 5′CCWGG3′ sites (Gomez-Eichelmann & Ramirez-Santos, 1993), and many genes in the translation COG have 5′CCWGG3′ sites in their promoters (Table S2). We started with the rplC and rpsJ genes; these genes code for large and small ribosomal subunits and are part of an operon controlled by the rpsJp promoter. Indeed, there are three 5′CCWGG3′ sites within the rplC gene, one site within the rpsJ gene, and one site 364 basepairs upstream of the rpsJ start codon. At early logarithmic phase, there was relatively no change in rplC and rpsJ RNA levels when comparing the three strains (). However, at early stationary phase, there was a marked increase in both rplC and rpsJ RNA levels in JW1944-2/pDcm-9 cells, and the RNA levels were reduced in the complemented JW1944-2/pDcm-21 cells. These data indicate that Dcm is necessary for repression of these genes and thus potentially influences stationary phase fitness or viability. Expression of the rplC and rpsJ genes is increased in the presence of 5-azacytidine, an inhibitor of cytosine DNA methylation (M.L. VanHorne and K.T. Militello, unpublished data). Thus, we hypothesize that depression of ribosomal protein gene expression is due directly to the loss of DNA methylation. These data are important as they indicate that Dcm has a role in the cell beyond protection from restriction enzymes that cleave the same sequence.

Figure 3

Expression of ribosomal protein genes in the absence and presence of dcm. Escherichia coli wild-type bacteria (BW25113) with a plasmid containing an inactive dcm truncation (WT/pDcm-9), dcm knockout bacteria (JW1944-2) containing an inactive dcm truncation (KO-2/pDcm-9), and dcm knockout bacteria (JW1944-2) containing a plasmid with a functional dcm gene (KO/pDcm-21) were grown to early logarithmic phase and early stationary phase. Total RNA was isolated and converted to cDNA. The levels of rplC and rpsJ were measured by qPCR and normalized to the levels of malate dehydrogenase (mdh) RNA. The WT/pDcm-9 samples were set to a value of one. Error bars represent one standard deviation.

Bacterial ribosome number is correlated with growth rate. In addition to translational control of ribosomal protein gene expression during growth, there is new evidence for widespread transcriptional control of ribosomal protein genes (Lemke et al., 2011). Dcm may participate in reducing or fine-tuning ribosome biosynthesis during stationary phase via methylation-dependent reduction in transcription of ribosomal protein genes during stationary phase. Methylated 5′CCWGG3′ sites in promoters or genes bodies could represent the binding sites for repressors of transcription initiation or elongation. Alternatively, activators may exist that are not able to bind to 5′CCWGG3′sites when they are methylated. In both models, the absence of methylation at these sites will be correlated with increased gene expression. Identifying gene regulatory molecules that are affected by methylation of 5′CCWGG3′ sites will be a high priority. The link between DNA methylation and ribosome biosynthesis could be at the heart of the interaction between a host and a parasitic R-M system. As a large number of DNA methyltransferases found in REBASE modify 5′CCWGG3′sites, it is possible that R-M systems influence expression of ribosomal protein genes and/or other genes to promote their maintenance. The effect of Dcm and other DNA methyltransferases on the entire E. coli transcriptome is currently under investigation.


We thank Dr John Crane (SUNY Buffalo) and Dr Martin Marinus (University of Massachusetts Medical School) for providing E. coli strains. We thank Dr Ashok Bhagwat (Wayne State University) for providing the pDcm-9 and pDcm-21 plasmids. We thank Ping Wang and Joshua Prey at the Roswell Park Cancer Institute for the LC MS/MS analysis. Support for this work was provided by the Geneseo Foundation and NIH Grant R15AI074035-01 (K.T.M).

Authors’ contributions

K.T.M. and R.D.S. contributed equally to the work.

Supporting Information

Fig. S1. PCR primers used in this study.

Fig. S2. Evolutionary relationship of the dcm genes from Escherichia coli and Shigella spp. based on the full length DNA sequence of the gene and the PCR primers that amplify the gene from the various strains.

Table S1. Characteristics of environmental and ECOR animal derived strains used in this study.

Table S2. The presence of 5′CCWGG3′ sites in promoter regions of genes from different COGs.


  • Editor: Richard Calendar


View Abstract