Summary: | Cytosine residues in mammalian DNA occur in five forms, cytosine (C), 5-methylcytosine (5mC), 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). The ten-eleven translocation (Tet) dioxygenases convert 5mC to 5hmC, 5fC and 5caC in three consecutive, Fe(II)- and α-ketoglutarate-dependent oxidation reactions1–4. The Tet family of dioxygenases is widely distributed across the tree of life5, including the heterolobosean amoeboflagellate Naegleria gruberi. The genome of Naegleria6 encodes homologs of mammalian DNA methyltransferase and Tet proteins7. Here we study biochemically and structurally one of the Naegleria Tet-like proteins (NgTet1), which shares significant sequence conservation (approximately 14% identity or 39% similarity) with mammalian Tet1. Like mammalian Tet proteins, NgTet1 acts on 5mC and generates 5hmC, 5fC and 5caC. The crystal structure of NgTet1 complexed with DNA containing a 5mCpG site revealed that NgTet1 uses a base-flipping mechanism to access 5mC. The DNA is contacted from the minor groove and bent towards the major groove. The flipped 5mC is positioned in the active site pocket with planar stacking contacts, Watson–Crick polar hydrogen bonds and van der Waals interactions specific for 5mC. The sequence conservation between NgTet1 and mammalian Tet1, including residues involved in structural integrity and functional significance, suggests structural conservation across phyla.
|