Yeast mtDNA is compacted into nucleoprotein structures called mitochondrial nucleoids (mt-nucleoids). The principal mediators of nucleoid formation are mitochondrial high-mobility group (HMG)-box containing (mtHMG) proteins. Although these proteins are some of the fastest evolving components of mt-nucleoids, it is not known whether the divergence of mtHMG proteins on the level of their amino acid sequences is accompanied by diversification of their biochemical properties. In the present study we performed a comparative biochemical analysis of yeast mtHMG proteins from Saccharomyces cerevisiae (ScAbf2p), Yarrowia lipolytica (YlMhb1p) and Candida parapsilosis (CpGcf1p). We found that all three proteins exhibit relatively weak binding to intact dsDNA. In fact, ScAbf2p and YlMhb1p bind quantitatively to this substrate only at very high protein to DNA ratios and CpGcf1p shows only negligible binding to dsDNA. In contrast, the proteins exhibit much higher preference for recombination intermediates such as Holliday junctions (HJ) and replication forks (RF). Therefore, we hypothesize that the roles of the yeast mtHMG proteins in maintenance and compaction of mtDNA in vivo are in large part mediated by their binding to recombination/replication intermediates. We also speculate that the distinct biochemical properties of CpGcf1p may represent one of the prerequisites for frequent evolutionary tinkering with the form of the mitochondrial genome in the CTG-clade of hemiascomycetous yeast species.
The compaction of DNA into chromosomes enables not only its accommodation into the confines of the cell, but also provides a means for spatial regulation of gene expression and protection from DNA damage. Given its importance, it is not surprising that the compaction of DNA into nucleosomes is mediated by a complex of highly conserved proteins called histones . Their extremely high level of conservation is exemplified by the fact that only eight of 102 amino acids differ between the H4 histones of such evolutionary distant species as humans and Saccharomyces cerevisiae . A deficiency in a histone-encoding gene, or even an imbalance in their expression is often fatal, or can accelerate aging [3,4], further underlining the importance of these proteins in mediating principal functions in DNA maintenance and gene expression .
The necessity for DNA compaction is not limited to the eukaryotic nucleus, as it also extends to bacterial cells as well as DNA-containing organelles, namely chloroplasts and mitochondria. In the latter, the DNA is compacted into nucleoprotein structures called mitochondrial nucleoids (mt-nucleoids) [6–11]. This compaction is in part mediated by proteins containing two DNA-binding domains known as an high-mobility group (HMG)-box . The best-characterized members of this group of proteins (mitochondrial HMG-box containing proteins; mtHMG proteins) are ScAbf2p [13–15] and mammalian mitochondrial transcription factor A (TFAM) [16–20]. Apparently, during the evolution of eukaryotic cells, the host genome-encoded mtHMG proteins replaced the polypeptides that served as the major nucleoid components in the original α-proteobacterial endosymbiont . Yet, although it seems that the presence of an HMG-box is a universal feature of mitochondrial compaction proteins in all eukaryotes, their overall amino acid sequences exhibit very low similarity. In fact, mtHMG proteins seem to represent one of the most divergent groups of mitochondrial proteins . Thus, in contrast with histones, it is very difficult to identify them by simple bioinformatic tools and most of the mtHMG proteins were identified using proteomic analyses of purified mt-nucleoids [22–25].
The dissimilarities between mtHMG proteins can be explained either by the fast evolutionary divergence of the common ancestor or by acquisition of new features. Perhaps the heterogeneity at the level of amino acid sequences in the mtHMG proteins corresponds to different biochemical roles in mtDNA maintenance and segregation, or possibly differences in mtDNA base composition and topology. To explore this question we have performed a comparative biochemical analysis of mtHMG proteins from distantly related species. For this purpose, yeast species from the subphylum Saccharomycotina represent an ideal group of organisms. They exhibit a high degree of biodiversity , their mtDNAs differ in size, base composition and topology , and mtHMG proteins were identified in a number of species separated by hundreds of millions of years of evolution [15,22–24,28–30]. However, the only biochemically characterized yeast mtHMG protein is Abf2 of S. cerevisiae [13–15,31–35]. It was shown that ScAbf2p prefers negatively supercoiled DNA over circular or linear DNA and that, in cooperation with a DNA topoisomerase, it introduces negative supercoils into a topologically relaxed, covalently closed circular dsDNA molecule [15,32]. Its binding to mtDNA is nonrandom, which may be accomplished by the phased distribution of short stretches of poly(dA) indicating its role in genome organization and site-specific regulation of transcription or DNA replication . Optical trapping of single DNA molecules extended by flow and visualized by fluorescence microscopy allowed determination of the binding constant of Abf2 (Kb=2.57±0.74×107 M−1)  (but see also ), and relative small forces (<0.6 pN) stabilizing the condensed DNA–protein interactions . AFM revealed that at high concentrations of Abf2, the DNA is compacted into relatively loosely packaged 190 nm structures [31,33] indicating that Abf2 is indeed the bona fide mtDNA-packaging protein. This conclusion was also reached based on the results of an in organello ChIP-on-chip assay demonstrating that ScAbf2p binds to most of the mitochondrial genome with a preference for GC-rich gene sequences .
Although these studies provided important information about the DNA-binding properties of ScAbf2, they also left several important questions unanswered. First, the in vitro studies were mostly performed on intact dsDNA substrates, whereas yeast mitochondria contain topologically different forms of DNA generated as a result of various types of transactions including replication, recombination and repair [37–40]. It is known that HMG-box containing proteins recognize some of these structures with high affinity . However, information about the binding of yeast mtHMG proteins to DNA substrates such as Holliday junctions (HJ) or replication forks (RF) is lacking. Second, the compaction of DNA is induced at relatively high ScAbf2p to DNA-binding site ratios (20 to 1) [31,33], whereas in the organelle there is one molecule of ScAbf2p per 27 bp of DNA (size of the binding site)  (see also below). And third, basically all biochemical data on mtHMG proteins are derived from the studies of ScAbf2p. As indicated above, the mtHMG proteins represent the fastest evolving component of mt-nucleoids  and it is currently unknown if this divergence in amino acid sequence translates into differences in biochemical properties.
To address these questions we have performed a comparative analysis of mtHMG proteins from three distinct yeast species (Figure 1). Namely, we selected ScAbf2p from S. cerevisiae as the best-characterized protein thus allowing comparison of our results with those published by other authors. YlMhb1p from Yarrowia lipolytica  was chosen because this species belongs to basal lineages of Saccharomycotina and phylogenetically is very distant to S. cerevisiae. Therefore, we could compare biochemical properties of two proteins with very divergent amino acid sequences, whose only common feature is the presence of two HMG-boxes. Finally, CpGcf1p is the mtHMG protein from Candida parapsilosis, the yeast species with a linear mitochondrial genome [22,41]. A detailed characterization of this protein enabled us to address the question of how biochemical properties of mtHMG proteins may be associated with the evolutionary emergence of the linear mitochondrial genome forms, frequently occurring in species from the CTG-clade of Saccharomycotina encompassing C. parapsilosis.
Yeast mtHMG proteins
MATERIALS AND METHODS
Microbial strains and growth conditions
Escherichia coli DH5α (F−, φ80dlacZΔM15, Δ(lacZYA-argF) U169, deoR, recA1, endA1, hsdR17 (rk−, mk+), λ, thi-1, gyrA96, relA1, glnV44, nupG) (Life Technologies) was used for the amplification of plasmid constructs. E. coli BL21 Star™(DE3) (F−, ompt, hsdSB, rB−mB−, gal, dcm, rne131) (Life Technologies) was used for production of recombinant proteins (ScAbf2noMP, YlMhb1noMP, CpGcf1noMP). Bacterial cultures were grown in LB medium (1% (w/v) bacto peptone (Difco), 0.5% (w/v) yeast extract (Difco), 1% (w/v) NaCl, pH 7.5) containing 100 μg/ml ampicillin.
Enzymatic manipulations with DNA, cloning procedures and DNA labelling were performed according to the instructions provided by the vendors. The oligonucleotides (Table 1) were synthesized by Microsynth. The PCRs were performed in 10–50 μl volumes using DreamTaq DNA polymerase (Life Technologies) or Phusion Hot Start II High fidelity DNA polymerase (Life Technologies) and contained all four dNTPs (final concentration 200 μM each), the corresponding primers (final concentra-tion 1 μM), and either 100 ng of genomic DNA or mtDNA or 10 ng of plasmid DNA. The PCR fragments were purified from agarose gels using a QIAquick Gel Extraction kit (Qiagen) or Zymoclean Gel DNA recovery kit (Zymo Research).
Construction of plasmid vectors
For the expression of recombinant versions of mtHMG proteins lacking the cleavable mitochondrial import sequence (noMP) in fusion with GST a series of pGEX-6P-2 (GE Healthcare) derived plasmids was constructed as follows. Plasmids pGEX-6P-2-ScABF2noMP and pGEX-6P-2-YlMHB1noMP were prepared by inverse PCR to eliminate the first 26 amino acids for ScAbf2p and 14 amino acids for YlMhb1p, corresponding to cleavable mitochondrial import sequence. In the reaction, primers pGEX6P2noMP_R and ScABF2noMP_F or YlMHB1noMP_F (Table 1) were used and previously prepared plasmids pGEX-6P-2-ScABF2 or pGEX-6P-2-YlMHB1  were used as templates. For the construction of the plasmid pGEX-6P-2-CpGCF1noMP the CpGCF1 ORF lacking the first 33 amino acids, which represent cleavable mitochondrial import sequence, was amplified by PCR from the genomic DNA of C. parapsilosis strain CBS604 using primers CpGCF1noMP_F and CpGCF1noMP_R (Table 1). The PCR product was inserted into the vector pGEX-6P-2 (GE Healthcare) linearized with SmaI. All plasmid constructs were verified by restriction enzyme mapping and DNA sequencing (Microsynth) of the inserted fragments.
Expression and purification of recombinant mtHMGp from
Recombinant mtHMG proteins without the mitochondrial import presequence were purified from bacterial cells as described previously for full length YlMhb1 protein . The presence and purity of proteins were verified by 12% SDS-PAGE stained with Coomassie Brilliant Blue R-250.
DNA substrates and electrophoretic-mobility shift assay (EMSA)
For electrophoretic-mobility shift assay (EMSA) experiments aimed at characterizing the length of the binding site for mtHMG proteins (Figure 2) oligonucleotides ScATP9_15_D and ScATP9_25_D (Table 1), derived from the S. cerevisiae mitochondrial gene atp9, were radioactively labelled using T4 polynucleotide kinase (Life Technologies) and [γ32P]ATP. The labelled oligonucleotide was then mixed with non-labelled complementary oligonucleotide (ScATP9_15_C or ScATP9_25_C, Table 1) in a molar ratio of 1:3. The mixtures were incubated at 95°C for 5 min and cooled slowly to room temperature to allow DNA annealing. The unincorporated [γ32P]ATP was removed from the DNA by gel filtration using Probe Quant G-50 MicroColumns (GE Healthcare). The 50 bp long DNA substrate derived from the atp9 gene was amplified by PCR from mtDNA of S. cerevisiae strain W303-1A using primers ScATP9_15_D and ScATP9_50_R (Table 1) and terminally end-labelled using T4 polynucleotide kinase. Fluorescently labelled DNA substrates for EMSA were prepared as described previously [42,43]. The structures of the DNA probes are schematically depicted in the corresponding figures. The GC content of the probes ranged between 40% (50 bp probe) and 53% (15 bp probe), which is higher than the GC content in atp9 coding sequence (33%).
Purified yeast mtHMG proteins differ in their ability to bind dsDNA of various lengths
Indicated amounts of purified recombinant proteins ScAbf2noMP, YlMhb1noMP or CpGcf1noMP were mixed with the individual radioactively or fluorescently labelled DNA substrate (3 nM) and incubated for 10 min at 30°C in 10 μl of a buffer containing 20 mM Tris/HCl pH 7.5, 1 mM EDTA/NaOH pH 8.0, 50 mM NaCl, 100 μg/ml BSA. Samples were electrophoretically separated in 5 or 8% (w/v) polyacrylamide gels in 0.5× TBE buffer (45 mM Tris/borate, 1 mM EDTA/NaOH pH 8.0) at 4°C. Note that the loading buffer contained only a final concentration of 5% (v/v) glycerol, as we observed that the presence of bromophenol blue and xylene cyanol blue almost completely abolished the binding of mtHMG proteins to DNA. Radioactively labelled DNA substrates were visualized after exposing the gels to storage phosphor screens (Kodak) for 24–72 h using a Personal Molecular Imager FX (BioRad). Fluorescent DNA substrates were visualized directly using imager reader FLA-9000 Starion (Fuji) and quantified using MultiGauge V3.2 software (Fuji).
The DNA-binding reactions for EM were performed in 50 μl of HEN buffer (20 mM HEPES/NaOH pH 7.5, 1 mM EDTA/NaOH pH 8.0, 50 mM NaCl) containing 2 ng/μl of the substrate DNA and 15 ng/μl of purified ScAbf2p. Plasmid pGLGAP and preparation of the RF and HJ substrates were described elsewhere [44,45]. The reactions were carried out at room temperature for 15 min, followed by addition of 10 μl of 1.2% (v/v) glutaraldehyde and incubation at room temperature for additional 6 min. To remove the unbound proteins and fixative, the samples were diluted to 50 μl in HEN buffer and passed over 2 ml columns of 6% agarose beads (ABT Inc.) equilibrated with TE buffer (10 mM Tris/HCl, pH 7.4, 0.1 mM EDTA/NaOH). Aliquots of the fractions containing the complexes were mixed with a buffer containing spermidine and adsorbed on to copper grids coated with a thin carbon film glow-charged shortly before sample application. Following adsorption of the samples for 3 min, the grids were dehydrated through a graded ethanol series and rotary shadowcast with tungsten at 10−7 torr . Samples were examined in an FEI T12 TEM equipped with a Gatan 2k × 2k SC200 CCD camera.
Yeast mtHMG proteins differ in their affinity to dsDNA
To initiate the biochemical characterization of mtHMG proteins, we expressed recombinant genes encoding the corresponding protein without the N-terminal mitochondrial targeting sequence in fusion with GST using the pGEX-6P-2 vector. To obtain native versions of the proteins, the fusion proteins purified from E. coli were treated with PreScission protease to remove the GST affinity tag (Figure 2A). The concentrations of the proteins were then adjusted to the same value and their affinity to dsDNA assessed by EMSA.
The proteins were first tested for their ability to bind dsDNA of various lengths (15–50 bp) derived from the S. cerevisiae atp9 gene (Figure 2B). Although none of the proteins was able to bind the shortest (15 bp) DNA fragment, the 25 bp DNA was almost quantitatively shifted by ScAbf2p. YlMhb1p exhibited very weak binding and CpGcf1p did not bind this probe at all. The 50 bp DNA fragment was bound by all three mtHMG proteins. Both ScAbf2p and YlMhb1p formed two DNA–protein complexes, possibly corresponding to one and two protein molecules per molecule of DNA, respectively. CpGcf1p formed only a single complex with DNA, although its mobility corresponded to the slower migrating form of DNA bound by YlMhb1p. Although the nature of this complex is unclear (see Discussion), it is evident that the affinities of the mtHMG proteins to dsDNA as well as the lengths of their corresponding DNA-binding sites differ, and that the CpGcf1p exhibits the lowest affinity towards intact dsDNA.
ScAbf2p exhibits a high preference for replication/recombination intermediates
The results presented in Figure 2B indicate that to obtain quantitative binding of mtHMG proteins to dsDNA, even in the case of ScAbf2p (which exhibits the strongest binding), a relatively high protein to DNA-binding site ratio (50–100 to 1) is required. A relatively high ratio of ScAbf2p to binding sites was also required for a complete compaction of DNA by ScAbf2p as visualized by AFM [31,33]. Namely, to completely compact a linear DNA (pBR322) containing 175 binding sites, almost 4000 molecules of ScAbf2p were needed (a ratio of 22 ScAbf2p to 1 DNA-binding site) , and similar results were obtained by another group . At such high ratios of protein to DNA, contamination of the native protein with denatured molecules or truncated species, either of which may induce a general aggregation or collapse of the DNA, becomes a significant concern.
In vivo, the ratio of ScAbf2p to DNA is much lower than what has been employed in the previous biochemical studies. Diffley and Stillman  originally estimated the amount of ScAbf2p to be 250000 molecules per cell, but more recent assessments of the cellular amount of Abf2p resulted in two orders of magnitude lower numbers, namely 3810  and 860  molecules of Abf2p per cell, respectively. If there are 50–100 molecules of 85 kbp mtDNA per cell and the length of the binding site is 27 bp (Figure 2B; ), there would be 160000–320000 binding sites per cell and thus even when the highest estimate of Abf2p molecules is taken into account, the stoichiometric ratio of ScAbf2p to DNA substrate in vivo is 1 to 1 (when considering the lower estimates, the ratio would drop dramatically). Under these conditions, the binding of ScAbf2p to dsDNA is 3–5-fold lower (Figure 2B) and the level of compaction is almost negligible [31,33]. Indeed, when we visualized the binding of ScAbf2p to plasmid DNA by EM at a protein to DNA-binding site ratio of 1 to 1 we observed only 5–10 protein particles per DNA molecule (Figure 3A). Therefore, we reasoned that intact dsDNA may not be the best substrate for this protein. Rather, ScAbf2 may prefer DNA forms resulting from various types of DNA transactions such as replication and recombination. This hypothesis was supported by the fact that many nuclear mtHMG proteins exhibit a preference for recombination intermediates  and yeast mitochondria possess a DNA recombination system involved in both DNA replication and repair . We therefore tested the ability of ScAbf2p to bind RF or HJ by EM. Indeed, we found that the binding of protein to DNA occurred almost exclusively at the RF (Figure 3B) or at the junction (Figure 3C). These results prompted us to compare the binding of all three mtHMG proteins to various DNA substrates by EMSA.
ScAbf2 protein exhibits a binding preference for RF and HJ
Comparative analysis of yeast mtHMG proteins reveals differences in their binding preferences to various DNA substrates
To compare the DNA-binding properties of the mtHMG proteins we tested 10 different DNA substrates, namely intact dsDNA (see also Figure 2B), HJ, RF, 3′ Flap, 5′ Flap, single-strand (ss), 3′ and 5′ overhang (OH), Y-form, nicked and intact HJ (nHJ a HJ) and displacement loop (D-loop) DNAs (Figure 4). The similarities and differences between the proteins are best illustrated by the first four substrates (Figures 4A and 4B). ScAbf2p exhibited the strongest binding to each of these structures with intact dsDNA and the 3′ Flap substrates being the least preferred. The difference in binding between the three proteins was most evident on intact dsDNA. To shift 50% of the probe 30 nM of ScAbf2p was sufficient, whereas 120 nM of YlMhb1p was needed, and CpGcf1p did not shift 50% of dsDNA even at the highest tested concentration (Figure 4B). On the other hand, RF containing DNA, and especially HJ DNAs were bound very efficiently by all three mtHMG proteins even at lower protein concentrations (Figures 4A and 4B). It is of note that CpGcf1p, which bound dsDNA and 3′ Flap substrates very poorly, was almost as efficient in binding to the HJ DNA as ScAbf2p (Figure 4B). This demonstrates that the low affinity of CpGcf1p to dsDNA and 3′ Flap DNA is not a result of the protein being nonfunctional, but rather reflects its intrinsic biochemical preference for certain types of DNA substrates. The binding of the proteins to other DNA substrates, including ssDNA, 5′ Flap, Y-form, 3′ OH and 5′ OH substrates was similar as to dsDNA, whereas the nHJ and D-loop DNAs were bound as efficiently as the HJ DNA (Figure 4C).
EMSA of yeast mtHMG proteins with various DNA substrates reveals both similarities and differences in their DNA-binding properties
Intriguingly, CpGcf1p exhibits the largest difference in its ability to bind recombination intermediates (HJ, nHJ, D-loop DNAs) compared with dsDNA and RF DNA. At the protein concentration of 120 nM, CpGcf1p exhibits an 8-fold higher affinity towards HJ compared with 3′ Flap DNA (Figure 5A). Therefore, we have performed competition experiments, where the protein is incubated simultaneously with both dsDNA and HJ DNA. Whereas the former is basically unrecognized, there is a relatively robust binding of the protein to the HJ substrate (Figure 5B).
CpGcf1p exhibits the most dramatic difference in binding to dsDNA and 3′ Flap compared with HJ and RF substrates
Our results point out several important properties of yeast mtHMG proteins. First, intact dsDNA is a relatively poor substrate. ScAbf2p exhibited the highest apparent affinity for dsDNA; however, to achieve a quantitative binding, high protein to DNA ratios were needed (Figure 2B). Whereas YlMhb1p exhibited similar binding to the 50 bp long dsDNA as ScAbf2p, it bound very weakly to 25 bp dsDNA. This might be caused by a longer binding site of YlMhb1p compared with ScAbf2p. The binding of CpGcf1p to dsDNA was very weak for all substrates even at the highest protein to DNA ratios (Figures 2B and 5). In case of the 50 bp probe the shift by the CpGcf1p corresponds to the supershift observed by YlMhb1p and ScAbf2p suggesting that protein dimers are bound to the probe. The presence of a coiled-coil domain in CpGcf1p might be responsible for the formation of oligomeric complexes, but this possibility needs to be tested experimentally. The role of dimerization in DNA binding and compaction was studied in detail in case of the mammalian mtHMG protein (TFAM) and is still a matter of debate. Two crystal structures of TFAM bound to the heavy strand promoter or to a nonspecific mtDNA sequence showed evidence of TFAM dimerization . On the other hand, a more recent study demonstrated that dimerization is not required for TFAM-induced compaction of mtDNA  underlining the importance of further investigation of interactions of mtHMG proteins with DNA.
Even though ScAbf2p might bind to some sites on mtDNA with higher preference, it was found to bind to most of the genome in vivo . Taking into account the ratio of ScAbf2p to DNA in vivo (at most one molecule of the protein per one 27 bp binding site, but possibly much lower (see above)) our results indicate that the role of yeast mtHMG proteins in the compaction of mtDNA might not be as straightforward as suggested by previous studies. This conclusion is supported by the fact that cells lacking ScAbf2p , YlMhb1p  or the Gcf1p orthologue of Candida albicans (CaGcf1p) , exhibit morphologically different, yet still functional mt-nucleoids. This can be explained by the existence of distinct mtDNA-compaction factors acting in parallel with the mtHMG proteins. Some of the candidates (such as Aco1p, Ilv5p) were identified in S. cerevisiae [7,52,53]; however, the means by which these proteins mediate compaction of mtDNA are far from understood.
The second general important property of the yeast mtHMG proteins we observed is their preference for DNA structures generated during replication, recombination and/or repair (Figures 3 and 4). This biochemical property is common for proteins of the HMG-1/HMG-2 subfamily that were shown to bind distorted structures such as HJ, cis-platinum adducts or base bulges . This is also true for mammalian mtHMG protein TFAM that was demonstrated to exhibit 10-fold higher affinity for HJ  as well as RNA four-way junctions  compared with the linear dsDNA. Although it was shown that an R-loop formed in the human mitochondrial replication origin contains a Holliday-like structure , the level of DNA recombination in mammalian mitochondria is (at best) relatively low  and thus the physiological role of high-affinity binding of TFAM to HJ structures is not clear. A recent elegant study of Kukat et al.  revealed that TFAM-mediated nucleoid formation in vitro is a multistep process initiated by TFAM aggregation and cross-strand binding. It is possible that yeast mtHMG proteins also employ similar mechanisms of mtDNA compaction. However, in contrast with their mammalian counterparts, yeast mitochondria exhibit a high level of recombination DNA intermediates [37–40]. In fact, recombination seems to be a principal mechanism of yeast mtDNA replication . The high incidence of recombination intermediates in yeast mitochondria combined with our results demonstrating a preference of yeast mtDNA proteins for such DNA structures indicate that these structures may represent their main substrates in vivo. Indeed, the abf2Δ mutants of S. cerevisiae [59,60] and knockdown gcf1− strains of C. albicans  exhibit a decrease in the level of mtDNA recombination intermediates. Conversely, overexpression of the ABF2 gene in S. cerevisiae results in an increased level of recombination intermediates and destabilization of mtDNA . In addition, both abf2Δ and mhb1Δ mutants of S. cerevisiae and Y. lipolytica, respectively, are more prone to mutations [23,61]. This may be the result of both deprotection of mtDNA and/or a decreased capacity of recombination-dependent DNA repair. Although all these results indicate that binding of mtHMG proteins to various DNA structures is important for mtDNA transactions in yeasts, the molecular details of their participation in these processes are still largely unknown.
The observation of Kucej et al.  that in vivo Abf2p binds to most of the mitochondrial genome with a preference for GC-rich gene sequences can also be interpreted in light of our results. First, the ChIP-on-chip assay was performed on a population of cells and could not address the distribution of ScAbf2p at the level of single mtDNA molecules. Second, GC-clusters are hot-spots of mtDNA recombination in S. cerevisiae mitochondria , so it is possible that the preferential binding to these regions was caused by recombination undergoing at these sites.
Taken together, it is likely that the compaction of mtDNA by yeast mtHMG proteins is achieved via a combination of cross-strand binding of intact DNA (as demonstrated for TFAM; ) and binding to DNA recombination/replication intermediates that are abundant in yeast mitochondria. It was shown that the nuclear protein ScIxr1, a paralogue of ScAbf2p, binds and bends intrastrand DNA cross-links induced by platinum  and thus participates in the repair of these DNA lesions. When HJ or RF DNAs were used as substrates for mtHMG proteins, we observed the formation of several distinct species of DNA–protein complexes, some of which may contain oligomeric forms of the protein and/or compacted DNA (Figure 4). It is possible that these DNA structures serve as signals for binding of mtHMG proteins followed by their oligomerization accompanied by compaction of DNA and/or recruitment of other components of mt-nucleoids. Considering the high amount of recombination intermediates in yeast mitochondria, it would be interesting to investigate how they, in concert with mtHMG proteins, participate in the formation of mt-nucleoids and how the mode of binding of mtHMG proteins to various substrates is affected by interactions with other nucleoid-associated proteins, as well as their posttranslational modifications such as phosphorylation and proteolytic cleavage [64–66].
One of the main motivations for the present study was to address the question of whether the high degree of amino acid divergence among the yeast mtHMG proteins corresponds to differences in their biochemical properties. On one hand, the proteins seem to be similar in their ability to complement (although to a different extent) the abf2Δ mutation in S. cerevisiae [22,23]. Also, they all exhibit a preference for recombinational intermediates compared with intact dsDNA. On the other hand, their relative affinities for various substrates differ, especially when comparing ScAbf2p and YlMhb1p with CpGcf1p. In contrast to its counterparts, CpGcf1p hardly binds dsDNA, whereas its binding to RF and especially HJ DNA is comparable to the other two mtHMG proteins (Figures 4A and 4B). Of note is the inability of CpGcf1p to bind to 5′ OH containing DNA under the conditions tested (Figure 4C) as this structure is present at the ends of linear mtDNA of C. parapsilosis  and therefore is relatively frequent in vivo. Apparently, the ss/double-stranded (ds) junction does not seem to be a preferred site for CpGcf1p. Moreover, the 5′ OH terminus of mtDNA of C. parapsilosis is covered by the mitochondrial telomere-binding protein (mtTBP) [67–69] and it is also engaged in the formation of telomeric loops (t-loops) , and thus the junction would probably not be accessible for binding in vivo.
Although, based on its biochemical properties, CpGcf1p does not seem to be present at the terminal regions of the linear mtDNA of C. parapsilosis in vivo, the dramatic differences between its ability to bind recombination intermediates (HJ, nHJ, D-loop DNA) compared with dsDNA and RF DNA (Figure 5A) indicate that it is an important player employed by C. parapsilosis mitochondria to maintain mtDNA in general and mitochondrial telomeres in particular. When analysed by 2D agarose electrophoresis, preparations of mtDNA of C. parapsilosis contain a large variety of recombination intermediates indicating that recombination plays an important part in mtDNA replication . Moreover, maintenance of mitochondrial telomeres composed of tandemly repeated sequences is mediated by telomeric circles (t-circles), whose formation is dependent on the recombination machinery [71–73]. Their replication via a rolling-circle mechanism generates an array of telomeric sequences that can reintegrate back into the main mtDNA molecules via homologous recombination [71,73]. Moreover, we have shown previously that mtDNA of C. parapsilosis forms t-loops where 5′ OH invades into the ds region of the telomere . All these features highlight a crucial role of CpGcf1p in DNA recombination and thus in the maintenance of C. parapsilosis mtDNA as well as its telomeres.
Finally, it is of note that ScAbf2p and YlMhb1p are biochemically more similar to each other than to CpGcf1p, although S. cerevisiae and Y. lipolytica are phylogenetically more distant than either species is to C. parapsilosis (Figure 6). It is possible that this observation can be explained by differences in the evolutionary dynamics of mitochondrial genome architecture in the corresponding phylogenetic branches. Although S. cerevisiae and Y. lipolytica are separated by about 500 million years , they belong to groups of species mostly possessing a circular-mapping mitochondrial genome, whose mode of maintenance is likely quite similar. This would explain the similarity in biochemical properties of ScAbf2p and YlMhb1p. On the other hand, C. parapsilosis is a member of a CTG-clade of the subphylum Saccharomycotina exhibiting a wide repertoire of forms of mitochondrial genomes ranging from circular-mapping to linear molecules with defined telomeres such as tandem repeats, hairpins, or covalently attached proteins [27,74–76]. The conversion between various forms seems to be quite frequent, often occurring in strains of the same species [77,78]. It is possible that distinct biochemical features of the Gcf1 protein described in the present study represent one of the prerequisites allowing such frequent evolutionary tinkering .
Phylogenetic tree illustrating the distribution of various types of mtHMG proteins and variability of the form of mitochondrial genomes in selected yeast species
Jana Bakkaiova performed the experiments shown in Figures 2, 4 and 5. Victoria Marini prepared the DNA substrates and assisted with the experiments shown in Figures 4 and 5. Smaranda Willcox performed the experiments shown in Figure 3. Jozef Nosek performed bioinformatic analyses, prepared Figure 1 and constructed the phylogenetic tree shown in Figure 6. Jack D. Griffith supervised and coordinated the electron-microscopic analysis shown in Figure 3. Lumir Krejci designed and supervised the EMSA experiments presented in Figures 4 and 5. Lubomir Tomaska conceived and coordinated the study, assisted with the experiments presented in Figures 2(B) and 3 and wrote the first draft of the manuscript. All authors reviewed the results, edited the manuscript and approved its final version.
We thank Ladislav Kovac for inspiration and continuous support, Filip Tomaska for technical assistance with the EM experiments and for help with the construction of the expression plasmid pGEX-6P-2-ScABF2noMP, Vilko Bakkai for cooperation in finalization of the EMSA experiments, and members of our laboratories for discussions.
This work was supported by the Slovak grant agencies APVV [0035-11 (to L.T.) and 14-0253 (to J.N.)], VEGA [1/0311/12 (to L.T.) and 1/0333/15 (to J.N.)]; the National Institutes of Health [ES013773-06 and GM31819 (to J.D.G)]; the Czech Science Foundation [GACR 13-26629S and P207/12/2323]; and European Regional Development Fund–Project FNUSA-ICRC [CZ.1.05/1.1.00/02.0123 (to L.K.)].
electrophoretic-mobility shift assay
nicked Holliday junction
transcription factor A, mitochondrial