The judicious choice of promoter to drive gene expression remains one of the most important considerations for synthetic biology applications. Constitutive promoter sequences isolated from nature are often used in laboratory settings or small-scale commercial production streams, but unconventional microbial chassis for new synthetic biology applications require well-characterized, robust and orthogonal promoters. This review provides an overview of the opportunities and challenges for synthetic promoter discovery and design, including molecular methodologies, such as saturation mutagenesis of flanking regions and mutagenesis by error-prone PCR, as well as the less familiar use of computational and statistical analyses for de novo promoter design.

Introduction

Predictable output is a defining aspiration of synthetic biology. A number of factors affect the output from synthetic gene networks to a greater or lesser extent, including transgene copy number [1], integration into the genome or expression from plasmids [2], promoter activity [3], ribosome-binding sites [46], codon bias of the host [7], transcription rate and tRNA abundance [8], half-life of mRNA [9], substrate and cofactor availability [10], adjustment of enzyme kinetics [11], protein scaffolding [12] and sub-cellular localization through the use of microcompartments [13,14]. The use of RNA as a control mechanism, either through the application of riboswitches [15] or toehold switches [16] has also emerged as a powerful tool for pathway control. Each of these aspects can be investigated and improved individually, and then integrated by a model, a suite of experiments or ideally, a combination of modelling and empiricism.

Several investigations, including the now archetypal ‘repressilator’ [17] and the genetic toggle switch [18] have modelled promoters and generated bacteria that display patterns of gene expression consistent with mathematical predictions. However, despite these successes, when individual bacteria are investigated, strong variations in transgene expression levels become apparent, even within clonal populations [19].

Controlling transcription is often the simplest way to balance expression of a transgene or synthetic pathway, and constitutive promoters with different and predictable activation characteristics are a desirable feature of any synthetic biology toolkit. However, in practise, promoter availability tends to be restricted to relatively few sequences [20], which do not always perform as required and may not necessarily be transferrable to new microbial chassis. The fact that many promoters are characterized as merely ‘weak’ or ‘strong’ [21] highlights this issue–such definitions are hardly sufficient to allow adequate promoter selection.

A number of inducible promoter systems are available for which the concentration of inducer can, in theory, be modulated in order to achieve the desired level of protein production [22]. Although the use of inducible promoter systems has been successful in some instances, in others it can prove inadequate. Promoter hypersensitivity to the inducer [23], the cost of adding large quantities of inducer to an industrial-scale fermenter [24] or heterogeneous expression levels across a population [25] all complicate the use of inducible promoters in industrial-scale cultures. Consequently, for large-scale production applications, constitutive promoters with ‘hard-wired’, predictable properties are often preferred and are the focus of this review.

In this article, we review the potential and methodologies for designing and characterizing new constitutive promoter sequences with predictable outputs, including conventional PCR-based techniques, hybrid promoter engineering and the expanding use of computational analysis for de novo promoter design.

Characteristics of promoters for synthetic biology applications

A promoter can be broadly defined as a cis-regulatory element containing a somewhat modular suite of key motifs that control the transcription of individual ORFs or operons. In prokaryotes, the structure and organization of natural promoter motifs is relatively well understood (Figure 1A). Eukaryotic promoters are somewhat more complex than their prokaryotic counterparts (Figure 1B), with localization of the transcriptional apparatus resulting from interactions between highly specific transcription factors, the promoter elements and co-activators [26]. The activity of promoters is typically quantified through measures of cellular mRNA or reporter proteins [27], linking the levels of promoter activity (or ‘strength’) to both transcription and translation. In reality, the promoter regulates only transcription but in practise, experimental constraints use protein quantification as a useful proxy for promoter activity.

Schematic representations of typical promoter sequences

Figure 1
Schematic representations of typical promoter sequences

(A) Schematic representation of a typical prokaryotic promoter sequence. The transcription start site (TSS) is shown in red. Two conserved hexamers, at approximately 10 and 35 bp upstream of the TSS [68], highlighted here in blue, serve as key binding regions for RNA polymerase [69]. No such conserved motifs have been found in the region of sequence separating the two hexamers, although a consensus length of 17 bp has been observed in some species [70]. In addition to these core promoter elements, an upstream region (highlighted here in turquoise) is present in some promoters. Typically adenine/thymine rich, these UP elements boost transcription rate through interactions with the C-terminal domain on the RNA polymerase α-subunit [71]: Estrem, S.T., Gaal, T., Ross, W. and Gourse, R.L. (1998) Identification of an UP element consensus sequence for bacterial promoters. Proc. Natl. Acad. Sci. U.S.A. 95, 9761–9766. The UP element consensus sequence is as derived by [71]. −10 and −35 consensus sequences are from E. coli and are reproduced from [3]: Blazeck, J. and Alper, H.S. (2013) Promoter engineering: recent advances in controlling transcription at the most fundamental level. Biotechnol. J. 8, 46–58 and [72]: Ross, W., Aiyar, S.E., Salomon, J. and Gourse, R.L. (1998) Escherichia coli promoters with UP elements of different strengths: modular structure of bacterial promoters. J. Bacteriol. 180, 5375–5383. N represents any deoxyribonucleotide. W represents adenine (A) or thymine (T). G and C represent guanine and cytosine respectively. (B) Schematic representation of a S. cerevisiae promoter sequence. The TSS is highlighted in red. Eukaryotic promoters can be broadly split into two regions, a core promoter element (shown in blue) and an upstream enhancer [3] (shown in turquoise), both of which can be modified in order to modulate expression levels. The core region provides the minimal sequence necessary for initiation of basal transcription and may contain key motifs, the most widely studied of which is the TATA box, which typically occurs 40–120 bp upstream of the TSS [73]. However, such motifs are by no means requisite for transcription initiation, as TATA boxes appear in only 20% of S. cerevisiae promoter elements [74]. Diagonal lines represent the region in which TATA boxes are most common. Upstream of the core promoter, the enhancer element serves to localize transcription factors, with interactions between bound transcription factors and the transcriptional machinery serving as a determinant of promoter strength and control [56]. Transcription factor binding sites do not display uniform distribution across the enhancer element, and are represented here as solid vertical lines in arbitrary positions. The highest concentration of such binding motifs has been reported between 50–150 bp prior to the TSS [75], although they may be present as much as 500 bases upstream of the TSS [76].

Figure 1
Schematic representations of typical promoter sequences

(A) Schematic representation of a typical prokaryotic promoter sequence. The transcription start site (TSS) is shown in red. Two conserved hexamers, at approximately 10 and 35 bp upstream of the TSS [68], highlighted here in blue, serve as key binding regions for RNA polymerase [69]. No such conserved motifs have been found in the region of sequence separating the two hexamers, although a consensus length of 17 bp has been observed in some species [70]. In addition to these core promoter elements, an upstream region (highlighted here in turquoise) is present in some promoters. Typically adenine/thymine rich, these UP elements boost transcription rate through interactions with the C-terminal domain on the RNA polymerase α-subunit [71]: Estrem, S.T., Gaal, T., Ross, W. and Gourse, R.L. (1998) Identification of an UP element consensus sequence for bacterial promoters. Proc. Natl. Acad. Sci. U.S.A. 95, 9761–9766. The UP element consensus sequence is as derived by [71]. −10 and −35 consensus sequences are from E. coli and are reproduced from [3]: Blazeck, J. and Alper, H.S. (2013) Promoter engineering: recent advances in controlling transcription at the most fundamental level. Biotechnol. J. 8, 46–58 and [72]: Ross, W., Aiyar, S.E., Salomon, J. and Gourse, R.L. (1998) Escherichia coli promoters with UP elements of different strengths: modular structure of bacterial promoters. J. Bacteriol. 180, 5375–5383. N represents any deoxyribonucleotide. W represents adenine (A) or thymine (T). G and C represent guanine and cytosine respectively. (B) Schematic representation of a S. cerevisiae promoter sequence. The TSS is highlighted in red. Eukaryotic promoters can be broadly split into two regions, a core promoter element (shown in blue) and an upstream enhancer [3] (shown in turquoise), both of which can be modified in order to modulate expression levels. The core region provides the minimal sequence necessary for initiation of basal transcription and may contain key motifs, the most widely studied of which is the TATA box, which typically occurs 40–120 bp upstream of the TSS [73]. However, such motifs are by no means requisite for transcription initiation, as TATA boxes appear in only 20% of S. cerevisiae promoter elements [74]. Diagonal lines represent the region in which TATA boxes are most common. Upstream of the core promoter, the enhancer element serves to localize transcription factors, with interactions between bound transcription factors and the transcriptional machinery serving as a determinant of promoter strength and control [56]. Transcription factor binding sites do not display uniform distribution across the enhancer element, and are represented here as solid vertical lines in arbitrary positions. The highest concentration of such binding motifs has been reported between 50–150 bp prior to the TSS [75], although they may be present as much as 500 bases upstream of the TSS [76].

From an industrial perspective, it is preferable to have a system that displays little variation, even if the overall output of that system is, on average, slightly less than that of an alternative that displays irregularities; synthetic biology aims to be boringly predictable rather than wonderfully complex. Candidate promoters for synthetic biology must therefore be well-characterized and yield consistent results, and also be insulated from the background metabolisms and molecular control systems. However, consistency is often confounded by the inherently stochastic nature of gene expression, which subjects both promoters and any downstream proteins used in their characterization to large degrees of noise [28], as well as the all-or-nothing phenomenon [29] in inducible systems, wherein expression is typically fully induced in a subset of the population whereas the remaining cells display no expression [22,30,31].

Natural promoter sequences

The promoters available for use in synthetic systems have generally been limited to those endogenous elements isolated from model organisms, for instance, the Escherichia coli lac promoter and derivatives thereof [3235] and the arabinose-inducible PBAD [3638] promoter.

Phage genomes can also be used to generate novel promoters. For example the pL promoter, isolated from bacteriophage lambda, provides medium to high expression levels, and is tightly thermally-regulated by the cI repressor [34,39]. pL has been successfully employed to increase yield of various proteins in E. coli expression systems [4042]. Similarly, the T7 RNA polymerase-based promoter system, also initially isolated from bacteriophage, has been widely adopted [34,43].

Although natural promoters are widely used in relatively simple, laboratory applications, the relative paucity of sufficiently characterized elements makes their use in control in industrial contexts problematic. Additionally, natural promoter activity is often context-specific [3] and subject to interaction with a multitude of regulatory proteins, rendering prediction of activity levels under varying conditions non-trivial [44]. As a result of these inherent limitations, researchers have increasingly turned to libraries of synthetic promoter elements to meet their needs.

Molecular approaches for the production of synthetic promoter libraries

Saturation mutagenesis of flanking regions

A key method of forming synthetic promoter libraries (SPLs) is based on the observation that the flanking regions surrounding consensus motifs within the promoter sequence have a role in determining activity [45]. Degenerate oligonucleotides allow known consensus motifs to be maintained whereas the flanking regions are mutagenized, leading to altered promoter activity. For example saturation mutagenesis of flanking regions (SMFR) was successfully used to produce a SPL with a 400-fold activity range in Lactococcus lactis, with greater range being reported as a result of synthesis errors in the consensus sequences and alteration to flank length [24,45]. However, the initial approach taken to saturation mutagenesis by Jensen and Hammer [24,45] does not take into account the context-dependant nature of promoter activity. Consequently, current SPL generation uses a single PCR stage, with degenerate oligonucleotides coupled to either a full-length or truncated version of the gene that the promoter is intended to drive. This improvement allows for ectopic analysis or replacement of a wild-type promoter with a synthetic alternative, although maintaining the 5’ mRNA of the target gene [23,46]. Promoter function is maintained due to the preservation of the key consensus regions within the sequence, with altered expression levels likely being the result of minor changes in DNA confirmation within the randomized flanks [45].

SMFR has been successfully applied in a variety of prokaryotes and eukaryotes, including Corynebacterium glutamicum [47] and Streptomyces coelicolor [48], yielding robust libraries with broad expression profiles. The methodology has also shown applicability in Saccharomyces cerevisiae, wherein screening of an initial large library of colonies ultimately yielded 20 characterized promoters, displaying expression levels of yeast-EGFP that varied by approximately 22-fold [21]. In a separate study, a selection of constitutive promoters was initially isolated from the S. cerevisiae genome, and expression levels were subsequently characterized using expression profiles available from public databases. The promoter of the gene PFΥ1 was chosen as a starting point for its robust expression profile [49]. Knowledge of PFΥ1 structure enabled identification of a rDNA enhancer-binding protein and a poly-dT that were important for transcription initiation [50]. These regions were therefore held constant whereas a 48 bp section of the promoter core was randomized, providing a library of 36 promoter elements with a broad range of expression levels. It must be noted that none of the new sequences provided higher expression levels than the original PFΥ1 promoter [49]. This inability to produce a synthetic promoter with higher expression levels than a natural alternative was also reported by McWhinnie and Nano [51].

Although SMFR has successfully provided many new promoters, the technique requires labour intensive cloning and an a priori knowledge of promoter structure in the organism of interest, something that may not be immediately available in industrially relevant microbes. Furthermore, as many libraries use composite promoter scaffolds as a starting point, establishing a definitive wild-type reference expression baseline is impossible. Definitively stating whether SMFR will improve wild-type expression capability pre hoc, is therefore problematic [3]. Additionally, by restricting mutagenesis to only the flanking regions, SMFR fails to take into account alterations to consensus sequences, which are known to play a significant role in modulating expression strength.

Error-prone PCR

Generating a SPL by applying error-prone PCR (epPCR) to an entire promoter sequence obviates any a priori knowledge of functional motif location and can potentially result in promoters with entirely new characteristics [3]. This methodology was successfully used to mutagenize a bacteriophage PL-λ promoter that was subsequently placed upstream of a green fluorescent protein (GFP) coding sequence and transformed into E. coli, resulting in a library containing approximately 9000–12000 functional clones [52,53]. Visual screening of the colonies resulted in a subset of 200 promoters, of which 27, representing 22 discrete promoter sequences, were found to give homogeneous expression levels. Subsequently, thorough characterization of this promoter subset resulted in a promoter library which was successfully employed to modulate levels of phosphoenolpyruvate carboxylase and lycopene production in E. coli [53]. epPCR for promoter production has also been employed in C. glutamicum, where iterative rounds of high-throughput sorting and analysis at the single-cell level ultimately yielded a library of 20 well-characterized sequences from an initial library of 105 mutagenized cells [54]. The technique has also been successfully applied in yeast [55].

Despite these successes, the epPCR approach to SPL production has certain limitations: a reliance on a selection of a small subset of colonies for further analysis [53,54] renders discovery of a true optimum problematic. Moreover, the extensive screening required to isolate said subset should not be underestimated; it is typical for initial libraries of hundreds or thousands of bacterial colonies to ultimately yield relatively few fully characterized promoters. Both these problems become less of an issue if visual selection of colonies is replaced by high-throughput analytical techniques such as fluorescence-activated cell sorting and/or imaging cytometry.

Hybrid promoter engineering

In addition to the two mutagenic techniques discussed above, the generation of synthetic promoters through hybridization of existing promoter elements provides an alternative strategy for promoter genesis. By combining minimal core promoter elements with various combinations of modular upstream activation sequences (UAS), Blazeck et al. [56] demonstrated that expression levels could be increased compared with a wild-type baseline in S. cerevisiae. A roughly linear relationship was observed between the number of UAS modules added and promoter strength, with the addition of four such elements boosting expression of a weak constitutive promoter to levels comparable with the strongest endogenous promoter [56]. Transcriptional increase was shown to depend both on the core element and UAS, but all core promoters were amenable to improvement [56].

Computational methods for synthetic promoter discovery

Although the above molecular methodologies have certainly provided new promoters of varying activities, these approaches do not represent a systematic, theoretical examination of the promoter design space. If, for arguments sake, a promoter sequence is 100 bp in length, there are 4100 potential promoter sequences. Therefore, although the best sequence discovered by molecular-based SPL may be sufficient for some experimental purposes, it is possible that other optima are present. In silico methods that are capable of deciphering the effect of individual DNA bases and motifs, or predicting promoter activity level in advance of in vivo characterization have, in this context, considerable potential [57]. Conventionally, the use of computational techniques in pathway design and optimization has been limited to post hoc data analytics [21]. However, computational modelling in biological systems design and optimization is becoming more widespread, and a number of computational methodologies are available to facilitate the de novo design of synthetic promoter sequences.

Position weight matrix models

Position weight matrix (PWM) models have been widely applied for the detection of transcription factor binding sites [58,59], and have also shown some promise in predicting promoter strength. By breaking promoter sequences into constitutive motifs, PWM models were able to predict the strength of E. coli core promoter sequences recognized by sigma factor σE to a relatively high degree of accuracy [60]. The core promoter PWM was subsequently combined with a score describing the activity of upstream elements to provide a model capable of predicting the strength of entire promoter sequences [61]. In addition to this predictive power, PWM models provide increased understanding of promoter structure, something that is often limited in novel microbial chassis.

Although PWM models certainly have the potential to be applied to de novo sequence design, they are not without limitations. PWMs may prove inadequate for modelling in promoter families with a less conserved nature than those which interact with σE, as poorly conserved sequences required greater complexity within the model [60]. Application of PWMs in novel microbial chassis, where understanding of interactions between proteins and promoter sequences can be limited, may therefore be challenging.

Additionally, by assuming that the contribution of individual nucleotides to DNA-protein binding is independent and additive [61], PWMs fail to account for the effect of interactions between positions. Despite these limitations, the application of PWMs for the pre hoc determination of strength in certain promoter families carries great potential.

Partial least squares regression

The use of statistical modelling to quantitatively link DNA sequence to function is not a new concept [62], although as a method for the generation of synthetic promoters it remains underutilized. In a pioneering study, 25 E. coli promoters were analysed using a partial least squares (PLS) methodology, resulting in a statistical model that analysed the contribution of each individual nucleotide at any given position in the DNA sequence. In order to validate the model, two synthetic sequences with predicted high activity levels were synthesized. The −35, −10 and +1 sites were determined using the consensus sequence of the training set of 25 promoters, whereas the remainder of the synthetic sequences were determined using regression coefficients provided by the modelling process [62]. In vivo characterization of the synthetic promoters revealed activity levels within approximately 8% of the strength predicted by the model. Furthermore, the synthetic sequences were shown to provide higher expression levels than any of those sequences found within the training set [62].

Similar statistical methods were later applied to quantitatively link promoter structure with function for a library of synthetic E. coli promoters that were generated through the randomization of flanking regions [29]. The generated model was able to predict, with reasonable accuracy, the strength of promoter sequences that had not been used in the construction of the model [29]. In further validation of this computational technique, the promoter strength predictive model was subsequently utilized to predict the strength of an endogenous E. coli promoter, that of the ppc gene [63]. Based on this information, stronger promoters were selected from the previously characterized promoter library [29] in order to fine-tune ppc expression levels. This knock-in approach resulted in an increase in expression levels roughly in line with the model's predictions, with a 3–4-fold increase in mRNA levels seen at flask scale [63]. Although the PLS regression doubtlessly aided in the optimization process, it was not applied, in this instance to the de novo design of synthetic promoter sequences.

Artificial neural networks

The linear nature of PLS modelling is a drawback when applied to the analysis of promoter sequences, confounding the effects of any interactions between bases with the main effects for each individual nucleotide position [62]. PLS models therefore may not accurately account for the complexity inherent in promoter structure, thereby increasing the probability of prediction errors and inadequate generality [64]. Indeed, many such models lack robust prediction accuracy [65], rendering their use in de novo sequence design challenging.

Artificial neural networks (ANNs) may provide a solution to these issues. Based upon a network of interconnected nodes designed to act as a rudimentary mimic of the brain, ANNs permit machine learning, as the order and force of connections may be altered [66]. By systematically altering node structure during the analysis of a training data set, ANN models can potentially better represent the complex, non-linear interactions occurring within a promoter sequence [64]. ANN modelling has proven successful for de novo promoter design [64]; using a set of synthetic promoters derived from the random mutagenesis of a wild-type E. coli promoter as a training set for an ANN model, strength predictions of sequences generated by in silico mutagenesis were used to select 16 synthetic sequences for in vivo verification [64]. The predicted expression levels displayed good correlation with empirical testing, suggesting that such models are indeed applicable to synthetic promoter design. Indeed, the fact that approximately 30% of de novo designed sequences displayed greater expression levels that the wild-type control [64] compares extremely favourably to the more traditional mutagenesis-based techniques discussed above, where much lower success rates are not uncommon.

The importance of insulation

Whichever method is applied to the generation of SPLs, promoter elements must be sufficiently insulated if they are to be efficiently used in synthetic regulatory systems. Empirical or predictive data regarding promoter strength from characterization using a reporter protein must be comparable to promoter performance when coupled to a protein of interest within a synthetic pathway; context-dependent effects should be minimal. However, achieving context dependency is non-trivial, as fluctuation in promoter activity levels may be the result of a wide array of experimental and/or genetic factors [27,53].

A possible solution to this problem is to separate core elements from their genetic context through the use of insulator sequences, such as a defined 5’ mRNA sequence [67]. By using such insulators, promoter elements from a SPL can produce constant relative levels of various reporter proteins when used for both plasmid and chromosomal expression [67].

Conclusion

The ability to select a reliable promoter of known activity is of paramount importance for synthetic biology. Indeed, promoters with different and, most importantly, predictable effects on transcription may be used to regulate complex gene circuits, balance engineered metabolic pathways and exploit new chassis for industrial-scale applications. As reviewed here, a number of molecular and computational methodologies are available for the discovery and design of new constitutive promoters. Each technique has advantages and weaknesses, and a selection of one over the other will depend on the aims of specific projects. However, to date, computational approaches to promoter design remain underutilized aside from proof of principle studies in model organisms. As the applications of synthetic biology become more entrenched in the future bio-economy, which may require the development of different chassis, the application of computational modelling to promoter design can enhance and accelerate the design process and ultimately enhance our fundamental knowledge of genetic regulation in complex systems.

Abbreviations

     
  • ANN

    artificial neural network

  •  
  • epPCR

    error-prone PCR

  •  
  • PLS

    partial least squares

  •  
  • PWM

    position weight matrix

  •  
  • SMFR

    saturation mutagenesis of flanking regions

  •  
  • SPL

    synthetic promoter library

  •  
  • TSS

    transcription start site

  •  
  • UAS

    upstream activation sequence

Synthetic Biology UK 2015: Held at Kingsway Hall Hotel, London, U.K., 1–3 September 2015

References

References
1
Ajikumar
P.K.
Xiao
W.H.
Tyo
K.E.J.
Wang
Y.
Simeon
F.
Leonard
E.
Mucha
O.
Heng Phon
T.
Pfeifer
B.
Stephanopoulos
G.
Isoprenoid pathway optimization for taxol precursor overproduction in Escherichia coli
Science
2010
, vol. 
330
 (pg. 
70
-
74
)
[PubMed]
2
Tyo
K.E.J
Ajikumar
P.K.
Stephanopoulos
G.
Stabilized gene duplication enables long-term selection-free heterologous pathway expression
Nat. Biotechnol.
2009
, vol. 
27
 (pg. 
760
-
765
)
[PubMed]
3
Blazeck
J.
Alper
H.S.
Promoter engineering: recent advances in controlling transcription at the most fundamental level
Biotechnol. J.
2013
, vol. 
8
 (pg. 
46
-
58
)
[PubMed]
4
Lin
Z.
Xu
Z.
Li
Y.
Wang
Z.
Chen
T.
Zhao
X.
Metabolic engineering of Escherichia coli for the production of riboflavin
Microb. Cell Fact.
2014
, vol. 
13
 (pg. 
1
-
12
)
[PubMed]
5
Ravasi
P.
Peiru
S.
Gramajo
H.
Menzella
H.G.
Design and testing of a synthetic biology framework for genetic engineering of Corynebacterium glutamicum
Microb. Cell Fact.
2012
, vol. 
11
 (pg. 
147
-
158
)
[PubMed]
6
Markley
A.L.
Begemann
M.B.
Clarke
R.E.
Gordon
G.C.
Pfleger
B.F.
Synthetic Biology toolbox for controlling gene expression in the Cyanobacterium Synechococcus sp. strain PCC 7002
ACS Synth. Biol.
2015
, vol. 
4
 (pg. 
595
-
603
)
[PubMed]
7
Quax
T.E.F.
Claassens
N.J.
Söll
D.
van der Oost
J.
Codon bias as a means to fine-tune gene expression
Mol. Cell
2015
, vol. 
59
 (pg. 
149
-
161
)
[PubMed]
8
Angov
E.
Codon usage: nature's roadmap to expression and folding of proteins
Biotechnol. J.
2011
, vol. 
6
 (pg. 
650
-
659
)
[PubMed]
9
Curran
K.A.
Karim
A.S.
Gupta
A.
Alper
H.S.
Use of expression-enhancing terminators in Saccharomyces cerevisiae to increase mRNA half-life and improve gene expression control for metabolic engineering applications
Metab. Eng.
2013
, vol. 
19
 (pg. 
88
-
97
)
[PubMed]
10
Jones
J.A.
Toparlak
Ö.D.
Koffas
M.A.
Metabolic pathway balancing and its role in the production of biofuels and chemicals
Curr. Opin. Biotech.
2015
, vol. 
33
 (pg. 
52
-
59
)
11
Bloom
J.D.
Meyer
M.M.
Meinhold
P.
Otey
C.R.
MacMillan
D.
Arnold
F.H.
Evolving strategies for enzyme engineering
Curr. Opin. Struct. Biol.
2005
, vol. 
15
 (pg. 
447
-
452
)
[PubMed]
12
Dueber
J.E.
Wu
G.C.
Malmirchegini
G.R.
Moon
T.S.
Petzold
C.J.
Ullal
A.V.
Prather
K.L.J.
Keasling
J.D.
Synthetic protein scaffolds provide modular control over metabolic flux
Nat. Biotechnol.
2009
, vol. 
27
 (pg. 
753
-
759
)
[PubMed]
13
Boyle
P.M.
Silver
P.A.
Parts plus pipes: synthetic biology approaches to metabolic engineering
Metab. Eng.
2012
, vol. 
14
 (pg. 
223
-
232
)
[PubMed]
14
Parsons
J.B.
Frank
S.
Bhella
D.
Liang
M.
Prentice
M.B.
Mulvihill
D.P.
Warren
M.J.
Synthesis of empty bacterial microcompartments, directed organelle protein incorporation, and evidence of filament-associated organelle movement
Mol. Cell
2010
, vol. 
38
 (pg. 
305
-
315
)
[PubMed]
15
Mellin
J.R.
Cossart
P.
Unexpected versatility in bacterial riboswitches
Trends Genet.
2015
, vol. 
31
 (pg. 
150
-
156
)
[PubMed]
16
Green
A.A.
Silver
P.A.
Collins
J.J.
Yin
P.
Toehold switches: de-novo-designed regulators of gene expression
Cell
2014
, vol. 
159
 (pg. 
925
-
939
)
[PubMed]
17
Elowitz
M.B.
Leibler
S.
A synthetic oscillatory network of transcriptional regulators
Nature
2000
, vol. 
403
 (pg. 
335
-
338
)
[PubMed]
18
Gardner
T.S.
Cantor
C.R.
Collins
J.J.
Construction of a genetic toggle switch in Escherichia coli
Nature
2000
, vol. 
403
 (pg. 
339
-
342
)
[PubMed]
19
Guido
N.J.
Wang
X.
Adalsteinsson
D.
McMillen
D.
Hasty
J.
Cantor
C.R.
Elston
T.C.
Collins
J.J.
A bottom-up approach to gene regulation
Nature
2006
, vol. 
439
 (pg. 
856
-
860
)
[PubMed]
20
Lu
T.K.
Khalil
A.S.
Collins
J.J.
Next-generation synthetic gene networks
Nat. Biotechnol.
2009
, vol. 
27
 (pg. 
1139
-
1150
)
[PubMed]
21
Ellis
T.
Wang
X.
Collins
J.J.
Diversity-based, model-guided construction of synthetic gene networks with predicted functions
Nat. Biotechnol.
2009
, vol. 
27
 (pg. 
465
-
471
)
[PubMed]
22
Siegele
D.A.
Hu
J.C.
Gene expression from plasmids containing the araBAD promoter at subsaturating inducer concentrations represents mixed populations
Proc. Natl. Acad. Sci. U.S.A.
1997
, vol. 
94
 (pg. 
8168
-
8172
)
[PubMed]
23
Hammer
K.
Mijakovic
I.
Jensen
P.R.
Synthetic promoter libraries–tuning of gene expression
Trends Biotechnol.
2006
, vol. 
24
 (pg. 
53
-
55
)
[PubMed]
24
Jensen
P.R.
Hammer
K.
Artificial promoters for metabolic optimization
Biotechnol. Bioeng.
1998
, vol. 
58
 (pg. 
191
-
195
)
[PubMed]
25
Khlebnikov
A.
Datsenko
K.A.
Skaug
T.
Wanner
B.L.
Keasling
J.D.
Homogeneous expression of the PBAD promoter in Escherichia coli by constitutive expression of the low-affinity high-capacity AraE transporter
Microbiol.
2001
, vol. 
147
 (pg. 
3241
-
3247
)
26
Hahn
S.
Young
E.T.
Transcriptional regulation in Saccharomyces cerevisiae: transcription factor regulation and function, mechanisms of initiation, and roles of activators and coactivators
Genetics
2011
, vol. 
189
 (pg. 
705
-
736
)
[PubMed]
27
Kelly
J.R.
Rubin
A.J.
Davis
J.H.
Ajo-Franklin
C.M.
Cumbers
J.
Czar
M.J.
de Mora
K.
Glieberman
A.L.
Monie
D.D.
Endy
D.
Measuring the activity of BioBrick promoters using an in vivo reference standard
J. Biol. Eng.
2009
, vol. 
3
 pg. 
4
 
[PubMed]
28
Rudge
T.J.
Brown
J.R.
Federici
F.
Dalchau
N.
Phillips
A.
Ajioka
J.W.
Haseloff
J.
Characterization of intrinsic properties of promoters
ACS Synth. Biol.
2016
, vol. 
5
 (pg. 
89
-
98
)
[PubMed]
29
De Mey
M.
Maertens
J.
Lequeux
G.J.
Soetaert
W.K.
Vandamme
E.J.
Construction and model-based analysis of a promoter library for E. coli: an indispensable tool for metabolic engineering
BMC Biotechnol.
2007
, vol. 
7
 pg. 
34
 
[PubMed]
30
Morgan-Kiss
R.M.
Wadler
C.
Cronan
J.E.
Jr
Long-term and homogeneous regulation of the Escherichia coli araBAD promoter by use of a lactose transporter of relaxed specificity
Proc. Natl. Acad. Sci. U.S.A.
2002
, vol. 
99
 (pg. 
7373
-
7377
)
[PubMed]
31
Keasling
J.D.
Gene-expression tools for the metabolic engineering of bacteria
Trends Biotechnol.
1999
, vol. 
17
 (pg. 
452
-
460
)
[PubMed]
32
Makoff
A.J.
Oxer
M.D.
High level heterologous expression in E. coli using mutant forms of the lac promoter
Nucleic Acids Res.
1991
, vol. 
19
 (pg. 
2417
-
2421
)
[PubMed]
33
de Boer
H.
Comstock
L.J.
Vasser
M.
The tac promoter: a functional hybrid derived from the trp and lac promoters
Proc. Natl. Acad. Sci. U.S.A.
1983
, vol. 
80
 (pg. 
21
-
25
)
[PubMed]
34
Terpe
K.
Overview of bacterial expression systems for heterologous protein production: from molecular and biochemical fundamentals to commercial systems
Appl. Microbiol. Biotechnol.
2006
, vol. 
72
 (pg. 
211
-
222
)
[PubMed]
35
Jensen
P.R.
Westerhoff
H.V.
Michelsen
O.
The use of lac-type promoters in control analysis
Eur. J. Biochem.
1993
, vol. 
211
 (pg. 
181
-
191
)
[PubMed]
36
Wycuff
D.R.
Matthews
K.S.
Generation of an AraC-araBAD promoter-regulated T7 expression system
Anal. Biochem.
2000
, vol. 
277
 (pg. 
67
-
73
)
[PubMed]
37
Guzman
L.M.
Belin
D.
Carson
M.J.
Beckwith
J.
Tight regulation, modulation and high-level expression by vectors containing the Arabinose PBAD promoter
J. Bacteriol.
1995
, vol. 
177
 (pg. 
4121
-
4130
)
[PubMed]
38
Cagnon
C.
Valverde
V.
Masson
J.M.
A new family of sugar-inducible expression vectors for Escherichia coli
Protein Eng.
1991
, vol. 
4
 (pg. 
843
-
847
)
[PubMed]
39
Valdez-Cruz
N.A.
Caspeta
L.
Pérez
N.O.
Ramírez
O.T.
Trujillo-Roldán
M.A.
Production of recombinant proteins in E. coli by the heat inducible expression system based on the phage lambda pL and/or pR promoters
Microb. Cell Fact.
2010
, vol. 
9
 pg. 
18
 
[PubMed]
40
Elvin
C.M.
Thompson
P.R.
Argall
M.E.
Hendry
P.
Stamford
N.P.J.
Lilley
P.E.
Dixon
N.E.
Modified bacteriophage lambda promoter vectors for overproduction of proteins in Escherichia coli
Gene
1990
, vol. 
87
 (pg. 
123
-
126
)
[PubMed]
41
Mellado
R.P.
Salas
M.
High level synthesis in Escherichia coli of the Bacillus subtillis phage phi 29 proteins p3 and p4 under the control of phage lambda PL promoter
Nucleic Acids Res.
1982
, vol. 
10
 (pg. 
5773
-
5784
)
[PubMed]
42
Simons
G.
Remaut
E.
Allet
B.
Devos
R.
Fiers
W.
High-level expression of human interferon gamma in Escherichia coli under control of the pL promoter of bacteriophage lambda
Gene
1984
, vol. 
28
 (pg. 
55
-
64
)
[PubMed]
43
Studier
F.W.
Moffatt
B.A.
Use of bacteriophage T7 RNA polymerase to direct selective high-level expression of cloned genes
J. Mol. Biol.
1986
, vol. 
189
 (pg. 
113
-
130
)
[PubMed]
44
Collado-Vides
J.
Magasanik
B.
Gralla
J.D.
Control site location and transcriptional regulation in Escherichia coli
Microbiol. Rev.
1991
, vol. 
55
 (pg. 
371
-
394
)
[PubMed]
45
Jensen
P.R.
Hammer
K.
The sequence of spacers between the consensus sequences modulates the strength of prokaryotic promoters
Appl. Environ. Microb.
1998
, vol. 
64
 (pg. 
82
-
87
)
46
Solem
C.
Jensen
P.R.
Modulation of gene expression made easy
Appl. Environ. Microb.
2002
, vol. 
68
 (pg. 
2397
-
2403
)
47
Rytter
J.V.
Helmark
S.
Chen
J.
Lezyk
M.J.
Solem
C.
Jensen
P.R.
Synthetic promoter libraries for Corynebacterium glutamicum
Appl. Microbiol. Biotechnol.
2014
, vol. 
98
 (pg. 
2617
-
2623
)
[PubMed]
48
Sohoni
S.V.
Fazio
A.
Workman
C.T.
Mijakovic
I.
Lantz
A.E.
Synthetic promoter library for modulation of Actinorhodin production in Streptomyces coelicolor A3(2)
PLoS One
2014
, vol. 
9
 pg. 
e99701
 
[PubMed]
49
Blount
B.A.
Weenink
T.
Vasylechko
S.
Ellis
T.
Rational diversification of a promoter providing fine-tuned expression and orthogonal regulation for synthetic biology
PLoS One
2012
, vol. 
7
 pg. 
e33279
 
[PubMed]
50
Angermayr
M.
Oechsner
U.
Bandlow
W.
Reb1p-dependent DNA bending effects nucleosome positioning and constitutive transcription at the yeast profilin promoter
J. Biol. Chem.
2003
, vol. 
278
 (pg. 
17918
-
17926
)
[PubMed]
51
McWhinnie
R.L.
Nano
F.E.
Synthetic promoters functional in Francisella novicida and Escherichia coli
Appl. Environ. Microb.
2013
, vol. 
80
 (pg. 
226
-
234
)
52
Fischer
C.R.
Alper
H.
Nevoigt
E.
Jensen
K.L.
Stephanopoulos
G.
Response to Hammer et al.: tuning genetic control–importance of thorough promoter characterization versus generating promoter diversity
Trends Biotechnol.
2006
, vol. 
24
 (pg. 
55
-
56
)
[PubMed]
53
Alper
H.
Fischer
C.
Nevoigt
E.
Stephanopoulos
G.
Tuning genetic control through promoter engineering
Proc. Natl. Acad. Sci. U.S.A.
2005
, vol. 
102
 (pg. 
12678
-
12683
)
[PubMed]
54
Yim
S.S.
An
S.J.
Kang
M.
Lee
J.
Jeong
K.J.
Isolation of fully synthetic promoters for high-level gene expression in Corynebacterium glutamicum
Biotechnol. Bioeng.
2013
, vol. 
110
 (pg. 
2959
-
2969
)
[PubMed]
55
Nevoigt
E.
Kohnke
J.
Fischer
C.R.
Alper
H.
Stahl
U.
Stephanopoulos
G.
Engineering of promoter replacement cassettes for fine-tuning of gene expression in Saccharomyces cerevisiae
Appl. Environ. Microbiol.
2006
, vol. 
72
 (pg. 
5266
-
5273
)
[PubMed]
56
Blazeck
J.
Garg
R.
Reed
B.
Alper
H.S.
Controlling promoter strength and regulation in Saccharomyces cerevisiae using synthetic hybrid promoters
Biotechnol. Bioeng.
2012
, vol. 
109
 (pg. 
2884
-
2895
)
[PubMed]
57
Jensen
K.
Alper
H.
Fischer
C.
Stephanopoulos
G.
Identifying functionally important mutations from phenotypically diverse sequence data
Appl. Environ. Microb.
2006
, vol. 
72
 (pg. 
3696
-
3701
)
58
Sinha
S.
On counting position weight matrix matches in a sequence, with application to discriminative motif finding
Bioinformatics
2006
, vol. 
22
 (pg. 
e454
-
e463
)
[PubMed]
59
Stromo
G.D.
DNA binding sites: representation and discovery
Bioinformatics
2000
, vol. 
16
 (pg. 
16
-
23
)
[PubMed]
60
Rhodius
V.A.
Mutalik
V.K.
Predicting strength and function for promoters of the Escherichia coli alternative sigma factor, σE
Proc. Natl. Acad. Sci. U.S.A.
2010
, vol. 
107
 (pg. 
2854
-
2859
)
[PubMed]
61
Rhodius
V.A.
Mutalik
V.K.
Gross
C.A.
Predicting the strength of UP-elements and full-length E. coli σE promoters
Nucleic Acids Res.
2012
, vol. 
40
 (pg. 
2907
-
2924
)
[PubMed]
62
Jonsson
J.
Norberg
T.
Carlsson
L.
Gustafsson
C.
Wold
S.
Quantitative sequence-activity models (QSAM)-tools for sequence design
Nucleic Acids Res.
1993
, vol. 
12
 (pg. 
733
-
739
)
63
De Mey
M.
Maertens
J.
Boogmans
S.
Soetaert
W.K.
Vandamme
E.J.
Cunin
R.
Foulquié-Moreno
M.R.
Promoter knock-in: a novel rational method for the fine tuning of genes
BMC Biotechnol.
2010
, vol. 
10
 pg. 
26
 
[PubMed]
64
Meng
H.
Wang
J.
Xiong
Z.
Xu
F.
Zhao
G.
Wang
Y.
Quantitative design of regulatory elements based on high-precision strength prediction using artificial neural network
PLoS One
2013
, vol. 
8
 pg. 
e60288
 
[PubMed]
65
Meng
H.
Wang
Y.
Cis-acting regulatory elements: from random screening to quantitative design
Quant. Biol.
2015
, vol. 
3
 (pg. 
107
-
114
)
66
Buscema
P.M.
Massini
G.
Maurelli
G.
Artificial neural networks: an overview and their use in the analysis of the AMPHORA-3 dataset
Subst. Use Misuse
2014
, vol. 
49
 (pg. 
1555
-
1568
)
[PubMed]
67
Davis
J.H.
Rubin
A.J.
Sauer
R.T.
Design, construction and characterization of a set of insulated bacterial promoters
Nucleic Acids Res.
2011
, vol. 
39
 (pg. 
1131
-
1141
)
[PubMed]
68
Harley
C.B.
Reynolds
R.P.
Analysis of E. coli promoter sequences
Nucleic Acids Res.
1987
, vol. 
15
 (pg. 
2343
-
2361
)
[PubMed]
69
Kanhere
A.
Bansal
M.
Structural properties of promoters: similarities and differences between prokaryotes and eukaryotes
Nucleic Acids Res.
2005
, vol. 
33
 (pg. 
3165
-
3175
)
[PubMed]
70
Nair
T.M.
Kulkarni
B.D.
On the consensus structure within the E. coli promoters
Biophys. Chem.
1994
, vol. 
48
 (pg. 
383
-
393
)
[PubMed]
71
Estrem
S.T.
Gaal
T.
Ross
W.
Gourse
R.L.
Identification of an UP element consensus sequence for bacterial promoters
Proc. Natl. Acad. Sci. U.S.A.
1998
, vol. 
95
 (pg. 
9761
-
9766
)
[PubMed]
72
Ross
W.
Aiyar
S.E.
Salomon
J.
Gourse
R.L.
Escherichia coli promoters with UP elements of different strengths: modular structure of bacterial promoters
J. Bacteriol.
1998
, vol. 
180
 (pg. 
5375
-
5383
)
[PubMed]
73
Hampsey
M.
Molecular genetics of the RNA polymerase II general transcriptional machinery
Microbiol. Mol. Biol. Rev.
1998
, vol. 
62
 (pg. 
465
-
503
)
[PubMed]
74
Basehoar
A.D.
Zanton
S.J.
Pugh
B.F.
Identification and distinct regulation of yeast TATA box-containing genes
Cell
2004
, vol. 
116
 (pg. 
699
-
709
)
[PubMed]
75
Hughes
J.D.
Estep
P.W.
Tavazoie
S.
Church
G.M.
Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae
J. Mol. Biol.
2000
, vol. 
296
 (pg. 
1205
-
1214
)
[PubMed]
76
Harbison
C.T.
Gordon
D.B.
Lee
T.I.
Rinaldi
N.J.
Macisaac
K.D.
Danford
T.W.
Hannett
N.M.
Tagne
J.B.
Reynolds
D.B.
Yoo
J.
, et al. 
Transcriptional regulatory code of a eukaryotic genome
Nature
2004
, vol. 
431
 (pg. 
99
-
104
)
[PubMed]
This is an open access article published by Portland Press Limited on behalf of the Biochemical Society and distributed under the Creative Commons Attribution Licence 4.0 (CC BY-NC-ND).