Alphavirus nsP2 proteins are multifunctional and essential for viral replication. The protease role of nsP2 is critical for virus replication as only the virus protease activity is used for processing of the viral non-structural polypeptide. Chikungunya virus is an emerging disease problem that is becoming a world-wide health issue. We have generated purified recombinant chikungunya virus nsP2 proteins, both full length and a truncated protease domain from the C-terminus of the nsP2 protein. Enzyme characterization shows that the protease domain alone has different properties compared with the full length nsP2 protease. We also show chikungunya nsP2 protease possesses different substrate specificity to the canonical alphavirus nsP2 polyprotein cleavage specificity. Moreover, the chikungunya nsP2 also appears to differ from other alphavirus nsP2 in its distinctive ability to recognize small peptide substrates.
Chikungunya virus (CHIKV), the causative agent of chikungunya fever, is an Alphavirus belonging to the family Togaviridae, that was originally isolated in Tanzania, Africa . In Africa, CHIKV is believed to be maintained in a sylvatic transmission cycle between forest dwelling mosquitoes and wild non-human primates such as vervet monkeys or baboons [2,3]. In Asia and elsewhere transmission is generally maintained in an urban human–mosquito–human cycle  with Aedes genus mosquitoes being the major vectors, especially Ae. aegypti and Ae. albopictus. Although Ae. albopictus or the Asian tiger mosquito is a native species of Southeast Asia, it was involved in a large chikungunya outbreak in the Indian Ocean islands in 2005–2006 . A mutation of the CHIKV E1 structural protein (Ala226Val) was observed at the beginning of the outbreak and it was subsequently found that the E1 mutation enhanced infectivity in Ae. albopictus mosquitoes . Regrettably, Ae. albopictus is one of the world's 100 most invasive species, increasing the risk of CHIKV emerging or re-emerging and becoming a major health problem around the world . This has been exemplified by the recent introduction of CHIKV into the Americas, where, within a short time, more than a million suspected cases of chikungunya fever are believed to have occurred . Chikungunya fever is typically characterized by headache, high fever, skin rash and nausea as well as joint pain that can be severe and long lasting, and, although rare, fatalities have been reported .
CHIKV is comprised of a genome of an approximately 12 kb positive sense single-stranded RNA including a 5′cap and 3′poly(A) tail . Almost two-thirds of the RNA genome encodes for the non-structural polyprotein precursor, nsP1234, which is processed by the virally encoded protease (nsP2) activity generating the replication complex. Initial processing generates nsP4 and nsP123 which direct synthesis of the negative sense template RNAs, while subsequent processing into nsP1 and nsP23 generates a complex that produces both sense and anti-sense RNAs. Final processing generates discrete nsP2 and nsP3 proteins . The replication complex additionally transcribes a small 26S subgenomic RNA from the 3’-end of the genome that encodes the three structural proteins (Capsid, E1 and E2) as well as two small accessory proteins which are subsequently processed by viral and host cell proteases .
In addition to its protease activity, the viral nsP2 protein possesses multiple enzymatic functions including RNA helicase, nucleoside triphosphatase (NTPase) and RNA-dependent 5′-triphosphatase activities [11–13]. The protease domain is located at the C-terminus of the nsP2 protein, and it is postulated that the CHIKV nsP2 is a papain-like cysteine protease although there is no direct experimental evidence to support this statement [11,14]. However, other alphaviruses such as Sindbis virus (SINV), Semliki forest virus (SFV) and Venezuelan equine encephalitis virus (VEEV) have been well-studied, and the nsP2 protein has been experimentally characterized as a cysteine protease [14–18]. Primary amino acid alignment of the CHIKV nsP2 protease with the other alphaviruses shows the presence of the conserved cysteine and histidine catalytic dyad [18,19], and the protease active site residues are highly conserved across the different alphaviruses.
Presently, only a truncated CHIKV nsP2, consisting of the isolated protease domain (nsP2-Pro) has been proteolytically characterized . In this study we have biochemically characterized the protease activity of the full-length nsP2 protein and compared it to the activity of the isolated protease domain.
Chikungunya viral RNA was prepared from virus stocks of a previously described Thai CHIKV (ECSA E1:226V) isolate  using the illustra RNAspin mini RNA isolation kit (GE Healthcare). First-strand cDNA was synthesized by reverse transcription and was then used as a template for PCR amplification. Specific primers were used to amplify the 2394 nucleotides of the full length nsP2 protein (nsP2-FL) and the 1131 nucleotides of the protease domain of nsP2 (nsP2-Pro). Both sequences were cloned into an engineered vector derived from the pET21d vector. This vector was engineered to contain the maltose binding protein (MBP), a tobacco etch virus (TEV) protease recognition site, an 8xHis-tag and the PreScission protease recognition site, respectively. Moreover this vector was also designed to have a SmaI restriction site to facilitate the cloning by the Ligation Independent Cloning (LIC) method. The recombinants were transformed into Escherichia coli DH5α, and the candidate clones were verified by DNA sequencing.
Protein expression and purification of the nsP2 proteins
The nsP2 recombinant plasmids were extracted and re-transformed into E. coli BL21-CodonPlus® (DE3)-RIL competent cells. The cells were cultured in terrific broth supplemented with 100 μg/ml ampicillin and 34 μg/ml chloramphenicol at 37°C until the cell density reached 0.5 OD600nm. Protein expression was induced with 0.2 mM isopropyl-β-D-thiogalactopyranoside (IPTG) at 18°C in overnight culture. The cells were subsequently centrifuged and kept at -20°C until use. The nsP2-FL and nsP2-Pro proteins were purified by the same protocol. The cell pellets were resuspended with 50 mM Tris/HCl, pH 7.5, 200 mM NaCl, 0.2 M arginine and 0.2 M glutamic acid (equilibration buffer). The cells were chemically lysed by the addition of 2 mg/ml lysozyme and physically lysed by sonication and centrifuged at 10,000 g for 30 min at 4°C. The supernatants were filtered through 0.45 μm membranes before being applied to HisTrap HP columns (GE Healthcare). After application, the columns were washed with 20 column volumes of equilibration buffer followed by 10 column volumes of 20 mM imidazole in equilibration buffer. The MBP-His tag was cleaved with PreScission protease by on-column cleavage at room temperature for 30 min. The cleaved proteins were eluted from the columns and applied to 5 ml HiTrap desalting columns (GE Healthcare) for buffer exchange to a cation exchanger buffer (50 mM potassium phosphate, pH 6.5, 0.2 M arginine and 0.2 M glutamic acid). The cation exchanger columns (1 ml HiTrap™ SP-XL) were used to remove the remaining contaminants. The bound proteins were eluted with 50 mM potassium phosphate, pH 6.5, 0.2 M arginine, 0.2 M glutamic acid and 200 mM NaCl. Finally, the pure proteins were desalted and kept in 50 mM Tris/HCl, pH 7.5, containing 0.2 M arginine and 0.2 M glutamic acid.
Protein concentrations were determined using Bradford reagent (Bio-Rad Laboratories) with BSA as a standard. The purity of the nsP2-FL and nsP2-Pro proteins was evaluated by SDS/PAGE.
Enzyme kinetic characterization
Three synthetic fluorescent substrates were purchased from Peptides International, Inc (Table 1). These substrates corresponded to nsP1/nsP2 (AGA), nsP2/nsP3 (AGC) and nsP3/nsP4 (AGG) cleavage sites of the chikungunya viral non-structural polyprotein. The amino acid substrates spanned P4-P'5 of each scissile site. The synthetic protein substrates were tagged with 2-(N-methylamino)benzoyl (Nma) groups at the N-terminus and 2,4-dinitrophenyl (Dnp) groups at an added lysine residue at the C-terminus. The AGA and AGG peptide substrates were dissolved with dimethyl sulfoxide (DMSO) whereas the AGC substrate was dissolved in dimethylformamide (DMF). Briefly, the stock substrate solutions were diluted in assay buffer (50 mM Tris/HCl, pH 7.5, 0.2 M arginine and 0.2 M glutamic acid) at varying concentrations. Then 90 μl of diluted substrates were pipetted into black 96-well plates. The reactions were started by adding 1 μM (10 μl) of the purified enzyme. The protease activity was continuously measured for substrate depletion using a Beckman Coulter DTX880 multimode detector at 340 nm excitation and 430 nm emission wavelengths at 37°C for 3 h. The progress curves (after subtraction of no enzyme controls) were analysed by Dynafit program© version 3.28.070 (BioKin, Ltd.) to obtain the kinetic parameters .
|Cleavage site (P4–P5′) .||Recognition site .||Designated name .|
|Cleavage site (P4–P5′) .||Recognition site .||Designated name .|
Protease activity characterization
The effects of salt, metal ions and protease inhibitors on nsP2 protease activity were biochemically characterized. Various concentrations of sodium chloride (NaCl) ranging from 100 mM to 2 M were used. The metal ions cobalt (Co2+), magnesium (Mg2+), zinc (Zn2+), nickel (Ni2+) and copper (Cu2+) were used at 2 mM in this study. Four different protease inhibitors were also tested at different final concentrations, 50 μM chymostatin (which inhibits chymotrypsin-like proteases), 10 μM E-64 (a highly selective cysteine protease inhibitor), 100 μM leupeptin (an inhibitor of serine and thiol proteases) and 1 mM phenylmethanesulfonyl fluoride (PMSF; a commonly used serine protease inhibitor). The enzyme reactions were performed as mentioned above with all three synthetic fluorescent substrates. The initial rate reaction was evaluated using GraphPad Prism® software, version 5.01. Percent remaining activity was calculated using the control activity in the absence of an inhibitor and the enzyme activity obtained in the presence of the inhibitor.
Thermal stability test
The freshly purified enzymes were incubated at 37, 42, 50 and 60°C for 10 min and immediately measured for enzyme activity using the synthetic fluorescent AGG substrate. The initial reaction rate was evaluated using GraphPad Prism® software, version 5.01. Percent remaining activity was evaluated by comparing with the control reaction at 37°C.
nsP2 protein expression and purification
Three different expression vectors; His-tag, NusA-His-tag and GST-His-tag fusion, were initially tested as vectors for the expression of the nsP2-FL and nsP2-Pro proteins (data not shown). Results consistently showed that the nsP2 proteins were expressed as inclusion bodies under ‘high’ temperature culturing conditions (37°C), and only partially expressed in a soluble form at 22°C. Although lower temperatures and longer times of expression were investigated product solubility was only improved when a MBP-His fusion tag was added. The approximately 135 kDa (nsP2-FL) and 89 kDa (nsP2-Pro) fusion tag proteins were then expressed for subsequent purification. Several chromatography methods were employed in addition to MBPTrap and HisTrap columns. Moreover various metal ions including Ni2+, Co2+, Cu2+, Zn2+ and Ca2+ were also employed to charge the HisTrap columns. Unfortunately, both nsP2-fusion tag proteins tended to aggregate with contaminating E. coli proteins in all purifications steps (data not shown). Therefore additives to the purification buffers were investigated to dissociate co-eluting aggregate proteins. Finally, a mixture of arginine and glutamic acid that could significantly decrease the protein aggregation was found . HisTrap columns charged with Ni2+ in the presence of 0.2 M arginine and 0.2 M glutamic acid gave the best partial purification from the first column. After cleaving the fusion enzyme with PreScission protease, the nsP2-FL and nsP2-Pro showed as approximately 90 and 43 kDa respectively on SDS/PAGE gels. The few contaminating proteins were subsequently removed by cation exchange column chromatography. It has to be stressed that the purification buffer of all steps contains equal molar concentrations of 0.2 M arginine and glutamic acid. The purified enzymes showed ~95% purity on SDS/PAGE gels stained with Coomassie Brilliant Blue R-250 (Figure 1).
SDS/PAGE showing the purified recombinant CHIKV nsP2-FL and nsP2-Pro
Kinetic parameter determination
Initially, trimeric chromogenic substrates linked to p-nitroaniline (pNa) were used in the protease assay, but protease activity could not be detected with these short chromogenic substrates (data not shown) and thus longer fluorescent peptide substrates were employed. The protease activity was then optimized and analysed according to Michaelis–Menten kinetics. We found that the fluorescent absorption became non-linear at high concentrations of substrates, a phenomenon known as the “inner filter effect” . This problem was overcome by using lower substrate concentrations and analysing the data by progress curve analysis that conforms to the classical Michaelis–Menten equation. The kinetic parameters were then calculated by the Dynafit program [24,25].
Kinetic experiments were performed by varying concentrations of the three substrates. The turn-over number or kcat of the two enzymes for each of the three substrates appear to be similar with both enzymes showing the lowest catalytic rate for the AGG (P3/4) substrate, which is the reported first cleavage site of the viral polyprotein (Table 2). The nsP2-FL Km values are quite similar to one another, whereas the values vary up to 8-fold for the nsP2-Pro. This shows that the two enzymes have different binding affinity for the three substrates. These Km values also are reflected in the catalytic efficiency (kcat/Km) of the two enzymes for all three substrates with the nsP2-FL being similar for the three cleavage sequences whereas the nsP2-Pro showed up to a 6-fold variation in catalytic efficiency.
|Substrates (cleavage site) .||kcat (10−4 S−1) .||Km (μM) .||kcat/Km (10−4 S−1 ·μM −1) .|
|Substrates (cleavage site) .||kcat (10−4 S−1) .||Km (μM) .||kcat/Km (10−4 S−1 ·μM −1) .|
The effect of salt, metal ion and protease inhibitors
Various concentrations of monovalent salt (NaCl) ranging from 100 mM to 2 M were added to the protease activity assay. The enzyme activity of nsP2-FL was not affected by salt up to 2 M (data not shown). However, the nsP2-Pro showed 30–40% inhibition at 1 M and above for AGA (P1/2) and AGC (P2/3) but a variable increase in AGG (P3/4) activity (Figure 2). A characterization study with metal ions was also performed (Figure 3). Both enzymes behaved similarly in the metal ion study. Magnesium did not inhibit either nsP2-FL or nsP2-Pro activity for all three substrates in contrast with nickel which inhibited protease activity some 20–30%. When using AGC as substrate, copper, cobalt and zinc showed inhibition of both enzymes’ activity ranging from 40 to 60%, but these metal ions showed little to no effect on the protease activity when using AGG as a substrate. A curious effect observed was an apparent activation by cobalt for AGA (P1/2) activity for both enzymes. Zinc metal ions inhibited nsP2-Pro for all substrates but inhibited nsP2-FL activity only with the AGA and AGC substrates (Figure 3).
Salt effects on nsP2-Pro activity
The effects of metal ions and protease inhibitors on nsP2 for full length and Pro activity
The effect of different known protease inhibitors on activity was also studied. Four different protease inhibitors were selected. E-64 is a highly selective cysteine protease inhibitor. PMSF is a commonly used serine protease inhibitor whereas leupeptin is specific to serine and thiol proteases and chymostatin inhibits chymotrypsin-like proteases. PMSF did not affect either enzyme when using AGA (P1/2) as a substrate, but using AGC (P2/3) as a substrate PMSF showed no effect on nsP2-Pro while showing 30% inhibition of nsP2-FL. However, in general, these protease inhibitors seemed to have a similar overall effect on both enzymes with inhibition of 10–40% of activity at the concentrations tested (Figure 3).
Thermal stability test
The AGG fluorescent substrate was used in this experiment, and the control reaction was performed at 37°C. At 42°C, the activity of nsP2-FL decreased by 50% and nsP2-Pro lost 75% of activity. Both enzymes behaved similarly at 50°C and 60°C as shown in Figure 4.
Thermal stability test
CHIKV nsP2 is an essential and multifunctional component of the viral replicase complex [10,26,27]. The protease role of nsP2 is critical for virus replication as only the virus protease activity is used for processing the viral non-structural polypeptide. To our knowledge, this is the first enzymatic characterization of the proteolytic activity of the full length nsP2 protein of CHIKV. Initially, most of the recombinant protein was expressed as insoluble inclusion bodies. The protein solubility problem also has been reported for expression of other alphavirus nsP2 proteins, for example the previously reported CHIKV nsP2-Pro, as well as from SFV and VEEV [11,28,29]. We overcame this by fusing the target proteins with an MBP fusion tag, an approach that has been reported to be useful for this problem . After solving the solubility problem, we were able to purify the full length and Pro nsP2 proteins. Initially we found that the soluble proteins were very difficult to purify as the E. coli host proteins tended to aggregate with the recombinant nsP2. Interestingly, an equal molar mixture of arginine and glutamic acid helped reduce the interaction between proteins as well as improve the recombinant protein solubility. These two amino acids work synergistically by generating a ‘crowd’ of charged molecules on the protein surface preventing interactions with other proteins and increasing solubility .
Based on Schechter and Berger's protease nomenclature, the residues on the C-terminal of the scissile bond were assigned as prime site (P') whereas the residues on the N-terminal side were assigned as non-prime site (P) . The residues at position P4–P1′ have been shown to be critical residues to enhance the protease activity and to have a high impact on substrate recognition for the alphavirus nsP2 from SFV . This correlates with our early experiment in which we could not detect protease activity with short tri-peptide substrates. This poor proteoly-tic activity suggests that the flanking region of the scissile bond is very important in determining substrate specificity, and we therefore employed longer peptide substrates (Table 1). These three substrates represent the natural specific cleavage sites of the CHIKV nsP1234 polyprotein. Previous studies with an N-terminal truncated nsP2 (Pro39) protein of SFV showed that cleavage of substrates occurs in a specific manner and order. P3/4 (AGG) site is the first and the most efficient cleavage site as compared with P1/2 (AGA) and P2/3 (AGC) . This implies that the nsP2 protease enzyme should have the highest kcat/Km value toward AGG (P3/4) substrate, less for AGA (P1/2) and even lower activity for AGC (P2/3) substrate. Our characterization of CHIKV nsP2-FL and nsP2-Pro shows that these two enzymes are not only different from each other but are also different from the other reported alphavirus nsP2 proteases. The CHIKV nsP2-Pro showed a marked preference for AGC (P2/3) illustrated by its kcat/Km value. In addition, the CHIKV nsP2 protease activity is in contrast with what has been reported previously for SIN, SFV and VEEV nsP2 proteases as no activity for these enzymes could be observed until cleavage substrates were fused to recombinant tag proteins like thioredoxin or maltose binding protein [33–35]. In fact, SFV nsP2-Pro and full length nsP2 generated no cleavage products if the substrate constructs had less than 10 amino acids on both sides of the sessile site . It was also found that the nsP2-Pro showed greater activity than the nsP2 full length and there were large differences in specificity for the three cleavage sequences for both Pro and full length enzymes . Therefore, it was suggested, the in vivo proteolytic processing is sequentially regulated based on the conformation of polypeptide or replication complex components [36,37]. The conformation of nsP1234 polyprotein may allow the P3/4 (AGG) cleavage site to be exposed on the surface of the polypeptide while the other sites are buried within the polypeptide chain. Once nsP2 protease cleaves the first P3/4 (AGG) cleavage site, this would affect the overall polyprotein conformation and increase the accessibility of the other cleavage sites. It has been suggested that the accessibility/cleavage would also be regulated by the replication complex, the complexes current constituents and the functional status of the complex . Our data support this mechanism, as the kinetic values for each short cleavage sequence do not correlate with the order of the cleavage. We observed that the cleavage site requirements, especially for AGC (P2/3), differ for CHIKV nsP2 compared with the other reported alphavirus nsP2. It was reported that for cleavage to occur the N-terminus of nsP2 was required as well as the macro domain of nsP3, consisting of 170 N-terminal amino acids . If these requirements are lacking then no cleavage of AGC (P2/3) occurs for nsP2 from SFV and SIN, whereas we show that both CHIKV nsP2-FL and nsP2-Pro have activity for a 9-mer AGC (P2/3) substrate.
An amino acid comparison of the full length nsP2 from the three alphaviruses, CHIKV, SFV and SINV, shows relatively high identity ranging from 57% to 71%. Upon closer examination it can be seen that the nsP2 N-terminus is more conserved than the C-terminal protease domain. CHIKV nsP2-Pro compared with the same region of SFV and SINV shows only 65% and 44% amino acid identity, respectively (Figure 5). Structural superposition of CHIKV nsP2-Pro (PDB ID: 3TRK) and SINV nsP2-Pro (PDB ID: 4GUA) illustrates the impact of this amino acid identity difference on the surface charge potential (Figure 6). Of particular interest is the difference shown in the substrate binding region of the two proteins where there are structural differences as well as charge potential differences. Obviously, these variances contribute to the substrate specificity differences observed.
Amino acid alignment of the nsP2 protease domain from CHIKV, SFV and SINV
Stereo view of the electrostatic surface colouring of CHIKV nsP2-Pro and SINV nsP2-Pro
We also characterized the impact of several metal ions and protease inhibitors on the behaviour of both full length and Pro nsP2. In general, both enzymes behaved similarly with two exceptions, Zn2+ appeared to activate the full length enzyme but inhibit the nsP2-Pro for AGG (P3/4) activity and PMSF appeared to inhibit full length but showed little effect on nsP2-Pro for AGC (P2/3).
To our knowledge, the previous alphavirus nsP2 characterization studies of inhibitor and metal ion effects reported have been performed with only a single substrate whereas we used all three substrates. This is of interest as the CHIKV nsP2 appears to behave differently with the test compounds depending on the substrate employed, for example, Co2+ increases AGA (P1/2) activity but inhibited AGC (P2/3) activity. Some of our results are in contrast with a previous report for the CHIKV nsP2-Pro . However, Pastorino et al.  also showed kinetic properties varied for their enzyme with different peptide substrate lengths. We suggest that the peptide length, total as well as number of residues on each side of the scissile bond, and composition of the substrate which includes protein tags and chromogenic or fluorescent chromophores will impact upon the nsP2 catalytic specificity. Nevertheless, examples from the literature , and our present study supports the concept that the various alphavirus nsP2 proteases do display unique characteristics and the protease domain alone can exhibit different characteristics from the full length protein. This would appear to be of small consequence as the current understanding suggests that the virus polypeptide cleavage is regulated by the replication complex. However, the chikungunya nsP2 also appears to differ from other alphavirus nsP2 in its distinctive ability to recognize small peptide substrates. This is of greater potential interest as 50% of nsP2 is found in the nucleus after viral infection , and the functional significance of the localization is still unknown. Therefore an obvious issue to address in the future would be to determine whether there are host substrates for the CHIKV nsP2 protease.
We show here for the first time the enzyme characterization of CHIKV nsP2 full length protein and compared it to the activity of the protease domain alone. The data show that the full length protein and the protease domain alone possess different kinetic properties and also that CHIKV nsP2 is different from other reported alphavirus nsP2 proteases in its ability to cleave small peptide substrates.
Chonticha Saisawang, Pornpan Sillapee, and Kwanhathai Sinsirimongkol performed the experiments. Chonticha Saisawang and Albert Ketterman re-analysed the data. Chonticha Saisawang, Sukathida Ubol, Duncan Smith and Albert Ketterman contributed to the manuscript preparation.
This work was supported by grants from the Office of Higher Education Commission and Mahidol University under the National Research Universities Initiative and Mahidol University.
maltose binding protein
- cleavage sites nsP1/nsP2
non-structural protein 2
full length/protease domain only
Semliki forest virus
Venezuelan equine encephalitis virus