Infection with human noroviruses requires attachment to histo blood group antigens (HBGAs) via the major capsid protein VP1 as a primary step. Several crystal structures of VP1 protruding domain dimers, so called P-dimers, complexed with different HBGAs have been solved to atomic resolution. Corresponding binding affinities have been determined for HBGAs and other glycans exploiting different biophysical techniques, with mass spectrometry (MS) and nuclear magnetic resonance (NMR) spectroscopy being most widely used. However, reported binding affinities are inconsistent. At the extreme, for the same system MS detects binding whereas NMR spectroscopy does not, suggesting a fundamental source of error. In this short essay, we will explain the reason for the observed differences and compile reliable and reproducible binding affinities. We will then highlight how a combination of MS techniques and NMR experiments affords unique insights into the process of HBGA binding by norovirus capsid proteins.
Interactions of human norovirus capsid proteins with HBGAs and experimental methods to obtain binding data
Noroviruses belong to the family of Caliciviridae, non-enveloped viruses with a single-stranded positive-sense RNA. Phylogenetically noroviruses are segregated into ten established genogroups that are further subdivided into genotypes [1]. Viruses of genogroups I, II, IV, VIII, and IX infect humans. Since 2002 GII.4 (genogroup II, genotype 4) norovirus strains have been responsible for most norovirus outbreaks. Noroviruses are the most common cause of acute gastroenteritis worldwide, often leading to epidemics that are difficult to contain. Frequent outbreaks with large case numbers causes substantial economic burden (Centers for Disease Control and Prevention (2021) Norovirus Worldwide: https://www.cdc.gov/norovirus/trends-outbreaks/worldwide.html) [2]. To date, there is no vaccine or antiviral therapy available.
It is well established that infection with human noroviruses critically depends on the attachment of virions to certain glycans, the so-called histo blood group antigens (HBGAs) [3–5]. HBGAs are abundantly present on epithelial cell surfaces in the form of glycolipids or glycoproteins, especially in the gastrointestinal mucosa. All HBGAs share the α-l-Fuc-(1,2)-d-Gal disaccharide (fucose α-(1,2)-linked to galactose) as their core element. This so-called H-antigen can be extended with Gal (galactose) or GalNAc (N-acetylgalactosamine) residues by blood group glycosyltransferases, yielding blood group A or B antigens.
Infectious norovirus capsids consist of 180 copies of the major capsid protein VP1 arranged as 90 dimers in icosahedral symmetry. The outermost sphere of the viral capsid comprises the protruding domain (P-domain) of VP1 while the underlying sphere is made of the shell domain (S-domain). Attachment to HBGAs is mostly mediated at the apical interface of two P-domains forming the dimer (P-dimer) as illustrated in Figure 1. Decrypting the molecular mechanisms behind this attachment process is desirable to better understand virus entry into host cells. Several crystal structures of complexes of human norovirus P-dimers with HBGA ligands are now available [6–15], showing that for GII.4 type P-dimers l-fucose residues make the key binding interactions. Saturation transfer difference (STD) nuclear magnetic resonance (NMR) experiments monitoring binding of a variety of HBGA ligands to P-dimers and to VP1-only virus-like particles (VLPs) complement these data and show that l-fucose is the minimal binding element presented by HBGAs [16].
Structures of VP1 and position of glycan binding.
(a) Noroviruses are icosahedral shaped particles with a diameter of ca. 40 nm. The particles consist of 180 copies of the capsid protein VP1 forming 90 dimers (left: from yellow N-terminus to red C-terminus). Each VP1 consists of an N-terminal S-domain (yellow) and a C-terminal P-domain (red). The sketches are based on crystal structure data (PDB 1ihm). (b) Human norovirus P-dimers bind to HBGAs at the outermost tips at the interface. Shown here is the complex of a GII.4 Saga P-dimer with a blood group B trisaccharide (PDB 4 × 06, see also Figure 5 for the chemical formula).
(a) Noroviruses are icosahedral shaped particles with a diameter of ca. 40 nm. The particles consist of 180 copies of the capsid protein VP1 forming 90 dimers (left: from yellow N-terminus to red C-terminus). Each VP1 consists of an N-terminal S-domain (yellow) and a C-terminal P-domain (red). The sketches are based on crystal structure data (PDB 1ihm). (b) Human norovirus P-dimers bind to HBGAs at the outermost tips at the interface. Shown here is the complex of a GII.4 Saga P-dimer with a blood group B trisaccharide (PDB 4 × 06, see also Figure 5 for the chemical formula).
Several biophysical techniques exist to determine binding affinities of ligands binding to proteins. Glycan ligand binding is often characterized by weak affinities with dissociation constants KD in the high μM to low mM range [17,18]. Detection and characterization of weak binding is a challenge, and not all experimental techniques are equally suited. The interactions between glycan ligands and norovirus capsid proteins are a biologically relevant example for such weak interactions that can be enhanced by multivalent ligand presentation. A few studies have addressed this issue of multivalent presentation of HBGAs by, e.g. glycosphingolipids embedded in membranes [19–23]. Here, we focus on NMR and mass spectrometry (MS) methods to obtain quantitative binding data under weak binding conditions. The experimental techniques in question are native electrospray ionization mass spectrometry (ESI-MS), saturation transfer difference NMR spectroscopy (STD NMR), and chemical shift perturbation NMR experiments (CSP NMR).
Native ESI-MS is a highly effective methodology, whose gentle nature allows the study of noncovalent biomolecular interactions, such as protein–ligand complexes, under near-physiological conditions [24]. Implementing native MS for precise quantification of binding stoichiometry requires a tight control of experimental conditions, as to avoid distortion of mass spectra due to solution and gas-phase processes. For example, non-specific interactions between analytes (clustering) may occur in the electrospray because of free ligands within the same droplets as free or ligand-bound protein, which then dries down to the protein surface upon droplet evaporation. This mechanism is independent of protein size and occurs at increased ligand concentrations [25,26]. To circumvent this, Sun et al. [27] developed a direct method, which adds appropriate reference proteins. These proteins do not bind specifically to any components of the solution and serve to monitor non-specific clustering.
STD NMR experiments [28] allow to identify and characterize the binding of small ligand molecules, typically with a molecular mass well below 1 kDa, to large receptor molecules, usually proteins. Briefly, resonances of the receptor protein are saturated by a cascade of radio frequency pulses. This saturation is passed to ligands binding to the protein, causing attenuation of respective ligand NMR signals. Since a large excess of ligand over protein is used and because free ligands can ‘store’ saturation received during the bound state this saturation accumulates in free ligands. The corresponding signal attenuation is read out in difference spectra only displaying signals of binding ligands and discriminating binding from non-binding ligands. The relative size of a difference signal depends on the proton-environment of that ligand proton in the binding pocket, yielding ligand binding epitopes [29,30]. Furthermore, STD intensities measured as a function of ligand concentration allow the determination of dissociation constants [29,31]. The method is well suited to detect weak binding into the mM range of dissociation constants [32].
Protein-NMR experiments allow direct observation of the effects of ligand binding since chemical shifts of protein protons in the binding pocket are perturbed. These chemical shift perturbations (CSPs) upon binding [33] are recorded employing heteronuclear correlation experiments that provide fingerprints of the protein backbone. Chemical shifts of protein protons attached to backbone 15N nuclei serve as sensitive probes for changes in the chemical environment such as changes in pH, buffer composition, temperature, or ligand binding. Under conditions where pH, buffer composition, and temperature are kept constant this technique allows detection of binding of ligands with affinities ranging from tight binding with dissociation constants in the pM range to very weak binding in the higher mM range. Mapping of such CSPs on the protein directly yields binding site topologies. CSP titration experiments provide quantitative binding data, i.e. dissociation constants KD. Of note, the absence of CSPs proves the absence of binding interactions, which means the method is essentially free of false negatives. Because of the applicability under weak binding conditions, NMR experiments have been important for the identification and quantification of glycan–protein interactions [32,34,35]. An advantage of using NMR experiments is that the experiments can be performed under physiological conditions [32].
All three methods have been employed to determine affinities for HBGA binding to norovirus P-dimers or to VLPs [36–41]. In the following, we review the efforts that have been made to compile quantitative data for binding of HBGAs to primarily GII.4 P-dimers.
Binding affinities from native MS and from STD NMR titrations
Overview of reported affinity data
To quantitate binding of HBGAs to norovirus P-dimers native MS and STD NMR titrations have been used. The first study reporting quantitative dissociation constants KD was based on STD NMR titrations [14] of P-dimers of the GII.10 Vietnam norovirus strain with synthetic HBGAs. KD values were derived for α-l-Fuc (α-l-fucose), β-l-Fuc (β-l-fucose), citrate, α-l-Fuc-(1,2)-d-Gal (H-disaccharide), and blood group A trisaccharide, as well as for a trisaccharide β-l-Fuc-(1,2)-β-d-Gal-(1,4)-d-GlcNAc. It must be noted that this trisaccharide is not H-type 2 trisaccharide. H-type 2 trisaccharide carries an α-l-fucose instead of a β-l-fucose leading to a completely different molecular shape. The dissociation constants KD were given as follows: 390 μM (β-l-Fuc-(1,2)-β-d-Gal-(1,4)-d-GlcNAc), 420 μM (H-disaccharide), 460 μM (citrate), and 460 μM (l-fucose).
In another study, affinities of a variety of HBGA ligands were measured [41] using a direct native MS approach [27]. In the original report, the affinities have been reported as association constants KA. For the sake of consistency and ease of comparability, we convert these values into dissociation constants KD. For H-disaccharide (α-l-Fuc-(1,2)-d-Gal) dissociation constants KD of 2.1 and 1.5 mM were determined for two slightly different P-dimer constructs of the GII.4 VA387 norovirus strain, being about a factor of five higher than determined by STD NMR titrations [14]. Since the HBGA binding sites of GII.10 Vietnam and of GII.4 VA387 are almost identical, one would expect more similar dissociation constants. The MS study [41] also reports that neither 3'-sialyllactose nor 6'-sialyllactose bind to P-dimers, in line with prior findings from STD NMR experiments [16].
A successive MS study addressed the binding of sialylated glycans as found in gangliosides to P-dimers of the GII.4 VA387 and VA115 strains [42]. Dissociation constants KD for a variety of sialylated glycans as found in gangliosides were reported to be in the μM range. For 3'-sialyllactose (GM3 gangliosides) a dissociation constant KD of 0.7 mM is reported. Binding of 3'-sialyllactose to P-dimers of a different strain, GII.4 MI001, was reported from our laboratories using a combination of native MS, STD NMR experiments, and surface plasmon resonance (SPR) [36], yielding dissociation constants for 3'-sialyllactose in the low mM range, in agreement with the previous results.
Another study into HBGA binding to human norovirus GII.4 Saga P-dimers from our laboratories [38] used a combination of NMR spectroscopy, native MS, and crystallography. It was observed that binding isotherms from STD NMR titrations exhibit steps. A crystal structure of GII.10 Vietnam P-dimers in complex with l-fucose showed four instead of the expected two bound fucose residues, and concentration-dependent co-crystallization reflected sequential occupation of the fucose binding pockets [13]. From native MS it was concluded that the GII.4 Saga P-dimers also exhibit four fucose binding pockets. Taken together, these observations led to the hypothesis, that binding of l-fucose is a stepwise process with negative cooperativity. NMR-based chemical shift titrations using uniformly 2H,15N-labeled Saga P-dimers at first sight revealed no steps of corresponding binding isotherms, but denoising the data using singular value decomposition (SVD) revealed different components that matched the hypothesis of a stepwise process. At the time an assignment for the backbone resonances was not available. Step-shaped binding isotherms from STD NMR titrations were also observed for related systems [36,37].
Inspection of the data published revealed discrepancies that were addressed in follow up studies in our laboratories [43–45]. It turned out that the experimental approaches employed by us, and others contain hidden traps that are easy to fall into.
First, it turned out that the data leading to our hypothesis of cooperative binding were only seemingly consistent due to two main causes. For one, it turned out that native MS results were biased by secondary structure-induced glycan clustering [43]. On the other hand, spontaneous deamidation of a highly conserved asparagine residue (Asn373) and its quantitative conversion into an iso-aspartate residue over time led to mixtures of different protein species with different HBGA affinities. This resulted in extra components in the SVD analysis that falsely reflected consistency with the steps observed in STD NMR titrations [45]. Finally, the full backbone assignment had been instrumental for the identification of spontaneous Asn373 deamidation [45] that largely abolishes HBGA binding and obstructs interpretation of CSP titration data. Second, we showed that sialic acid containing glycans such as ganglioside head groups do not bind to human or murine norovirus P-dimers [44]. The reason for this again lies in the secondary structure-induced glycan clustering [43]. A third point concerns the dissociation constants reported so far by us, and others. Some of the data reported differed by more than one order of magnitude. For the STD NMR based data the ‘hidden trap’ is the occurrence of steps in the respective titration curves, and for MS-based data the ‘hidden trap’ again lies in structure-induced glycan clustering. In the following, we address the methodological limitations in more detail and illustrate some of the cases.
Protein secondary structure affects glycan clustering in native MS
Several groups applied native ESI-MS to P-dimer–glycan interactions [38,41]. Results varied remarkably, calling into question whether the selection of reference proteins affects data interpretation. Our initial experiments utilized a small reference protein, as we could not detect any binding with a mid-sized protein, while others reported µM affinities, and attributed this to residual glycan-binding affinity of the tested protein. Since clustering is supposedly independent of size, the choice of the reference protein ought not to affect the results if the conditions are sufficiently gentle. Notably, the direct MS approach was originally developed for high-affinity interactions, where the clustering effect is much lower, which may have masked a size influence of the reference proteins. Therefore, glycan clustering was examined again with different reference proteins. Small proteins such as myoglobin, ubiquitin, and cytochrome c did indeed exhibit less glycan clustering when compared with alcohol dehydrogenase (ADH) [43] (Figure 2). Interestingly, the KD from STD-NMR could be reproduced using ADH as reference protein. However, the signals of the remaining specifically bound glycans on the P-dimer fall within the range of the standard deviation, i.e. they are no longer significant. Closer inspection revealed that not only size affected clustering, but also β-sheet content, which increases with size. α-helices on the other hand did not have significant impact on this. We hypothesized that the hydroxyl-group rich glycans bond in net-like structures with the β-sheets of the reference proteins. β-sheets are labile during ESI [46], which could facilitate intercalation with glycans. This effect has such a strong impact that non-binding proteins with a high number of β-sheets exhibit more or similar glycan ‘interactions’ than the actual P-dimer. Additionally, protein flexibility turned out to be an important factor as well. The highly flexible but low affinity deamidated P-dimer showed similar or increased glycan interaction compared with the wild-type protein. So even a reference protein with the same number of β-sheets could prove difficult for low-affinity glycan binding and should thus be carefully selected to avoid errors [43,47]. Therefore, protein–glycan interactions are best compared with a proper glycan negative control, which has proven impossible to find except for glycan mimetics [48]. It can be assumed that the backbone of the glycan mimetics, which is closer to peptide structures, limits the intercalation effect leading to high clustering. This appears plausible as clustering is much lower for peptides [49].
Comparison of clustering on various reference proteins with P-dimers.
Clustering in native MS with 500 µM HBGA B type 1 tetrasaccharide (HBGA B), all spectra contain P-dimer and different reference proteins. From top to bottom: cytochrome c (cyt c), green fluorescent protein (GFP), carbonic anhydrase (CA), alcohol dehydrogenase (ADH). It is evident that the clustering under otherwise identical conditions is vastly different for the four proteins. Figure reproduced under creative commons attributions license from [43].
Clustering in native MS with 500 µM HBGA B type 1 tetrasaccharide (HBGA B), all spectra contain P-dimer and different reference proteins. From top to bottom: cytochrome c (cyt c), green fluorescent protein (GFP), carbonic anhydrase (CA), alcohol dehydrogenase (ADH). It is evident that the clustering under otherwise identical conditions is vastly different for the four proteins. Figure reproduced under creative commons attributions license from [43].
Saturation transfer difference NMR experiments are sensitive to bulk effects at high millimolar ligand concentrations
STD NMR experiments have been used in the past to analyze the binding of ligands with dissociation constants in the μM to mM range. In the case of equal and independent binding pockets, i.e. a single-site binding model one expects normal Langmuir binding isotherms. In the case of binding of HBGAs to norovirus P-dimers and VLPs the binding isotherms exhibit discontinuities [36–38]. One explanation would be the cooperativity of the binding process, but this has been ruled out by follow up experiments [45]. Likely, the discontinuous shape of the binding isotherms is due to bulk effects that occur at high mM ligand concentrations required for saturation of the low-affinity binding pockets. This is supported by the observation that the shape of the binding isotherms depends on pH and buffer (see Figure S2 of [38]). Although there is no exact explanation at hand yet, ignoring the discontinuities and performing an analysis based on a single-site binding model yields similar dissociation constants as derived from CSP titrations [45].
With this knowledge it is now also plausible why the first reported dissociation constants based on STD NMR titrations [14] were almost two orders of magnitude below the values obtained later from CSP titrations [44,45]: The first step of the binding isotherm has simply been taken as the endpoint of the titration yielding dissociation constants that match the corresponding KD1 values reported for the first step of the binding isotherm [38].
It is well established that STD NMR intensities strongly depend on the dissociation rate constants of a ligand–protein complex [29,50]. Therefore, the observed effects may reflect subtleties of the binding process not accessible by other methods. Although the steps impeded the analysis of dissociation constants it may be worthwhile to study such effects more systematically.
Chemical shift perturbation NMR experiments redefine the glycan recognition code for noroviruses
Complementary to the above discussed STD NMR and other ligand-observed techniques protein NMR experiments allow direct observation of ligand binding by recording CSPs upon binding [33]. A bottleneck for employing CSP measurements is the availability of stable-isotope labeled protein together with the requirement of an assignment of all protein signals as far as possible. For small proteins with a molecular mass below ca. 25 kDa uniform labeling with 15N is sufficient to record two-dimensional backbone NH fingerprints, i.e. 1H,15N HSQC spectra. For large proteins such as the norovirus P-dimers with a molecular mass of ca. 70 kDa the experimental effort to obtain assignments of NMR resonances still is a challenge [51,52], and recording backbone NH fingerprints requires additional labeling of the protein with deuterium (2H). Once this hurdle has been taken NH fingerprints are straightforward to acquire, in this case by recording 1H,15N TROSY HSQC spectra [53]. With such a full assignment in hands, we were able to study HBGA binding to norovirus P-dimers by ‘NMR fingerprinting’, i.e. by observing CSPs of backbone NH upon addition of ligand (Figure 3), yielding the topology of the binding pocket as well as long-range effects of binding [45].
Example for a TROSY HSQC ‘fingerprint’.
(a) 1H,15N TROSY HSQC spectra of [U-2H,15N]-labeled GII.4 Saga P-dimers in the absence and presence of B-trisaccharide. The insert is highlighting CSPs due to binding of B-trisaccharide to P-dimers. (b) HBGA binding pocket of GII.4 Saga P-dimers (pdb 4 × 06) highlighting the close proximity between the l-fucose residue and R345 causing the strong CSP shown in (a). Figure reproduced [44] under creative commons attributions license.
(a) 1H,15N TROSY HSQC spectra of [U-2H,15N]-labeled GII.4 Saga P-dimers in the absence and presence of B-trisaccharide. The insert is highlighting CSPs due to binding of B-trisaccharide to P-dimers. (b) HBGA binding pocket of GII.4 Saga P-dimers (pdb 4 × 06) highlighting the close proximity between the l-fucose residue and R345 causing the strong CSP shown in (a). Figure reproduced [44] under creative commons attributions license.
Reproducible dissociation constants are obtained from backbone chemical shift perturbations
The availability of an almost complete backbone assignment of norovirus GII.4 Saga P-dimers and transfer of the assignment of binding-relevant amino acid resonances to highly homologous P-dimers of other norovirus strains allowed us to scrutinize the mentioned inconsistencies [44,45]. The remarkable structural similarity of the HBGA binding sites in different GII norovirus P-dimers is illustrated in Figure 4 by the superposition of different crystal structures.
The glycan binding pocket in GII noroviruses.
Superposition of HBGA binding pockets of GII.10 Vietnam (orange, pdb 3q38) GII.4 VA387 (gray, pdb 2obt) and GII.4 Saga P-dimers (blue, pdb 4 × 06) in complex with blood group B trisaccharide. Highly conserved amino acids in contact with l-fucose, the minimal binding motif, are highlighted (GII.4 Saga numbering).
Superposition of HBGA binding pockets of GII.10 Vietnam (orange, pdb 3q38) GII.4 VA387 (gray, pdb 2obt) and GII.4 Saga P-dimers (blue, pdb 4 × 06) in complex with blood group B trisaccharide. Highly conserved amino acids in contact with l-fucose, the minimal binding motif, are highlighted (GII.4 Saga numbering).
Binding isotherms from CSP titrations yielded dissociation constants for GII.4 Saga P-dimers up to more than one order of magnitude higher than reported in previous studies from our and other laboratories. Dissociation constants KD from CSP titrations for several synthetic HBGAs are compiled in Figure 5. For instance, H-disaccharide is a very weak binder with a dissociation constant of 30 mM [44], which is even higher than the value of 22 mM found for l-fucose [45]. Compared with the initially reported value of 420 μM from STD NMR titrations [14] this is a factor of 71, and compared with the value from native MS of 1.5 mM [41] it still is a factor of 20. A noticeable increase in affinity is seen for the transition to the B-trisaccharide, in accordance with water-mediated interactions of the α-(1,3)-linked Gal residue with the HBGA binding pocket [12]. The addition of an β-(1,3)-linked GlcNAc residue at the reducing end of the B-trisaccharide yields the blood group B-type-1 tetrasaccharide and leads to a decrease in affinity.
Glycan structures and dissociation constants.
Synthetic HBGAs and KDs for binding to GII.4 Saga and GII.4 MI001 P-dimers [44,45]. The abbreviations (N,N), etc. refer to the deamidation status of Asn373. (N,N) stands for not deamidated, and (iD,iD) stands for fully deamidated, i.e. both Asn373 residues have been transformed into isoAsp residues.
Synthetic HBGAs and KDs for binding to GII.4 Saga and GII.4 MI001 P-dimers [44,45]. The abbreviations (N,N), etc. refer to the deamidation status of Asn373. (N,N) stands for not deamidated, and (iD,iD) stands for fully deamidated, i.e. both Asn373 residues have been transformed into isoAsp residues.
Norovirus P-domains do not bind to sialylated glycans
To our surprise, sialic acid bearing glycans such as gangliosides do not bind to human norovirus GII.4 P-dimers [44], including the initially studied VA387 strain [42] and the MI001 strain studied in our laboratory [36]. Moreover, extension of our studies into murine noroviruses, where binding of viruses to sialylated glycans via P-dimers has been postulated based on biological assays, unequivocally shows that there is no binding of MNV P-dimers to sialic acid bearing glycans, such as gangliosides [54–56]. We have put considerable effort into systematically varying the experimental conditions such as buffer, pH, and temperature to exclude that these are false-negative results. At the same time, we also showed that HBGAs do not bind to murine norovirus P-dimers. Taken together a revised picture of glycan recognition by norovirus P-dimers has emerged. This is highlighted by Table 1, contrasting the results from previous and current research.
P-dimer . | Binding to HBGAs . | Binding to sialylated glycans . | ||
---|---|---|---|---|
Previous . | Current . | Previous . | Current . | |
Human norovirus | KD in the range of ca. 0.5–5 mM | KD in the range of ca. 2–30 mM | KD in the range of ca. 0.5–5 mM | No binding |
Murine norovirus | No binding | No binding | Indirect evidence of binding from infection assays | No binding |
P-dimer . | Binding to HBGAs . | Binding to sialylated glycans . | ||
---|---|---|---|---|
Previous . | Current . | Previous . | Current . | |
Human norovirus | KD in the range of ca. 0.5–5 mM | KD in the range of ca. 2–30 mM | KD in the range of ca. 0.5–5 mM | No binding |
Murine norovirus | No binding | No binding | Indirect evidence of binding from infection assays | No binding |
Linking glycan binding, protein dynamics and quaternary structure
Native mass spectrometry reports directly on oligomeric state
Native MS has proven to be an apt methodology to investigate the relationship between the monomer and its twin-relative when affinities are in the low µM range, e.g. for antibodies and half antibodies [57]. Interestingly, the ratio of P-monomer to P-dimer seems highly dependent on several different factors such as strain and deamidation [47]. Deamidation appears to increase the monomer fraction. Notably, strain dependencies also occur on the level of noroVLPs out of the VP1 protein, both in terms of stability and assembly state, which may point to additional mechanisms for host cell attachment or entry [58,59].
Therefore, it makes sense that strain-dependencies should be regarded as a major influence on any attributes of noroviruses. Still, the impact of deamidation on dimerization is convincing and well supported by the increase in overall protein flexibility that can be observed in hydrogen–deuterium-exchange (HDX) MS, on which a closer look will be taken next.
HDX MS yields insights into protein dynamics
HDX MS is based on, as the name suggests, switching out hydrogen for deuterium atoms. Hydrogens of OH, NH, and SH groups are labile and naturally exchanging with other hydrogen atoms in the surrounding solution. For HDX MS, these are replaced with an isotope of hydrogen: deuterium. The exchange with the heavier isotope results in a net mass gain of ∼1, which can be observed using MS. Usually, the exchange is measured on the peptide level to gather local structural information. Since side-chain hydrogens exchange extremely fast, only the backbone amide hydrogen-deuterium exchange is generally probed. This process obviously applies only for solvent-accessible hydrogens. The ratio of exchange is influenced by several factors such as pH, temperature, and the local environment of a given peptide [24,60]. Other relevant properties are conformational dynamics and flexibility of protein regions, as this may increase solvent-accessibility and thus accelerate deuterium exchange [61]. This of course includes ligand interaction, which makes HDX MS a most suitable tool to examine protein–glycan interaction.
Initial HDX experiments showed that the canonical binding site (G443, Y444) of the wildtype P-dimer is protected in the presence of either B trisaccharide or methyl α-l-fucopyranoside, but not when d-galactose is applied. Interestingly, protection like this could not be observed in fully deamidated P-dimers. As it turns out, the entire P2 domain, the structure most proximal to the binding site, is much more flexible than in the wildtype [45]. This also fits the aforementioned MS data on increased amounts of monomer in deamidated species. Notably, the higher exchange and flexibility in deamidated P-dimers cannot only be attributed to the presence of P-monomers, i.e. there is additional increased dynamics. As written before, strain-dependent differences may prove to be important. Thus, the structures of different GII strains were compared, with the result that all share the canonical binding site motif with little or no long-range effects, except GII. 10 Vietnam 026 (Figure 6). For this specific strain, glycan interaction induced folding in a loop proximal to the binding site. This long-range effect is supported by MD simulations [47].
HDX MS and MD on GII.10 Vietnam P-dimer in presence of glycans.
Top: Regions showing protection in presence of HBGA B trisaccharide or fucose in blue revealing protection also in regions distal from the binding site suggesting long-range effects. Bottom: overlay of crystal structures for various P-dimers in presence of glycans showing an extended loop in GII.10 Vietnam 026 (a). Whereas the loop is folded into a short helix in presence of glycans, it unfolds and flops around in absence of glycans as revealed by MD simulations (b). Figures reproduced under creative commons license attribution from [47].
Top: Regions showing protection in presence of HBGA B trisaccharide or fucose in blue revealing protection also in regions distal from the binding site suggesting long-range effects. Bottom: overlay of crystal structures for various P-dimers in presence of glycans showing an extended loop in GII.10 Vietnam 026 (a). Whereas the loop is folded into a short helix in presence of glycans, it unfolds and flops around in absence of glycans as revealed by MD simulations (b). Figures reproduced under creative commons license attribution from [47].
Deamidation in many GII.4 strains may point towards some important mechanism in infection. Under physiological conditions, deamidation usually has a half-life of 1.6 days, which implies mixed species are to be expected [45]. Partially deamidated dimers do still interact with glycans, which results overall in a more deprotective effect, thus exposing more residues to solvent [47]. Glycan interaction may, therefore, be a mechanism to open these partially deamidated P-dimers to access additional receptor bindings sites.
Ion mobility MS and glycan binding
Ion mobility mass spectrometry (IMMS) is another way to implement MS technology in protein–ligand interaction research. Here, the travel time of ionized proteins through an ion mobility cell are analyzed to monitor large conformational changes. Initially, the impact glycan-binding has on P-dimer structure was reported to be relatively high with an over 10% increase in arrival time [39]. Compared with the above-described data collected from HDX, NMR, and crystallography, which indicate more subtle changes, the reported effects appear too large. Unfolding processes may be the cause of this, though this remains unclear. Therefore, we revisited these measurements. IMMS is suitable for large changes in protein structure, therefore, arrival time differences above 3–5% are commonly recognized as significant. Here, changes of about 1.6% were determined for glycan-bound and non-bound GII. 4 Saga dimers, as expected for the addition of a small molecule. Notably, this was independent of glycan selection [43]. These experiments imply that IMMS is not a suitable approach to decipher the subtle interplay at work here.
Conclusion
This review presents and compares previous and current research into the binding of norovirus capsid proteins to glycan ligands, employing MS, and NMR methodology. It is explained how previous studies have suffered from methodological limitations, not known at the time, and it is then described how combinations of the highly complementary MS and NMR technologies provide a robust toolset to unambiguously identify and characterize the weak binding of glycan ligands to receptor proteins in general.
Perspectives
Detection and characterization of low-affinity protein–ligand interactions are a challenge and require careful control experiments. The combination of NMR and MS techniques is well suited to identify potential traps.
The glycan recognition code for human and murine norovirus P-dimers has been revised: Only human but not murine norovirus P-dimers bind to HBGAs, and sialic acid bearing glycans are neither recognized by human nor by murine norovirus P-dimers.
It is conceivable that glycans and other ligands are modulators that enable further virus–host interactions. This ought to be considered planning future experiments to identify additional interaction partners.
Competing Interests
The authors declare that there are no competing interests associated with the manuscript.
Funding
T.P., S.T., and C.U. thank the Deutsche Forschungsgemeinschaft (DFG) for grants Pe494/12–1, TA1093-2, and UE 183/1–1, respectively (all three FOR2327, ViroCarb). T.P. thanks the State of Schleswig-Holstein for supplying the NMR infrastructure (European Funds for Regional Development, LPW-E/1.1.2/857). The Leibniz Institute for Experimental Virology is supported by the Free and Hanseatic City Hamburg and the Federal Ministry of Health (Bundesministerium für Gesundheit, BMG). C.U. and L.T. acknowledge funding from EU Horizon 2020 project VIRUSCAN 731868.
Author Contributions
Manuscript conceptualization and writing: T.P. and C.U.; Manuscript comments and editing: T.P., R.C., T.M., A.M., P.O., S.T., L.T., and C.U.
Abbreviations
- ADH
alcohol dehydrogenase
- CSP NMR
chemical shift perturbation NMR experiments
- CSPs
chemical shift perturbations
- ESI-MS
electrospray ionization mass spectrometry
- HBGAs
histo blood group antigens
- HBGAs
histo blood group antigens
- HDX
hydrogen-deuterium-exchange
- IMMS
ion mobility mass spectrometry
- MS
mass spectrometry
- NMR
nuclear magnetic resonance
- STD NMR
saturation transfer difference NMR spectroscopy
- STD
saturation transfer difference
- SVD
singular value decomposition
- VLPs
virus-like particles