Differentiation of Glycan Diversity with Serial Affinity Column Set ( SACS )

Targeted glycoproteomics is an effective way to discover disease-associated glycoproteins in proteomics and serial affinity chromatography (SAC) using lectin and glycan-targeting antibodies shows glycan diversity on the captured glycoproteins. This study suggests a way to determine glycan heterogeneity and structural analysis on the post-translationally modified proteins through serial affinity column set (SACS) using four Lycopersicon esculentum lectin (LEL) columns. The great advantage of this method is that it differentiates between glycoproteins on the basis of their binding affinity. Through this study, some proteins were identified to have glycoforms with different affinity on a single glycoprotein. It will be particularly useful in determining biomarkers in which the disease-specific feature is a unique glycan, or a group of glycans.


Introduction
2][3] Mass spectrometry (MS) is a significant methodology for proteomics in harmony with a wide variety of separation methods.Tandem mass spectrometry (MS/MS) is a major tool in protein identification.][10] The application of mass spectrometry and MS/MS to proteomics takes advantage of the huge amount of genomic and proteomic data stored in databases. 11roteome complexity usually exceeds the analytical capacity of instrumentation.One solution to this problem is to use higher resolution mass spectrometers, i.e. to "put everything in the mass spectrometer, analyze it, and let acquired data sorted out".However, this method is very costly and also ignores the fact that the difficulty of getting peptides into the gas phase increases as sample complexity increases due to different ionization efficiencies, which is known in that relatively non-polar peptides in mixtures suppress the ionization of polar peptides in electrospray ionization (ESI) mass spectrometry. 12In matrix assisted laser desorption ionization (MALDI) mass spectrometry, positively charged species suppress the ionization of neutral and negatively charged peptides. 13Ultra-high resolution instruments are useless if the requisite proteins and peptides do not enter the mass spectrometer.Another solution is that complex components should be fractionated with much higher resolution before entering the mass spectrometer. 14Even though this strategy would diminish matrix effects, it still needs ultra-sophisticated separation systems with huge numbers of theoretical plates and an accompanying long analysis time in sorting huge spectral data from 10 5 to 10 6 components to find a small number of relevant peptides. 15n both of these approaches, one needs to examine an entire proteome to answer biological questions, but this will not be necessary in many cases.Structure specific affinity selection can be used as a way to look at a portion of a particular proteome.This is a targeted strategy.A targeted strategy has been used in several different ways.One is to derivatize a specific functional group in proteins with an affinity selectable reagent that can be targeted with an affinity chromatography column.Avidin selection of proteins or peptides in which sulfhydryl or carbonyl groups have been derivatized with biotin is such an example. 16,17 biotin residue (attached through a linker) permits the selective capture of the cysteine-containing peptides out of the digest resulting in a significant reduction of the complexity of the sample through an avidin-biotin interaction. 18Another approach is to directly select proteins or peptides based on shared structural features; for instance, direct selection of histidine containing peptides with copper loaded immobilized metal affinity chromatography (Cu 2+ -IMAC) columns. 19Selection of phosphorylated peptides with either Fe 3+ -IMAC or Ga 3+ -IMAC columns is another example. 181][22][23][24][25][26] If combined together, there could be more than a dozen ways to target structural features in proteomics. 27[24]28,29 In case of glycosylation associated with diseases, targeting disease-associated glycans would be an effective way to select and identify disease-related glycoproteins in complex matrices such as plasma. 30Based on this idea, lectins and glycan-targeting antibodies have been used as tools in capturing glycosylated proteins. 20- 25,31-32This study shows a way to differentiate glycan diversity in the same protein in a serial affinity column set (SACS) using four Lycopersicon esculentum lectin (LEL) columns.

Serial Affinity Column Set (SACS)
Agarose-bound LEL sorbent was individually selfpacked in four 4.6 mm × 50 mm columns.Protein concentration in healthy human plasma was estimated using the Bradford assay to prepare samples with equal amounts of total proteins.For the SACS experiment, the four LEL columns were connected in series.Healthy human plasma was loaded directly onto serially connected soft-gel affinity columns with mobile phase A (0.10 M HEPES buffer, pH 7.5 containing 1 mM CaCl 2 and 1 mM MgCl 2 ) at a flow rate of 0.3 mL/min.Following extensive washing with mobile phase A to remove nonspecifically and weakly bound proteins, the serial LEL affinity columns were disassembled and affinity-selected proteins were eluted from each column individually with solution B (0.5 M acetic acid-HCl solution, pH 2.5).All the elution curves were obtained with an absorbance detector operating at 280 nm using 20 AD LC systems from Shimadzu Scientific Instruments, Inc. (Kyoto, Japan).

Saturation Test of Columns
7][28][29][30] The amount of injected sample was increased from 25 μg to 250 μg by 25 μg increments.The saturation amount of healthy human plasma samples was 200 μg of human plasma in the LEL column (data not shown).Therefore, the maximum injection amount was decided as 200 μg of plasma below the saturation amount in order to prevent overloading.

Proteolysis
Captured proteins were desalted using HLB Oasis SPE cartridges, dried with a centrivap concentrator after being adjusted to pH 7.5 with a 0.5 M NH 4 HCO 3 buffer, and were then reconstituted with 8 M urea in a 50 mM HEPES buffer containing 10 mM CaCl 2 .The denatured proteins were reduced with 10 mM DTT.After two hours of incubation at 50 o C, iodoacetic acid was added to a final concentration of 20 mM for alkylation and incubated in darkness for additional two hours.L-cysteine was then added to the reaction mixture to a final concentration of 40 mM, and the mixture was incubated for 30 minutes at room temperature.After dilution with a 50 mM HEPES buffer to a final urea concentration of 1 M, proteomics grade trypsin (2%, w/w, enzyme to protein) was added and incubated overnight at 37 o C. The proteolysis reaction was stopped by the addition of TLCK (trypsin/TLCK ratio of 1:1 (w/w)).The resulting peptide mixture was desalted with HLB Oasis SPE cartridges and dried into 50-100 μL with the centrivap concentrator after being adjusted to pH 7.5 with the 0.5 M NH 4 HCO 3 buffer for PNGase F digestion.

PNGase F Digestion
N-Linked glycopeptides in a digested peptide mixture were deglycosylated by treatment with PNGase F. A desalted tryptic peptides to adjust to pH 7.0-8.0.Five microliters (2500 U) of PNGase F was added to the pHadjusted tryptic peptides and then the mixture was incubated overnight at 37 o C. Following deglycosylation, samples were desalted using HLB Oasis SPE cartridges and concentrated using a C18 microspin column.The PNGase F-treated peptide mixtures were reconstituted in 0.1% formic acid solution and stored at -80 o C until analysis with an LTQ-Orbitrap instrument.

LC-MS/MS-Based Protein Identification
Proteins were identified with an LTQ-Orbitrap hybrid mass spectrometer (Thermo Fisher, Bremen, Germany).The peptide mixtures resulting from PNGase F treatment were separated on an Agilent 1100 HPLC system using a 75 μm × 120 mm C18 Reversed-phase Chromatography (RPC) column packed with 5 µm C18 Magic beads.RPC separations were achieved using a 60 min linear mobile phase gradient from 98% solvent A with 2% solvent B to 60% solvent A with 40% solvent B at a flow rate of 300 nL/min.Solvent A was composed of DI water to which formic acid had been added to a concentration of 0.1%.Solvent B was prepared with ACN to which formic acid had been added to a concentration of 0.1%.The electrospray ionization emitter tip was generated on the prepacked column with a laser puller (model P-2000, Sutter Instrument Co.).The HPLC system was coupled directly to the LTQ-Orbitrap hybrid mass spectrometer equipped with a nanoelectrospray ion source (Proxeon Biosystems, Odense, Denmark).The MS was operated in the data-dependent mode, in which a survey full scan MS spectrum (from m/z 300 to 1600) was acquired in the Orbitrap with a resolution of 60,000 at m/z 400.This was then followed by MS/MS scans of the three most abundant ions with +2 and +3 charge states.Target ions already selected for MS/MS were dynamically excluded for 180s.The resulting fragment ions were recorded in the linear ion trap.
Automated MS/MS data analysis was performed utilizing Protein Pilot software 5.0 using the Pro Group™ algorithm (SCIEX, Concord, Ontario, Canada) for protein identification.The minimum acceptance criterion for peptide identification was a 99% confidence level.Most of the proteins were identified based on the presence of at least two peptides from a protein identified by the Pro Group™ algorithm at the 99% confidence level.An unused score cutoff of 4 was the minimum value for identifying proteins with the Protein Pilot 5.0 Software.Proteins are listed according to their Swiss-Prot entry names and accession numbers.

Results and Discussion
Glycoproteomics enables comprehensive analysis of glycoproteins using proteomics tools.There are two approaches to recognize disease-associated aberrations of glycans in glycoproteomics.One is to do affinity selection of glycopeptides from trypsin digests of plasma.The selected peptides are identified by RPC-MS/MS after removal of the glycans from glycopeptides with Nglycosidase F (PNGase F).An attractive characteristic of this approach is that proteins and glycosylation sites are identified simultaneously. 33he other strategy is to carry out affinity selection of glycoproteins from plasma at the protein level.Then the selected glycoproteins are identified in two ways (Figure 1).At first, the glycoproteins are trypsin digested immediately and the produced peptides are analyzed directly by MALDI-MS/MS or by ESI-MS/MS after RPC for the identification of proteins.] Affinity selection of glycoproteins at the protein level has a great advantage.In general affinity selection of glycopeptides at peptide level requires deglycosylation for identification.Although that is relatively easy with N-glycosylation proteins, it is difficult to deglycosylate and identify O-glycosylated peptides.In contrast, in affinity selection at the protein level, affinity selected N-and O-glycosylated proteins are easily identified by their non-glycosylated peptides. 20,25The number of non-glycosylated peptides are usually about 50 times more than that of glycosylated peptides.
An interesting feature of glycoproteins is that it may contain multiple lectin-or antibody-targetable structures within an oligosaccharide at a single or several sites.When lectin or glycan targeting antibody affinity columns are connected in series, loaded, and then eluted separately, this method is referred to as serial affinity chromatography (SAC).It can determine whether individual glycan structures appear alone or together with other glycans in Figure 1.Two strategies of protein identification in protein level affinity selection.specific glycoproteins using different types of affinity columns in different orders under the protein level selection.It also elucidates that there is independent occurrence of different affinity targetable glycan features in the same glycoprotein, and multiple targetable glycan features are coresident in the same glycoprotein, i.e. the heterogeneity of the glycan structures at different glycosylation sites.SAC is able to assess whether serial lectin and glycan-targeting antibody affinity chromatography is of any value in assessing glycan occurrence and heterogeneity among glycoproteins in blood.The major advantage of this approach is that it can determine whether a targeted glycan appearing at one site in an affinity selected protein will be present with other glycans at other sites in the same protein. 20,25This was achieved using LEL, a lectin widely used in monitoring cancer.LEL binds specifically to β-(1 → 4)-linked galactosyl residues in poly-N-acetyllactosamine and GlcNAc residues in N-acetylglucosamine (GlcNAc) n , where n ranges from 1 to 4. The structure and number of residues in a glycan can play a role in affinity selection.For example, in the series (GlcNAc β 1−4 ) 1 , (GlcNAc β 1−4 ) 2 , (GlcNAc β 1−4 ) 3 , to (GlcNAc β 1−4 ) 4 , the binding affinity of LEL from tomato increases with the size and number of the N-acetylglucosamine oligomers bound to a glycan. 22This allows LEL to differentiate between glycoforms. 21n this study, four small identical LEL columns were coupled in series to form a single affinity chromatography system with the last column in the series connected to an absorbance detector.The serial affinity column set (SACS) 25 was then loaded with human plasma proteins.At the completion of loading, the column set was disassembled, the four columns were eluted individually, the captured proteins were trypsin digested, the peptides were deglycosylated with PNGase F, and the parent proteins were identified through mass spectral analyses.
Two proteins were found to be bound to LEL1 and also to LEL4, while other two proteins bound to LEL2 and LEL4 in Table 1.This is very important.Failure to bind sequentially suggests that proteins bound to (LEL1 and LEL4) or (LEL2 and LEL4) probably carry glycans that differ in their structure and LEL binding affinity.These proteins would lead to glycoproteins being bound nonsequentially in the LEL train.
In this way, SAC can determine glycan diversity of a protein; the types of glycan structures which the protein has.For example, there might be glycan diversity such as a single glycan (A, or B) or A-B containing protein isoforms such as: In peptide-affinity SAC, it is not possible to differentiate these three glycoprotein isoforms, because after digestion, the protein sequence is destroyed.As a result, even though both of AB and BA sequence of glycan targeting serial affinity selectors are used separately, there is no difference in peptide captures between two serial selectors.However, in protein-affinity SAC, when AB sequence of glycan targeting serial affinity selectors are used, the B targeting affinity selector cannot capture the following A-B glycan structure, because this A-B glycan structure has already been stripped off by the A targeting affinity selector.On the other hand, when BA sequence of glycan targeting serial affinity selector is used, the A targeting affinity selector cannot capture this A-B glycan structure because this A-B glycan structure has been already stripped off by the B targeting affinity selector.By comparing these two results, a type of protein heterogeneity can be elucidated by this SAC.

Conclusion
In conclusion, targeted proteomics is an effective way in proteomics to save time and effort enormously and SAC and SACS suggest ways to differentiate glycan diversity and structural analysis on post-translationally modified proteins using various affinity selectors.