HIGHER SEQUENCE COVERAGE OF PROTEINS WITH UPLC/MS<sup>E</sup>

MS^E data were acquired in positive ion mode for non-derivatized samples and positive and negative ion modes for derivatized samples with collision cell energy alternating between low energy (4 eV) to collect peptide precursor (MS) data, and elevated energy (ramping from 20 to 40 eV) to obtain peptide fragmentation (MS^E) data (standard MS^{E procedure).
Sampling of the lock spray channel (1 ng/μL leucine enkephalin
in 50:50 isopropyl alcohol/water containing 0.1% formic acid)
was performed every 1 min to ensure high mass accuracy.}

Data processing

The acquired data were processed by ProteinLynx Global Server software (PLGS; v. 3.0.1, Waters). Peak lists were generated after deisotoping and deconvolution. Separate databases containing sequence of each of the investigated protein were created and the data were searched with trypsin as a digestion reagent and three potential miscleavages. Peptide and fragment tolerance were set to automatic. Oxidation M and dehydratation ST were allowed as variable modifications in all protein data sets, while deamidation N was added for Epo and Trn data and phosphorylation for Csn data set.

For the derivatized peptides, N-term reagent modifier was created for 5-formylbenzene-1,3-disulfonic acid and used as fixed modification in workflow parameters.

Results and Discussion

By comparing the percentage of protein sequence coverage obtained using conventional method (non-derivatized + mode) with the percentage obtained using our method (derivatized +/- mode), we have found that our method provided higher sequence coverage for each of 12 analyzed proteins. The results are shown in Figure 4 and summarized in Table 1 . More detailed information about coverage of the analyzed proteins can be found in separate document (Appendix_2015-05-19).

Double Bracket: ü Up to 59% higher sequence coverage

Figure 4. Protein sequence coverage (%) calculated after PLGS data processing and database matching of non-derivatizated and derivatizated peptides acquired in MS^E analyses in positive and positive/ negative ion mode, respectively.

Our method provided sequence coverage that ranged from 88-100% (96% on average), in contrast to conventional method that ranged from 31-94% (80% on average). Derivatization method provided 16% higher sequence coverage, with the best result obtained for Csn sample where the difference in sequence coverage reached 59%.

Table 1. Protein sequence coverage (%) calculated after database matching of non-derivatizated and derivatizated peptides acquired in MS^E analyses in positive and positive/ negative ion modes, respectively.

Protein:

BSA

Trn

Epo

Lyz

Als

Csn

IleRS

CCA

TEV

LeuRS

EF-Tu

SerRS

Non-derivatized (+ mode)

87.81

93.84

53.37

87.07

71.98

30.80

93.60

90.53

87.60

92.09

86.04

85.63

Derivatized

(+/- mode) %

95.72

100

93.26

98.64

88.19

89.73

97.76

99.51

98.35

94.88

99.24

96.93

HIGHER SEQUENCE COVERAGE OF PROTEINS WITH UPLC/MSE

Introduction

Experimental

Sample Preparation

System and Method conditions

Data processing

Results and Discussion

Conclusions

References

APPENDIX