Meng-Qiu Dong

Find an error

Name:
Organization: National Institute of Biological Sciences
Department: Key Lab of Intelligent Information Processing
Title:
Co-reporter:Hao Chi;Haifeng Chen;Kun He;Long Wu;Bing Yang;Rui-Xiang Sun;Jianyun Liu;Wen-Feng Zeng;Chun-Qing Song;Si-Min He
Journal of Proteome Research February 1, 2013 Volume 12(Issue 2) pp:615-625
Publication Date(Web):2017-2-22
DOI:10.1021/pr3006843
De novo peptide sequencing is the only tool for extracting peptide sequences directly from tandem mass spectrometry (MS) data without any protein database. However, neither the accuracy nor the efficiency of de novo sequencing has been satisfactory, mainly due to incomplete fragmentation information in experimental spectra. Recent advancement in MS technology has enabled acquisition of higher energy collisional dissociation (HCD) and electron transfer dissociation (ETD) spectra of the same precursor. These spectra contain complementary fragmentation information and can be collected with high resolution and high mass accuracy. Taking these advantages, we have developed a new algorithm called pNovo+, which greatly improves the accuracy and speed of de novo sequencing. On tryptic peptides, 86% of the topmost candidate sequences deduced by pNovo+ from HCD + ETD spectral pairs matched the database search results, and the success rate reached 95% if the top three candidates were included, which was much higher than using only HCD (87%) or only ETD spectra (57%). On Asp-N, Glu-C, or Elastase digested peptides, 69–87% of the HCD + ETD spectral pairs were correctly identified by pNovo+ among the topmost candidates, or 84–95% among the top three. On average, it takes pNovo+ only 0.018 s to extract the sequence from a spectrum or spectral pair on a common personal computer. This is more than three times as fast as other de novo sequencing programs. The increase of speed is mainly due to pDAG, a component algorithm of pNovo+. pDAG finds the k longest paths in a directed acyclic graph without the antisymmetry restriction. We have verified that the antisymmetry restriction is unnecessary for high resolution, high mass accuracy data. The extensive use of HCD and ETD spectral information and the pDAG algorithm make pNovo+ an excellent de novo sequencing tool.Keywords: antisymmetry restriction; de novo peptide sequencing; ETD; HCD; k longest paths; tandem mass spectrometry;
Co-reporter:Meng-Qiu Dong;Rui-Xiang Sun;Hao Chi;Chun-Qing Song;Zhi-Yi Jing;Li Tao;Yan Fu;Bing Yang;Si-Min He;Le-Heng Wang;Li-Yun Xiu;Chao Liu
Journal of Proteome Research December 3, 2010 Volume 9(Issue 12) pp:6354-6367
Publication Date(Web):Publication Date (Web): September 30, 2010
DOI:10.1021/pr100648r
In recent years, electron transfer dissociation (ETD) has enjoyed widespread applications from sequencing of peptides with or without post-translational modifications to top-down analysis of intact proteins. However, peptide identification rates from ETD spectra compare poorly with those from collision induced dissociation (CID) spectra, especially for doubly charged precursors. This is in part due to an insufficient understanding of the characteristics of ETD and consequently a failure of database search engines to make use of the rich information contained in the ETD spectra. In this study, we statistically characterized ETD fragmentation patterns from a collection of 461 440 spectra and subsequently implemented our findings into pFind, a database search engine developed earlier for CID data. From ETD spectra of doubly charged precursors, pFind 2.1 identified 63−122% more unique peptides than Mascot 2.2 under the same 1% false discovery rate. For higher charged peptides as well as phosphopeptides, pFind 2.1 also consistently obtained more identifications. Of the features built into pFind 2.1, the following two greatly enhanced its performance: (1) refined automatic detection and removal of high-intensity peaks belonging to the precursor, charge-reduced precursor, or related neutral loss species, whose presence often set spectral matching askew; (2) a thorough consideration of hydrogen-rearranged fragment ions such as z + H and c − H for peptide precursors of different charge states. Our study has revealed that different charge states of precursors result in different hydrogen rearrangement patterns. For a fragment ion, its propensity of gaining or losing a hydrogen depends on (1) the ion type (c or z) and (2) the size of the fragment relative to the precursor, and both dependencies are affected by (3) the charge state of the precursor. In addition, we discovered ETD characteristics that are unique for certain types of amino acids (AAs), such as a prominent neutral loss of SCH2CONH2 (90.0014 Da) from z ions with a carbamidomethylated cysteine at the N-terminus and a neutral loss of histidine side chain C4N2H5 (81.0453 Da) from precursor ions containing histidine. The comprehensive list of ETD characteristics summarized in this paper should be valuable for automated database search, de novo peptide sequencing, and manual spectral validation.Keywords: ETD; hydrogen rearrangement; peptide sequencing; search engine;
Co-reporter:Yue-He Ding, Sheng-Bo Fan, Shuang Li, Bo-Ya Feng, Ning Gao, Keqiong Ye, Si-Min He, and Meng-Qiu Dong
Analytical Chemistry 2016 Volume 88(Issue 8) pp:4461
Publication Date(Web):March 24, 2016
DOI:10.1021/acs.analchem.6b00281
Chemical cross-linking of proteins coupled with mass spectrometry (CXMS) is a powerful tool to study protein folding and to map the interfaces between interacting proteins. The most commonly used cross-linkers in CXMS are BS3 and DSS, which have similar structures and generate the same linkages between pairs of lysine residues in spatial proximity. However, there are cases where no cross-linkable lysine pairs are present at certain regions of a protein or at the interface of two interacting proteins. In order to find the cross-linkers that can best complement the performance of BS3 and DSS, we tested seven additional cross-linkers that either have different spacer arm structures or that target different amino acids (BS2G, EGS, AMAS, GMBS, Sulfo-GMBS, EDC, and TFCS). Using BSA, aldolase, the yeast H/ACA protein complex, and E. coli 70S ribosomes, we showed that, in terms of providing structural information not obtained through the use of BS3 and DSS, EGS and Sulfo-GMBS worked better than the other cross-linkers that we tested. EGS generated a large number of cross-links not seen with the other amine-specific cross-linkers, possibly due to its hydrophilic spacer arm. We demonstrate that incorporating the cross-links contributed by the EGS and amine-sulfhydryl cross-linkers greatly increased the accuracy of Rosetta in docking the structure of the yeast H/ACA protein complex. Given the improved depth of useful information it can provide, we suggest that the multilinker CXMS approach should be used routinely when the amount of a sample permits.
Co-reporter:Zhou Gong;Yue-He Ding;Xu Dong;Na Liu;E. Erquan Zhang
Biophysics Reports 2015 Volume 1( Issue 3) pp:127-138
Publication Date(Web):2015 December
DOI:10.1007/s41048-015-0015-y
Chemical cross-linking coupled with mass spectrometry (CXMS) identifies protein residues that are close in space, and has been increasingly used for modeling the structures of protein complexes. Here we show that a single structure is usually sufficient to account for the intermolecular cross-links identified for a stable complex with sub-µmol/L binding affinity. In contrast, we show that the distance between two cross-linked residues in the different subunits of a transient or fleeting complex may exceed the maximum length of the cross-linker used, and the cross-links cannot be fully accounted for with a unique complex structure. We further show that the seemingly incompatible cross-links identified with high confidence arise from alternative modes of protein-protein interactions. By converting the intermolecular cross-links to ambiguous distance restraints, we established a rigid-body simulated annealing refinement protocol to seek the minimum set of conformers collectively satisfying the CXMS data. Hence we demonstrate that CXMS allows the depiction of the ensemble structures of protein complexes and elucidates the interaction dynamics for transient and fleeting complexes.
Co-reporter:Chao Liu, Chun-Qing Song, Zuo-Fei Yuan, Yan Fu, Hao Chi, Le-Heng Wang, Sheng-Bo Fan, Kun Zhang, Wen-Feng Zeng, Si-Min He, Meng-Qiu Dong, and Rui-Xiang Sun
Analytical Chemistry 2014 Volume 86(Issue 11) pp:5286
Publication Date(Web):May 5, 2014
DOI:10.1021/ac404246w
In relative protein abundance determination from peptide intensities recorded in full mass scans, a major complication that affects quantitation accuracy is signal interference from coeluting ions of similar m/z values. Here, we present pQuant, a quantitation software tool that solves this problem. pQuant detects interference signals, identifies for each peptide a pair of least interfered isotopic chromatograms: one for the light and one for the heavy isotope-labeled peptide. On the basis of these isotopic pairs, pQuant calculates the relative heavy/light peptide ratios along with their 99.75% confidence intervals (CIs). From the peptides ratios and their CIs, pQuant estimates the protein ratios and associated CIs by kernel density estimation. We tested pQuant, Census and MaxQuant on data sets obtained from mixtures (at varying mixing ratios from 10:1 to 1:10) of light- and heavy-SILAC labeled HeLa cells or 14N- and 15N-labeled Escherichia coli cells. pQuant quantitated more peptides with better accuracy than Census and MaxQuant in all 14 data sets. On the SILAC data sets, the nonquantified “NaN” (not a number) ratios generated by Census, MaxQuant, and pQuant accounted for 2.5–10.7%, 1.8–2.7%, and 0.01–0.5% of all ratios, respectively. On the 14N/15N data sets, which cannot be quantified by MaxQuant, Census and pQuant produced 0.9–10.0% and 0.3−2.9% NaN ratios, respectively. Excluding these NaN results, the standard deviations of the numerical ratios calculated by Census or MaxQuant are 30–100% larger than those by pQuant. These results show that pQuant outperforms Census and MaxQuant in SILAC and 15N-based quantitation.
Co-reporter:Tie-Mei Li, Jie Chen, Xiangke Li, Xiao-Jun Ding, Yao Wu, Li-Feng Zhao, She Chen, Xiaoguang Lei, and Meng-Qiu Dong
Analytical Chemistry 2013 Volume 85(Issue 19) pp:9281
Publication Date(Web):September 6, 2013
DOI:10.1021/ac402025c
Under favorable conditions, Caenorhabditis elegans larvae grow into reproductive adults after a series of molting cycles. When environmental conditions are harsh, they arrest as dauer larvae. Dafachronic acid (DA), a C. elegans steroid hormone, is required for reproductive development. Here, we report a mass spectrometry (MS) method for absolute quantitation of DA in C. elegans. The extraction of DA from C. elegans was optimized to achieve a recovery rate of greater than 83%. The MS sensitivity to DA increased 100-fold after carboxyl group derivatization with 2-picolylamine. High-resolution selected ion monitoring (HR-SIM) on a Q-Orbitrap mass spectrometer Q Exactive outperformed targeted-MS2 on the same instrument and selected reaction monitoring (SRM) on a triple-quadrupole mass spectrometer TSQ Quantum Discovery. With a limit of quantification as low as 1 pg of DA, the HR-SIM method enables absolute quantification of endogenous DA during the reproductive development of C. elegans. We found that in wild-type (WT) worms, DA increases from 0.04 ± 0.02 ng/mg protein in the L1 larval stage to 1.21 ± 0.67 ng/mg protein in the L2 larval stage and decreases again after the L3 stage. In comparison, four genetic mutants that have a constitutive dauer-formation phenotype due to disrupted insulin, TGF-β, or cGMP signaling all have a very low DA level in the L2 stage (below 15% of the WT). These mutants are able to escape the dauer fate and most of them grow into fertile adults when supplied with exogenous DA. Therefore, a DA spike in the L2 stage is critical for the reproductive development of C. elegans.
Co-reporter:Yanmei Zhao;Wei Sun;Mei-Jun Zhang;Hao Chi;Fanxia Meng;Chun-Qing Song;Yunlong Shang;Andreas F. Hühmer;Xuan Ma;Steven W. L'Hernault;Youqiao Hu;Zhiqi Hao;Long Miao;Pan Zhang;Bin Wang;Si-Min He
PNAS 2012 Volume 109 (Issue 5 ) pp:1542-1547
Publication Date(Web):2012-01-31
DOI:10.1073/pnas.1109912109
Spermiogenesis is a series of poorly understood morphological, physiological and biochemical processes that occur during the transition of immotile spermatids into motile, fertilization-competent spermatozoa. Here, we identified a Serpin (serine protease inhibitor) family protein (As_SRP-1) that is secreted from spermatids during nematode Ascaris suum spermiogenesis (also called sperm activation) and we showed that As_SRP-1 has two major functions. First, As_SRP-1 functions in cis to support major sperm protein (MSP)-based cytoskeletal assembly in the spermatid that releases it, thereby facilitating sperm motility acquisition. Second, As_SRP-1 released from an activated sperm inhibits, in trans, the activation of surrounding spermatids by inhibiting vas deferens-derived As_TRY-5, a trypsin-like serine protease necessary for sperm activation. Because vesicular exocytosis is necessary to create fertilization-competent sperm in many animal species, components released during this process might be more important modulators of the physiology and behavior of surrounding sperm than was previously appreciated.
Co-reporter:Hao Chi, Rui-Xiang Sun, Bing Yang, Chun-Qing Song, Le-Heng Wang, Chao Liu, Yan Fu, Zuo-Fei Yuan, Hai-Peng Wang, Si-Min He and Meng-Qiu Dong
Journal of Proteome Research 2010 Volume 9(Issue 5) pp:2713-2724
Publication Date(Web):2017-2-22
DOI:10.1021/pr100182k
De novo peptide sequencing has improved remarkably in the past decade as a result of better instruments and computational algorithms. However, de novo sequencing can correctly interpret only ∼30% of high- and medium-quality spectra generated by collision-induced dissociation (CID), which is much less than database search. This is mainly due to incomplete fragmentation and overlap of different ion series in CID spectra. In this study, we show that higher-energy collisional dissociation (HCD) is of great help to de novo sequencing because it produces high mass accuracy tandem mass spectrometry (MS/MS) spectra without the low-mass cutoff associated with CID in ion trap instruments. Besides, abundant internal and immonium ions in the HCD spectra can help differentiate similar peptide sequences. Taking advantage of these characteristics, we developed an algorithm called pNovo for efficient de novo sequencing of peptides from HCD spectra. pNovo gave correct identifications to 80% or more of the HCD spectra identified by database search. The number of correct full-length peptides sequenced by pNovo is comparable with that obtained by database search. A distinct advantage of de novo sequencing is that deamidated peptides and peptides with amino acid mutations can be identified efficiently without extra cost in computation. In summary, implementation of the HCD characteristics makes pNovo an excellent tool for de novo peptide sequencing from HCD spectra.
3-Pyrrolidinesulfonic acid,1-[4-(2,5-dihydro-2,5-dioxo-1H-pyrrol-1-yl)-1-oxobutoxy]-2,5-dioxo-,sodium salt
3-Pyrrolidinesulfonic acid,1,1'-[(1,8-dioxo-1,8-octanediyl)bis(oxy)]- bis[2,5-dioxo-,disodium salt
(2,5-DIOXOPYRROLIDIN-1-YL) 6-[(2,2,2-TRIFLUOROACETYL)AMINO]HEXANOATE
EGS
2,5-Dioxopyrrolidin-1-yl 2-(2,5-dioxo-2,5-dihydro-1H-pyrrol-1-yl)acetate
2-dihydroxyphosphinoyloxyacrylic acid