Co-reporter:Yang Chen;Yaxing Li;Jing Zhang;Zhipeng Chen;Xin Cao;Lijuan Yang;Gong Zhang;Qing-Yu He;Jiayong Zhong
Journal of Proteome Research September 4, 2015 Volume 14(Issue 9) pp:3693-3709
Publication Date(Web):2017-2-22
DOI:10.1021/pr501103r
Finding protein evidence (PE) for protein coding genes is a primary task of the Phase I Chromosome-Centric Human Proteome Project (C-HPP). Currently, there are 2948 PE level 2–4 coding genes per neXtProt, which are deemed missing proteins in the human proteome. As most samples prepared and analyzed in the C-HPP framework were focusing on detergent soluble proteins, we posit that as a natural composition the cytoplasmic detergent-insoluble proteins (DIPs) represent a source of finding missing proteins. We optimized a workflow and separated cytoplasmic DIPs from three human lung and three human hepatoma cell lines via differential speed centrifugation. We verified that the detergent-soluble proteins (DSPs) could be sufficiently depleted and the cytoplasmic DIP isolation was partially reproducible with Spearman r > 0.70 according to two independent SILAC MS experiments. Through label-free MS, we identified 4524 and 4156 DIPs from lung and liver cells, respectively. Among them, a total of 23 missing proteins (22 PE2 and 1 PE4) were identified by MS, and 18 of them had translation evidence; in addition, six PE5 proteins were identified by MS, three with translation evidence. We showed that cytoplasmic DIPs were not an enrichment of transmembrane proteins and were chromosome-, cell type-, and tissue-specific. Furthermore, we demonstrated that DIPs were distinct from DSPs in terms of structural and physical–chemical features. In conclusion, we have found 23 missing proteins and 6 PE5 proteins from the cytoplasmic insoluble proteome that is biologically and physical-chemically different from the soluble proteome, suggesting that cytoplasmic DIPs carry comprehensive and valuable information for finding PE of missing proteins. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the data set identifier PXD001694.Keywords: Chromosome-Centric Human Proteome Project; detergent-insoluble proteins; missing proteins; neXtProt;
Co-reporter:Wanling Zhang, Xing Chen, Ziqi Yan, Yang Chen, Yizhi Cui, Bingjun Chen, Chujun Huang, Weiwen Zhang, Xingfeng Yin, Qing-Yu He, Fang He, and Tong Wang
Journal of Proteome Research December 1, 2017 Volume 16(Issue 12) pp:4468-4468
Publication Date(Web):October 1, 2017
DOI:10.1021/acs.jproteome.7b00352
Preeclampsia (PE) is a placenta disease, featured by hypertension, proteinuria, and other multiorgan dysfunctions, and its etiology is unclear. We and others have shown that intensive endoplasmic reticulum (ER) stress and unfolded protein response (UPR) occur in the PE placenta. In this study, we isolated detergent-insoluble proteins (DIPs) from human placenta tissues, which were enriched with protein aggregates, to characterize the placenta UPR in PE. With data-independent acquisition (DIA) mass spectrometry, we identified 2066 DIPs across all normal (n = 10) and PE (n = 10) placenta samples, among which 110 and 108 DIPs were significantly up- and down-regulated in PE, respectively. Per clustering analysis, differential DIPs could generally distinguish PE from normal placentas. We verified the MS quantitation of endoglin and vimentin by immunoblotting. In addition, we observed that PE placenta tissues have remarkably more endoglin in the cytoplasm. Furthermore, we found that DIPs were evenly distributed across different chromosomes and could be enriched in diversified gene ontology terms, while differential DIPs avoided to distribute on X-chromosome. Significantly up-regulated DIPs in PE were focused on the top functions of lipid metabolism, while 23 of these DIPs could form the top network regulating cellular movement, development, growth, and proliferation. Our results implicate that human PE placentas have disease-relevant differential DIPs, which reflect aberrantly aggregated proteins of placental tissues. The mass spectrometry proteomics data have been deposited to ProteomeXchange consortium with the data set identifier PXD006654, and iProX database (accession number: IPX0000948000).Keywords: B/D-HPP; C-HPP; detergent-insoluble proteins; preeclampsia; unfolded protein response;
Co-reporter:Jiahui Guo, Yizhi Cui, Ziqi Yan, Yanzhang Luo, Wanling Zhang, Suyuan Deng, Shengquan Tang, Gong Zhang, Qing-Yu He, and Tong Wang
Journal of Proteome Research 2016 Volume 15(Issue 11) pp:4060-4072
Publication Date(Web):July 29, 2016
DOI:10.1021/acs.jproteome.6b00391
Identification of all phosphorylation forms of known proteins is a major goal of the Chromosome-Centric Human Proteome Project (C-HPP). Recent studies have found that certain phosphoproteins can be encapsulated in exosomes and function as key regulators in tumor microenvironment, but no deep coverage phosphoproteome of human exosomes has been reported to date, which makes the exosome a potential source for the new phosphosite discovery. In this study, we performed highly optimized MS analyses on the exosomal and cellular proteins isolated from human colorectal cancer SW620 cells. With stringent data quality control, 313 phosphoproteins with 1091 phosphosites were confidently identified from the SW620 exosome, from which 202 new phosphosites were detected. Exosomal phosphoproteins were significantly enriched in the 11q12.1–13.5 region of chromosome 11 and had a remarkably high level of tyrosine-phosphorylated proteins (6.4%), which were functionally relevant to ephrin signaling pathway-directed cytoskeleton remodeling. In conclusion, we here report the first high-coverage phosphoproteome of human cell-secreted exosomes, which leads to the identification of new phosphosites for C-HPP. Our findings provide insights into the exosomal phosphoprotein systems that help to understand the signaling language being delivered by exosomes in cell–cell communications. The mass spectrometry proteomics data have been deposited to the ProteomeXchange consortium with the data set identifier PXD004079, and iProX database (accession number: IPX00076800).Keywords: C-HPP; exosome; new phosphosites; phosphoproteome; signaling pathway;
Co-reporter:Lijuan Yang, Xinlei Lian, Wanling Zhang, Jie Guo, Qing Wang, Yaxing Li, Yang Chen, Xingfeng Yin, Pengyuan Yang, Fei Lan, Qing-Yu He, Gong Zhang, and Tong Wang
Journal of Proteome Research 2015 Volume 14(Issue 9) pp:3645-3657
Publication Date(Web):July 23, 2015
DOI:10.1021/acs.jproteome.5b00480
The chromosome-centric human proteome project (C-HPP) has made great progress of finding protein evidence (PE) for missing proteins (PE2–4 proteins defined by the neXtProt), which now becomes an increasingly challenging field. As a majority of samples tested in this field were from adult tissues/cells, the developmental stage specific or relevant proteins could be missed due to biological source availability. We posit that epigenetic interventions may help to partially bypass such a limitation by stimulating the expression of the “silenced” genes in adult cells, leading to the increased chance of finding missing proteins. In this study, we established in vitro human cell models to modify the histone acetylation, demethylation, and methylation with near physiological conditions. With mRNA-seq analysis, we found that histone modifications resulted in overall increases of expressed genes in an even distribution manner across different chromosomes. We identified 64 PE2–4 and six PE5 proteins by MaxQuant (FDR < 1% at both protein and peptide levels) and 44 PE2–4 and 7 PE5 proteins by Mascot (FDR < 1% at peptide level) searches, respectively. However, only 24 PE2–4 and five PE5 proteins in Mascot, and 12 PE2–4 and one PE5 proteins in MaxQuant searches could, respectively, pass our stringently manual spectrum inspections. Collectively, 27 PE2–4 and five PE5 proteins were identified from the epigenetically modified cells; among them, 19 PE2–4 and three PE5 proteins passed FDR < 1% at both peptide and protein levels. Gene ontology analyses revealed that the PE2–4 proteins were significantly involved in development and spermatogenesis, although their chemical–physical features had no statistical difference from the background. In addition, we presented an example of suspicious PE5 peptide spectrum matched with unusual AA substitutions related to post-translational modification. In conclusion, the epigenetically manipulated cell models should be a useful tool for finding missing proteins in C-HPP. The mass spectrometry data have been deposited to the iProx database (accession number: IPX00020200).
Co-reporter:Jieming Guo, Xinlei Lian, Jiayong Zhong, Tong Wang and Gong Zhang
Molecular BioSystems 2015 vol. 11(Issue 2) pp:370-378
Publication Date(Web):08 Oct 2014
DOI:10.1039/C4MB00462K
We previously found that shorter mRNAs are preferably translated in various eukaryotic cells. However, the theoretical basis of this phenomenon is unclear. We hypothesize that shorter mRNA length correlates to the decreased translational error rate to reduce the energy consumption on defective protein degradation. In this study, we established a computational model to explain the length-dependent translation initiation efficiency. We provided mathematical evidence that this translational preference, rather than the protein degradation, is a major factor to shape the genome-wide length-dependent protein abundance. As deducted, we simulated that shorter mRNA length is a determinant of initiation circularization time. Furthermore, our model unveiled that preferentially translating shorter mRNAs benefits the energy efficiency on the proteome functionality. We proposed that cancer cells tend to hijack this evolutionary mechanism by counteracting the higher translational error rate. In conclusion, our model provides insights into the nature of the global length-dependent translational control and its biological significance.
Co-reporter:Jiayong Zhong, Yizhi Cui, Jiahui Guo, Zhipeng Chen, Lijuan Yang, Qing-Yu He, Gong Zhang, and Tong Wang
Journal of Proteome Research 2014 Volume 13(Issue 1) pp:50-59
Publication Date(Web):2017-2-22
DOI:10.1021/pr4007409
Chromosome-centric human proteome project (C-HPP) aims at differentiating chromosome-based and tissue-specific protein compositions in terms of protein expression, quantification, and modification. We previously found that the analysis of translating mRNA (mRNA attached to ribosome-nascent chain complex, RNC-mRNA) can explain over 94% of mRNA-protein abundance. Therefore, we propose here to use full-length RNC-mRNA information to illustrate protein expression both qualitatively and quantitatively. We performed RNA-seq on RNC-mRNA (RNC-seq) and detected 12 758 and 14 113 translating genes in human normal bronchial epithelial (HBE) cells and human colorectal adenocarcinoma Caco-2 cells, respectively. We found that most of these genes were mapped with >80% of coding sequence coverage. In Caco-2 cells, we provided translating evidence on 4180 significant single-nucleotide variations. While using RNC-mRNA data as a standard for proteomic data integration, both translating and protein evidence of 7876 genes can be acquired from four interlaboratory data sets with different MS platforms. In addition, we detected 1397 noncoding mRNAs that were attached to ribosomes, suggesting a potential source of new protein explorations. By comparing the two cell lines, a total of 677 differentially translated genes were found to be nonevenly distributed across chromosomes. In addition, 2105 genes in Caco-2 and 750 genes in HBE cells are expressed in a cell-specific manner. These genes are significantly and specifically clustered on multiple chromosomes, such as chromosome 19. We conclude that HPP/C-HPP investigations can be considerably improved by integrating RNC-mRNA analysis with MS, bioinformatics, and antibody-based verifications.
Co-reporter:Chengpu Zhang, Ning Li, Linhui Zhai, Shaohang Xu, Xiaohui Liu, Yizhi Cui, Jie Ma, Mingfei Han, Jing Jiang, Chunyuan Yang, Fengxu Fan, Liwei Li, Peibin Qin, Qing Yu, Cheng Chang, Na Su, Junjie Zheng, Tao Zhang, Bo Wen, Ruo Zhou, Liang Lin, Zhilong Lin, Baojin Zhou, Yang Zhang, Guoquan Yan, Yinkun Liu, Pengyuan Yang, Kun Guo, Wei Gu, Yang Chen, Gong Zhang, Qing-Yu He, Songfeng Wu, Tong Wang, Huali Shen, Quanhui Wang, Yunping Zhu, Fuchu He, and Ping Xu
Journal of Proteome Research 2014 Volume 13(Issue 1) pp:114-125
Publication Date(Web):2017-2-22
DOI:10.1021/pr400900j
Our first proteomic exploration of human chromosome 1 began in 2012 (CCPD 1.0), and the genome-wide characterization of the human proteome through public resources revealed that 32–39% of proteins on chromosome 1 remain unidentified. To characterize all of the missing proteins, we applied an OMICS-integrated analysis of three human liver cell lines (Hep3B, MHCC97H, and HCCLM3) using mRNA and ribosome nascent-chain complex-bound mRNA deep sequencing and proteome profiling, contributing mass spectrometric evidence of 60 additional chromosome 1 gene products. Integration of the annotation information from public databases revealed that 84.6% of genes on chromosome 1 had high-confidence protein evidence. Hierarchical analysis demonstrated that the remaining 320 missing genes were either experimentally or biologically explainable; 128 genes were found to be tissue-specific or rarely expressed in some tissues, whereas 91 proteins were uncharacterized mainly due to database annotation diversity, 89 were genes with low mRNA abundance or unsuitable protein properties, and 12 genes were identifiable theoretically because of a high abundance of mRNAs/RNC-mRNAs and the existence of proteotypic peptides. The relatively large contribution made by the identification of enriched transcription factors suggested specific enrichment of low-abundance protein classes, and SRM/MRM could capture high-priority missing proteins. Detailed analyses of the differentially expressed genes indicated that several gene families located on chromosome 1 may play critical roles in mediating hepatocellular carcinoma invasion and metastasis. All mass spectrometry proteomics data corresponding to our study were deposited in the ProteomeXchange under the identifiers PXD000529, PXD000533, and PXD000535.
Co-reporter:Gong Zhang;QingYu He
Science China Life Sciences 2014 Volume 57( Issue 3) pp:358-360
Publication Date(Web):2014 March
DOI:10.1007/s11427-014-4618-1