Co-reporter:Jiansong Fang, Xiaocong Pang, Rong Yan, Wenwen Lian, Chao Li, Qi Wang, Ai-Lin Liu and Guan-Hua Du
RSC Advances 2016 vol. 6(Issue 12) pp:9857-9871
Publication Date(Web):18 Jan 2016
DOI:10.1039/C5RA23035G
Neuronal cell death from oxidative stress is a strong factor of many neurodegenerative diseases. To tackle these problems, phenotypic drug screening assays are a possible alternative strategy. The aim of this study is to develop the neuroprotective models against glutamate or H2O2-induced neurotoxicity by machine learning approaches, which helps in discovering neuroprotective compounds. Four different single classifiers (neural network, k nearest neighbors, classification tree and random forest) were constructed based on two large datasets containing 1260 and 900 known active or inactive compounds, which were integrated to develop the combined Bayesian models to obtain superior performance. Our results showed that both of the Bayesian models (combined-NB-1 and combined-NB-2) outperformed the corresponding four single classifiers. Additionally, structural fingerprint descriptors were added to improve the predictive ability of the models, resulting in the two best models NB-1-LPFP4 and NB-2-LCFP6. The best two models gave Matthews correlation coefficients of 0.972 and 0.956 for 5-fold cross validation as well as 0.953 and 0.902 for the test set, respectively. To illustrate the practical applications of the two models, NB-1-LPFP4 and NB-2-LCFP6 were used to perform virtual screening for discovering neuroprotective compounds, and 70 compounds were selected for further cell-based assay. The assay results showed that 28 compounds exhibited neuroprotective effects against glutamate-induced and H2O2-induced neurotoxicity simultaneously. Our results suggested the method that integrated single classifiers into combined Bayesian models could be feasible to predict neuroprotective compounds.
Co-reporter:Jiansong Fang;Xiaocong Pang;Ping Wu;Rong Yan;Li Gao;Chao Li;Wenwen Lian;Qi Wang;Guan-hua Du
Chemical Biology & Drug Design 2016 Volume 87( Issue 5) pp:649-663
Publication Date(Web):
DOI:10.1111/cbdd.12700
A dataset of 67 berberine derivatives for the inhibition of butyrylcholinesterase (BuChE) was studied based on the combination of quantitative structure–activity relationships models, molecular docking, and molecular dynamics methods. First, a series of berberine derivatives were reported, and their inhibitory activities toward butyrylcholinesterase (BuChE) were evaluated. By 2D- quantitative structure–activity relationships studies, the best model built by partial least-square had a conventional correlation coefficient of the training set (R2) of 0.883, a cross-validation correlation coefficient () of 0.777, and a conventional correlation coefficient of the test set () of 0.775. The model was also confirmed by Y-randomization examination. In addition, the molecular docking and molecular dynamics simulation were performed to better elucidate the inhibitory mechanism of three typical berberine derivatives (berberine, C2, and C55) toward BuChE. The predicted binding free energy results were consistent with the experimental data and showed that the van der Waals energy term (ΔEvdw) difference played the most important role in differentiating the activity among the three inhibitors (berberine, C2, and C55). The developed quantitative structure–activity relationships models provide details on the fine relationship linking structure and activity and offer clues for structural modifications, and the molecular simulation helps to understand the inhibitory mechanism of the three typical inhibitors. In conclusion, the results of this study provide useful clues for new drug design and discovery of BuChE inhibitors from berberine derivatives.
Co-reporter:Wenwen Lian;Jiansong Fang;Chao Li;Xiaocong Pang
Molecular Diversity 2016 Volume 20( Issue 2) pp:439-451
Publication Date(Web):2016 May
DOI:10.1007/s11030-015-9641-z
Neuraminidase (NA) is a critical enzyme in the life cycle of influenza virus, which is known as a successful paradigm in the design of anti-influenza agents. However, to date there are no classification models for the virtual screening of NA inhibitors. In this work, we built support vector machine and Naïve Bayesian models of NA inhibitors and non-inhibitors, with different ratios of active-to-inactive compounds in the training set and different molecular descriptors. Four models with sensitivity or Matthews correlation coefficients greater than 0.9 were chosen to predict the NA inhibitory activities of 15,600 compounds in our in-house database. We combined the results of four optimal models and selected 60 representative compounds to assess their NA inhibitory profiles in vitro. Nine NA inhibitors were identified, five of which were oseltamivir derivatives with large C-5 substituents exhibiting potent inhibition against H1N1 NA with \(\hbox {IC}_{50}\) values in the range of 12.9–185.0 nM, and against H3N2 NA with \(\hbox {IC}_{50}\) values between 18.9 and 366.1 nM. The other four active compounds belonged to novel scaffolds, with \(\hbox {IC}_{50}\) values ranging 39.5–63.8 \(\upmu \)M against H1N1 NA and 44.5–114.1 \(\upmu \)M against H3N2 NA. This is the first time that classification models of NA inhibitors and non-inhibitors are built and their prediction results validated experimentally using in vitro assays.
Co-reporter:Chao Li;Jian-Song Fang;Wen-Wen Lian;Xiao-Cong Pang;Guan-Hua Du
Chemical Biology & Drug Design 2015 Volume 85( Issue 4) pp:427-438
Publication Date(Web):
DOI:10.1111/cbdd.12425
The anti-influenza virus activities of 50 resveratrol (RV: 3, 5, 4′-trihydroxy-trans-stilbene) derivatives were evaluated using a neuraminidase (NA) activity assay. The results showed that 35 compounds exerted an inhibitory effect on the NA activity of the influenza virus strain A/PR/8/34 (H1N1) with 50% inhibitory concentration (IC50) values ranging from 3.56 to 186.1 μm. Next, the 35 RV derivatives were used to develop 3D quantitative structure–activity relationship (3D QSAR) models for understanding the chemical–biological interactions governing their activities against NA. The comparative molecular field analysis (CoMFA r2 = 0.973, q2 = 0.620, qtest2 = 0.661) and the comparative molecular similarity indices analysis (CoMSIA r2 = 0.956, q2 = 0.610, qtest2 = 0.531) were applied. Afterward, molecular docking was performed to study the molecular interactions between the RV derivatives and NA. Finally, a cytopathic effect (CPE) reduction assay was used to evaluate the antiviral effects of the RV derivatives in vitro. Time-of-addition studies demonstrated that the RV derivatives might have a direct effect on viral particle infectivity. Our results indicate that the RV derivatives are potentially useful antiviral compounds for new drug design and development for influenza treatment.
Co-reporter:Jiansong Fang;Ranyao Yang;Li Gao;Shengqian Yang;Xiaocong Pang
Molecular Diversity 2015 Volume 19( Issue 1) pp:149-162
Publication Date(Web):2015 February
DOI:10.1007/s11030-014-9561-3
Cyclin-dependent kinase 5 (CDK5) has emerged as a principal therapeutic target for Alzheimer’s disease. It is highly desirable to develop computational models that can predict the inhibitory effects of a compound towards CDK5 activity. In this study, two machine learning tools (naive Bayesian and recursive partitioning) were used to generate four single classifiers from a large dataset containing 462 CDK5 inhibitors and 1,500 non-inhibitors. Then, two types of consensus models [combined classifier-artificial neural networks (CC-ANNs) and consensus prediction] were applied to combine four single classifiers to obtain superior performance. The results showed that both consensus models outperformed four single classifiers, and (MCC \(=\) 0.806) was superior to consensus prediction (MCC \(=\) 0.711) for an external test set. To illustrate the practical applications of the CC-ANN model in virtual screening, an in-house dataset containing 29,170 compounds was screened, and 40 compounds were selected for further bioactivity assays. The assay results showed that 13 out of 40 compounds exerted CDK5/p35 inhibitory activities with IC\(_{50}\) values ranging from 9.23 to \(229.76 \;\upmu \hbox {M}\). Interestingly, three new scaffolds that had not been previously reported as CDK5 inhibitors were found in this study. These studies prove that our protocol is an effective approach to predict small-molecule CDK5 affinity and identify novel lead compounds.
Co-reporter:Jiansong Fang, Ranyao Yang, Li Gao, Dan Zhou, Shengqian Yang, Ai-lin Liu, and Guan-hua Du
Journal of Chemical Information and Modeling 2013 Volume 53(Issue 11) pp:3009-3020
Publication Date(Web):October 21, 2013
DOI:10.1021/ci400331p
Butyrylcholinesterase (BuChE, EC 3.1.1.8) is an important pharmacological target for Alzheimer’s disease (AD) treatment. However, the currently available BuChE inhibitor screening assays are expensive, labor-intensive, and compound-dependent. It is necessary to develop robust in silico methods to predict the activities of BuChE inhibitors for the lead identification. In this investigation, support vector machine (SVM) models and naive Bayesian models were built to discriminate BuChE inhibitors (BuChEIs) from the noninhibitors. Each molecule was initially represented in 1870 structural descriptors (1235 from ADRIANA.Code, 334 from MOE, and 301 from Discovery studio). Correlation analysis and stepwise variable selection method were applied to figure out activity-related descriptors for prediction models. Additionally, structural fingerprint descriptors were added to improve the predictive ability of models, which were measured by cross-validation, a test set validation with 1001 compounds and an external test set validation with 317 diverse chemicals. The best two models gave Matthews correlation coefficient of 0.9551 and 0.9550 for the test set and 0.9132 and 0.9221 for the external test set. To demonstrate the practical applicability of the models in virtual screening, we screened an in-house data set with 3601 compounds, and 30 compounds were selected for further bioactivity assay. The assay results showed that 10 out of 30 compounds exerted significant BuChE inhibitory activities with IC50 values ranging from 0.32 to 22.22 μM, at which three new scaffolds as BuChE inhibitors were identified for the first time. To our best knowledge, this is the first report on BuChE inhibitors using machine learning approaches. The models generated from SVM and naive Bayesian approaches successfully predicted BuChE inhibitors. The study proved the feasibility of a new method for predicting bioactivities of ligands and discovering novel lead compounds.
Co-reporter:Li Gao;Jian-Song Fang;Xiao-Yu Bai;Dan Zhou;Yi-Tao Wang;Guan-Hua Du
Chemical Biology & Drug Design 2013 Volume 81( Issue 6) pp:675-687
Publication Date(Web):
DOI:10.1111/cbdd.12127
The flavonoid baicalein has been proven effective in animal models of parkinson's disease; however, the potential biological targets and molecular mechanisms underlying the antiparkinsonian action of baicalein have not been fully clarified. In the present study, the potential targets of baicalein were predicted by in silico target fishing approaches including database mining, molecular docking, structure-based pharmacophore searching, and chemical similarity searching. A consensus scoring formula has been developed and validated to objectively rank the targets. The top two ranked targets catechol-O-methyltransferase (COMT) and monoamine oxidase B (MAO-B) have been proposed as targets of baicalein by literatures. The third-ranked one (N-methyl-d-aspartic acid receptor, NMDAR) with relatively low consensus score was further experimentally tested. Although our results suggested that baicalein significantly attenuated NMDA-induced neurotoxicity including cell death, intracellular nitric oxide (NO) and reactive oxygen species (ROS) generation, extracellular NO reduction in human SH-SY5Y neuroblastoma cells, baicalein exhibited no inhibitory effect on [3H]MK-801 binding study, indicating that NMDAR might not be the target of baicalein. In conclusion, the results indicate that in silico target fishing is an effective method for drug target discovery, and the protective role of baicalein against NMDA-induced neurotoxicity supports our previous research that baicalein possesses antiparkinsonian activity.
Co-reporter:Li Gao, Mian Zu, Song Wu, Ai-Lin Liu, Guan-Hua Du
Bioorganic & Medicinal Chemistry Letters 2011 21(19) pp: 5964-5970
Publication Date(Web):
DOI:10.1016/j.bmcl.2011.07.071
Co-reporter:Zining He, Wenwen Lian, Jiawei Liu, Runsheng Zheng, Hui Xu, Guanhua Du, Ailin Liu
Phytochemistry Letters (March 2017) Volume 19() pp:160-167
Publication Date(Web):March 2017
DOI:10.1016/j.phytol.2016.12.031
Co-reporter:Mian Zu, Fan Yang, Weiling Zhou, Ailin Liu, Guanhua Du, Lishu Zheng
Antiviral Research (June 2012) Volume 94(Issue 3) pp:217-224
Publication Date(Web):June 2012
DOI:10.1016/j.antiviral.2012.04.001
Co-reporter:Chao Li, Xiaowei Song, Junke Song, Xiaocong Pang, Zhe Wang, Ying Zhao, Wenwen Lian, Ailin Liu, Guanhua Du
Acta Pharmaceutica Sinica B (January 2016) Volume 6(Issue 1) pp:64-70
Publication Date(Web):January 2016
DOI:10.1016/j.apsb.2015.10.001
Co-reporter:Jiansong Fang, Ping Wu, Ranyao Yang, Li Gao, Chao Li, Dongmei Wang, Song Wu, Ai-Lin Liu, Guan-Hua Du
Acta Pharmaceutica Sinica B (December 2014) Volume 4(Issue 6) pp:
Publication Date(Web):1 December 2014
DOI:10.1016/j.apsb.2014.10.002
In this study two genistein derivatives (G1 and G2) are reported as inhibitors of acetylcholinesterase (AChE) and butyrylcholinesterase (BuChE), and differences in the inhibition of AChE are described. Although they differ in structure by a single methyl group, the inhibitory effect of G1 (IC50=264 nmol/L) on AChE was 80 times stronger than that of G2 (IC50=21,210 nmol/L). Enzyme-kinetic analysis, molecular docking and molecular dynamics (MD) simulations were conducted to better understand the molecular basis for this difference. The results obtained by kinetic analysis demonstrated that G1 can interact with both the catalytic active site and peripheral anionic site of AChE. The predicted binding free energies of two complexes calculated by the molecular mechanics/generalized born surface area (MM/GBSA) method were consistent with the experimental data. The analysis of the individual energy terms suggested that a difference between the net electrostatic contributions (ΔEele+ΔGGB) was responsible for the binding affinities of these two inhibitors. Additionally, analysis of the molecular mechanics and MM/GBSA free energy decomposition revealed that the difference between G1 and G2 originated from interactions with Tyr124, Glu292, Val294 and Phe338 of AChE. In conclusion, the results reveal significant differences at the molecular level in the mechanism of inhibition of AChE by these structurally related compounds.The inhibitory potency of G1 (IC50=264 nmol/L) was 80 times stronger than that of G2 (IC50=21,210 nmol/L) against acetylcholinesterase. The combination of enzyme-kinetic analysis, molecular docking and molecular dynamics simulations was conducted to better understand the molecular basis of the inhibition against AChE.Download full-size image
Co-reporter:Jiansong Fang; Yongjie Li; Rui Liu; Xiaocong Pang; Chao Li; Ranyao Yang; Yangyang He; Wenwen Lian; Ai-Lin Liu;Guan-Hua Du
Journal of Chemical Information and Modeling () pp:
Publication Date(Web):December 22, 2014
DOI:10.1021/ci500574n
To determine chemical–protein interactions (CPI) is costly, time-consuming, and labor-intensive. In silico prediction of CPI can facilitate the target identification and drug discovery. Although many in silico target prediction tools have been developed, few of them could predict active molecules against multitarget for a single disease. In this investigation, naive Bayesian (NB) and recursive partitioning (RP) algorithms were applied to construct classifiers for predicting the active molecules against 25 key targets toward Alzheimer’s disease (AD) using the multitarget-quantitative structure–activity relationships (mt-QSAR) method. Each molecule was initially represented with two kinds of fingerprint descriptors (ECFP6 and MACCS). One hundred classifiers were constructed, and their performance was evaluated and verified with internally 5-fold cross-validation and external test set validation. The range of the area under the receiver operating characteristic curve (ROC) for the test sets was from 0.741 to 1.0, with an average of 0.965. In addition, the important fragments for multitarget against AD given by NB classifiers were also analyzed. Finally, the validated models were employed to systematically predict the potential targets for six approved anti-AD drugs and 19 known active compounds related to AD. The prediction results were confirmed by reported bioactivity data and our in vitro experimental validation, resulting in several multitarget-directed ligands (MTDLs) against AD, including seven acetylcholinesterase (AChE) inhibitors ranging from 0.442 to 72.26 μM and four histamine receptor 3 (H3R) antagonists ranging from 0.308 to 58.6 μM. To be exciting, the best MTDL DL0410 was identified as an dual cholinesterase inhibitor with IC50 values of 0.442 μM (AChE) and 3.57 μM (BuChE) as well as a H3R antagonist with an IC50 of 0.308 μM. This investigation is the first report using mt-QASR approach to predict chemical–protein interaction for a single disease and discovering highly potent MTDLs. This protocol may be useful for in silico multitarget prediction of other diseases.