Detailed Information

Cited 173 time in webofscience Cited 180 time in scopus
Metadata Downloads

Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy

Authors
Manayalan, BalachandranSubramaniyam, SathiyamoorthyShin, Tae HwanKim, Myeong OkLee, Gwang
Issue Date
Aug-2018
Publisher
AMER CHEMICAL SOC
Keywords
cell-penetrating peptides; feature selection; machine learning; extremely randomized tree; random forest; uptake efficiency
Citation
JOURNAL OF PROTEOME RESEARCH, v.17, no.8, pp 2715 - 2726
Pages
12
Indexed
SCI
SCIE
SCOPUS
Journal Title
JOURNAL OF PROTEOME RESEARCH
Volume
17
Number
8
Start Page
2715
End Page
2726
URI
https://scholarworks.gnu.ac.kr/handle/sw.gnu/11404
DOI
10.1021/acs.jproteome.8b00148
ISSN
1535-3893
1535-3907
Abstract
Cell-penetrating peptides (CPPs) can enter cells as a variety of biologically active conjugates and have various biomedical applications. To offset the cost and effort of designing novel CPPs in laboratories, computational methods are necessitated to identify candidate CPPs before in vitro experimental studies. We developed a two-layer prediction framework called machine-learning-based prediction of cell-penetrating peptides (MLCPPs). The first-layer predicts whether a given peptide is a CPP or non-CPP, whereas the second-layer predicts the uptake efficiency of the predicted CPPs. To construct a two-layer prediction framework, we employed four different machine-learning methods and five different compositions including amino acid composition (AAC), dipeptide composition, amino acid index, composition transition distribution, and physicochemical properties (PCPs). In the first layer, hybrid features (combination of AAC and PCP) and extremely randomized tree outperformed state-of-the-art predictors in CPP prediction with an accuracy of 0.896 when tested on independent data sets, whereas in the second layer, hybrid features obtained through feature selection protocol and random forest produced an accuracy of 0.725 that is better than state-of-the-art predictors. We anticipate that our method MLCPP will become a valuable tool for predicting CPPs and their uptake efficiency and might facilitate hypothesis-driven experimental design. The MLCPP server interface along with the benchmarking and independent data sets are freely accessible at www.thegleelab.org/MLCPP.
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Myeong Ok photo

Kim, Myeong Ok
대학원 (응용생명과학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE