EBSCO Logo
Connecting you to content on EBSCOhost
Results
Title

emPDBA: protein-DNA binding affinity prediction by combining features from binding partners and interface learned with ensemble regression model.

Authors

Yang, Shuang; Gong, Weikang; Zhou, Tong; Sun, Xiaohan; Chen, Lei; Zhou, Wenxue; Li, Chunhua

Abstract

Protein–deoxyribonucleic acid (DNA) interactions are important in a variety of biological processes. Accurately predicting protein-DNA binding affinity has been one of the most attractive and challenging issues in computational biology. However, the existing approaches still have much room for improvement. In this work, we propose an ensemble model for Protein-DNA Binding Affinity prediction (emPDBA), which combines six base models with one meta-model. The complexes are classified into four types based on the DNA structure (double-stranded or other forms) and the percentage of interface residues. For each type, emPDBA is trained with the sequence-based, structure-based and energy features from binding partners and complex structures. Through feature selection by the sequential forward selection method, it is found that there do exist considerable differences in the key factors contributing to intermolecular binding affinity. The complex classification is beneficial for the important feature extraction for binding affinity prediction. The performance comparison of our method with other peer ones on the independent testing dataset shows that emPDBA outperforms the state-of-the-art methods with the Pearson correlation coefficient of 0.53 and the mean absolute error of 1.11 kcal/mol. The comprehensive results demonstrate that our method has a good performance for protein-DNA binding affinity prediction. Availability and implementation: The source code is available at https://github.com/ChunhuaLiLab/emPDBA/.

Subjects

REGRESSION analysis; DNA structure; PEARSON correlation (Statistics); COMPUTATIONAL biology; FEATURE selection

Publication

Briefings in Bioinformatics, 2023, Vol 24, Issue 4, p1

ISSN

1467-5463

Publication type

Academic Journal

DOI

10.1093/bib/bbad192

EBSCO Connect | Privacy policy | Terms of use | Copyright | Manage my cookies
Journals | Subjects | Sitemap
© 2025 EBSCO Industries, Inc. All rights reserved