PredPRBA: Prediction of Protein-RNA Binding Affinity Using Gradient Boosted Regression Trees

Deng, Lei and Yang, Wenyi and Liu, Hui (2019) PredPRBA: Prediction of Protein-RNA Binding Affinity Using Gradient Boosted Regression Trees. Frontiers in Genetics, 10. ISSN 1664-8021

[thumbnail of pubmed-zip/versions/1/package-entries/fgene-10-00637.pdf] Text
pubmed-zip/versions/1/package-entries/fgene-10-00637.pdf - Published Version

Download (2MB)

Abstract

Protein-RNA interactions play essential roles in many biological aspects. Quantifying the binding affinity of protein-RNA complexes is helpful to the understanding of protein-RNA recognition mechanisms and identification of strong binding partners. Due to experimentally measured protein-RNA binding affinity data available is still limited to date, there is a pressing demand for accurate and reliable computational approaches. In this paper, we propose a computational approach, PredPRBA, which can effectively predict protein-RNA binding affinity using gradient boosted regression trees. We build a dataset of protein-RNA binding affinity that includes 103 protein-RNA complex structures manually collected from related literature. Then, we generate 37 kinds of sequence and structural features and explore the relationship between the features and protein-RNA binding affinity. We find that the binding affinity mainly depends on the structure of RNA molecules. According to the type of RNA associated with proteins composed of the protein-RNA complex, we split the 103 protein-RNA complexes into six categories. For each category, we build a gradient boosted regression tree (GBRT) model based on the generated features. We perform a comprehensive evaluation for the proposed method on the binding affinity dataset using leave-one-out cross-validation. We show that PredPRBA achieves correlations ranging from 0.723 to 0.897 among six categories, which is significantly better than other typical regression methods and the pioneer protein-RNA binding affinity predictor SPOT-Seq-RNA. In addition, a user-friendly web server has been developed to predict the binding affinity of protein-RNA complexes. The PredPRBA webserver is freely available at http://PredPRBA.denglab.org/.

Item Type: Article
Subjects: Open STM Article > Medical Science
Depositing User: Unnamed user with email support@openstmarticle.com
Date Deposited: 16 Feb 2023 10:56
Last Modified: 16 Apr 2025 12:48
URI: http://articles.sendtopublish.com/id/eprint/218

Actions (login required)

View Item
View Item