Machine Learning Approaches for Predicting Protein Complex Similarity

Farhoodi, Roshanak; Akbal-Delibas, Bahar; Haspel, Nurit

Machine Learning Approaches for Predicting Protein Complex Similarity

dc.contributor.author	Farhoodi, Roshanak
dc.contributor.author	Akbal-Delibas, Bahar
dc.contributor.author	Haspel, Nurit
dc.date.accessioned	2019-06-27T08:01:35Z
dc.date.available	2019-06-27T08:01:35Z
dc.date.issued	2017
dc.description.abstract	Discriminating native-like structures from false positives with high accuracy is one of the biggest challenges in protein-protein docking. While there is an agreement on the existence of a relationship between various favorable intermolecular interactions (e.g. Van der Waals electrostatic and desolvation forces) and the similarity of a conformation to its native structure the precise nature of this relationship is not known. Existing protein-protein docking methods typically formulate this relationship as a weighted sum of selected terms and calibrate their weights by using a training set to evaluate and rank candidate complexes. Despite improvements in the predictive power of recent docking methods producing a large number of false positives by even state-of-the-art methods often leads to failure in predicting the correct binding of many complexes. With the aid of machine learning methods we tested several approaches that not only rank candidate structures relative to each other but also predict how similar each candidate is to the native conformation. We trained a two-layer neural network a multilayer neural network and a network of Restricted Boltzmann Machines against extensive data sets of unbound complexes generated by RosettaDock and PyDock. We validated these methods with a set of refinement candidate structures. We were able to predict the root mean squared deviations (RMSDs) of protein complexes with a very small often less than 1.5 angstrom error margin when trained with structures that have RMSD values of up to 7 angstrom. In our most recent experiments with the protein samples having RMSD values up to 27 angstrom the average prediction error was still relatively small attesting to the potential of our approach in predicting the correct binding of protein-protein complexes.	en_US]
dc.identifier.doi	10.1089/cmb.2016.0137	en_US
dc.identifier.issn	1066-5277	en_US
dc.identifier.issn	1557-8666	en_US
dc.identifier.issn	1066-5277
dc.identifier.issn	1557-8666
dc.identifier.scopus	2-s2.0-85009060594	en_US
dc.identifier.uri	https://hdl.handle.net/20.500.12469/412
dc.identifier.uri	https://doi.org/10.1089/cmb.2016.0137
dc.language.iso	en	en_US
dc.publisher	Mary Ann Liebert Inc Publ	en_US
dc.relation.ispartof	Journal of Computational Biology
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	Machine learning	en_US
dc.subject	Neural networks	en_US
dc.subject	Protein docking and refinement	en_US
dc.subject	RMSD prediction	en_US
dc.subject	Scoring functions	en_US
dc.title	Machine Learning Approaches for Predicting Protein Complex Similarity	en_US
dc.type	Article	en_US
dspace.entity.type	Publication
gdc.author.institutional	Akbal-Delibas, Bahar	en_US
gdc.bip.impulseclass	C5
gdc.bip.influenceclass	C5
gdc.bip.popularityclass	C5
gdc.coar.access	metadata only access
gdc.coar.type	text::journal::journal article
gdc.collaboration.industrial	false
gdc.description.department	Fakülteler, Mühendislik ve Doğa Bilimleri Fakültesi, Bilgisayar Mühendisliği Bölümü	en_US
gdc.description.endpage	51
gdc.description.issue	1
gdc.description.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı	en_US
gdc.description.scopusquality	Q3
gdc.description.startpage	40	en_US
gdc.description.volume	24	en_US
gdc.description.wosquality	Q2
gdc.identifier.openalex	W2536546197
gdc.identifier.pmid	27748625	en_US
gdc.identifier.wos	WOS:000391761300005	en_US
gdc.index.type	WoS
gdc.index.type	Scopus
gdc.index.type	PubMed
gdc.oaire.diamondjournal	false
gdc.oaire.impulse	1.0
gdc.oaire.influence	2.5287303E-9
gdc.oaire.isgreen	false
gdc.oaire.keywords	Protein Conformation
gdc.oaire.keywords	Proteins
gdc.oaire.keywords	Protein docking and refinement
gdc.oaire.keywords	Machine Learning
gdc.oaire.keywords	Molecular Docking Simulation
gdc.oaire.keywords	Scoring functions
gdc.oaire.keywords	Machine learning
gdc.oaire.keywords	Animals
gdc.oaire.keywords	Humans
gdc.oaire.keywords	Neural Networks, Computer
gdc.oaire.keywords	RMSD prediction
gdc.oaire.keywords	Neural networks
gdc.oaire.keywords	Protein Binding
gdc.oaire.popularity	9.697225E-10
gdc.oaire.publicfunded	false
gdc.oaire.sciencefields	0301 basic medicine
gdc.oaire.sciencefields	0303 health sciences
gdc.oaire.sciencefields	03 medical and health sciences
gdc.openalex.collaboration	International
gdc.openalex.fwci	0.17332617
gdc.openalex.normalizedpercentile	0.68
gdc.opencitations.count	1
gdc.plumx.crossrefcites	1
gdc.plumx.mendeley	13
gdc.plumx.pubmedcites	1
gdc.plumx.scopuscites	0
gdc.relation.journal	Journal of Computational Biology
gdc.scopus.citedcount	0
gdc.virtual.author	Delıbaş, Ayşe Bahar
gdc.wos.citedcount	0
relation.isAuthorOfPublication	229c2e99-3e3a-429f-bc29-b66887aeacda
relation.isAuthorOfPublication.latestForDiscovery	229c2e99-3e3a-429f-bc29-b66887aeacda
relation.isOrgUnitOfPublication	fd8e65fe-c3b3-4435-9682-6cccb638779c
relation.isOrgUnitOfPublication	2457b9b3-3a3f-4c17-8674-7f874f030d96
relation.isOrgUnitOfPublication	b20623fc-1264-4244-9847-a4729ca7508c
relation.isOrgUnitOfPublication.latestForDiscovery	fd8e65fe-c3b3-4435-9682-6cccb638779c

Collections

WoS İndeksli Yayınlar Koleksiyonu
Bilgisayar Mühendisliği Bölümü Koleksiyonu
PubMed İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Machine Learning Approaches for Predicting Protein Complex Similarity

Files

Collections