Keyword search (4,163 papers available)

"Butler G" Authored Publications:

Title Authors PubMed ID
1 Ion channel classification through machine learning and protein language model embeddings Ghazikhani H; Butler G; 39572876
ENCS
2 SPOT: A machine learning model that predicts specific substrates for transport proteins Kroll A; Niebuhr N; Butler G; Lercher MJ; 39325691
ENCS
3 Comparative genomic analysis of thermophilic fungi reveals convergent evolutionary adaptations and gene losses Steindorff AS; Aguilar-Pontes MV; Robinson AJ; Andreopoulos B; LaButti K; Kuo A; Mondo S; Riley R; Otillar R; Haridas S; Lipzen A; Grimwood J; Schmutz J; Clum A; Reid ID; Moisan MC; Butler G; Nguyen TTM; Dewar K; Conant G; Drula E; Henrissat B; Hansel C; Singer S; Hutchinson MI; de Vries RP; Natvig DO; Powell AJ; Tsang A; Grigoriev IV; 39266695
CSFG
4 Exploiting protein language models for the precise classification of ion channels and ion transporters Ghazikhani H; Butler G; 38656743
CSFG
5 Enhanced identification of membrane transport proteins: a hybrid approach combining ProtBERT-BFD and convolutional neural networks Ghazikhani H; Butler G; 37497772
ENCS
6 Integrative approach for detecting membrane proteins. Alballa M, Butler G 33349234
CSFG
7 BENIN: Biologically enhanced network inference. Wonkap SK, Butler G 32698722
ENCS
8 TooT-T: discrimination of transport proteins from non-transport proteins. Alballa M, Butler G 32321420
CSFG
9 TranCEP: Predicting the substrate class of transmembrane transport proteins using compositional, evolutionary, and positional information. Alballa M, Aplop F, Butler G 31935244
CSFG
10 Analytical and computational approaches to define the Aspergillus niger secretome. Tsang A, Butler G, Powlowski J, Panisko EA, Baker SE 19618504
BIOLOGY
11 SnowyOwl: accurate prediction of fungal genes by using RNA-Seq and homology information to select among ab initio models. Reid I, O'Toole N, Zabaneh O, Nourzadeh R, Dahdouli M, Abdellateef M, Gordon PM, Soh J, Butler G, Sensen CW, Tsang A 24980894
CSFG
12 Machine learning for biomedical literature triage. Almeida H, Meurs MJ, Kosseim L, Butler G, Tsang A 25551575
CSFG
13 mycoCLAP, the database for characterized lignocellulose-active proteins of fungal origin: resource and text mining curation support. Strasser K, McDonnell E, Nyaga C, Wu M, Wu S, Almeida H, Meurs MJ, Kosseim L, Powlowski J, Butler G, Tsang A 25754864
CSFG
14 An Adaptive Defect Weighted Sampling Algorithm to Design Pseudoknotted RNA Secondary Structures. Zandi K, Butler G, Kharma N 27499762
CSFG

 

Title:SPOT: A machine learning model that predicts specific substrates for transport proteins
Authors:Kroll ANiebuhr NButler GLercher MJ
Link:https://pubmed.ncbi.nlm.nih.gov/39325691/
DOI:10.1371/journal.pbio.3002807
Publication:PLoS biology
Keywords:
PMID:39325691 Category: Date Added:2024-09-26
Dept Affiliation: ENCS
1 Institute for Computer Science and Department of Biology, Heinrich Heine University, Düsseldorf, Germany.
2 Department of Computer Science and Software Engineering, Concordia University, Montreal, Quebec, Canada.

Description:

Transport proteins play a crucial role in cellular metabolism and are central to many aspects of molecular biology and medicine. Determining the function of transport proteins experimentally is challenging, as they become unstable when isolated from cell membranes. Machine learning-based predictions could provide an efficient alternative. However, existing methods are limited to predicting a small number of specific substrates or broad transporter classes. These limitations stem partly from using small data sets for model training and a choice of input features that lack sufficient information about the prediction problem. Here, we present SPOT, the first general machine learning model that can successfully predict specific substrates for arbitrary transport proteins, achieving an accuracy above 92% on independent and diverse test data covering widely different transporters and a broad range of metabolites. SPOT uses Transformer Networks to represent transporters and substrates numerically. To overcome the problem of missing negative data for training, it augments a large data set of known transporter-substrate pairs with carefully sampled random molecules as non-substrates. SPOT not only predicts specific transporter-substrate pairs, but also outperforms previously published models designed to predict broad substrate classes for individual transport proteins. We provide a web server and Python function that allows users to explore the substrate scope of arbitrary transporters.





BookR developed by Sriram Narayanan
for the Concordia University School of Health
Copyright © 2011-2026
Cookie settings
Concordia University