Keyword search (4,163 papers available)

"Butler G" Authored Publications:

Title Authors PubMed ID
1 Ion channel classification through machine learning and protein language model embeddings Ghazikhani H; Butler G; 39572876
ENCS
2 SPOT: A machine learning model that predicts specific substrates for transport proteins Kroll A; Niebuhr N; Butler G; Lercher MJ; 39325691
ENCS
3 Comparative genomic analysis of thermophilic fungi reveals convergent evolutionary adaptations and gene losses Steindorff AS; Aguilar-Pontes MV; Robinson AJ; Andreopoulos B; LaButti K; Kuo A; Mondo S; Riley R; Otillar R; Haridas S; Lipzen A; Grimwood J; Schmutz J; Clum A; Reid ID; Moisan MC; Butler G; Nguyen TTM; Dewar K; Conant G; Drula E; Henrissat B; Hansel C; Singer S; Hutchinson MI; de Vries RP; Natvig DO; Powell AJ; Tsang A; Grigoriev IV; 39266695
CSFG
4 Exploiting protein language models for the precise classification of ion channels and ion transporters Ghazikhani H; Butler G; 38656743
CSFG
5 Enhanced identification of membrane transport proteins: a hybrid approach combining ProtBERT-BFD and convolutional neural networks Ghazikhani H; Butler G; 37497772
ENCS
6 Integrative approach for detecting membrane proteins. Alballa M, Butler G 33349234
CSFG
7 BENIN: Biologically enhanced network inference. Wonkap SK, Butler G 32698722
ENCS
8 TooT-T: discrimination of transport proteins from non-transport proteins. Alballa M, Butler G 32321420
CSFG
9 TranCEP: Predicting the substrate class of transmembrane transport proteins using compositional, evolutionary, and positional information. Alballa M, Aplop F, Butler G 31935244
CSFG
10 Analytical and computational approaches to define the Aspergillus niger secretome. Tsang A, Butler G, Powlowski J, Panisko EA, Baker SE 19618504
BIOLOGY
11 SnowyOwl: accurate prediction of fungal genes by using RNA-Seq and homology information to select among ab initio models. Reid I, O'Toole N, Zabaneh O, Nourzadeh R, Dahdouli M, Abdellateef M, Gordon PM, Soh J, Butler G, Sensen CW, Tsang A 24980894
CSFG
12 Machine learning for biomedical literature triage. Almeida H, Meurs MJ, Kosseim L, Butler G, Tsang A 25551575
CSFG
13 mycoCLAP, the database for characterized lignocellulose-active proteins of fungal origin: resource and text mining curation support. Strasser K, McDonnell E, Nyaga C, Wu M, Wu S, Almeida H, Meurs MJ, Kosseim L, Powlowski J, Butler G, Tsang A 25754864
CSFG
14 An Adaptive Defect Weighted Sampling Algorithm to Design Pseudoknotted RNA Secondary Structures. Zandi K, Butler G, Kharma N 27499762
CSFG

 

Title:SnowyOwl: accurate prediction of fungal genes by using RNA-Seq and homology information to select among ab initio models.
Authors:Reid IO'Toole NZabaneh ONourzadeh RDahdouli MAbdellateef MGordon PMSoh JButler GSensen CWTsang A
Link:https://www.ncbi.nlm.nih.gov/pubmed/24980894?dopt=Abstract
DOI:10.1186/1471-2105-15-229
Publication:BMC bioinformatics
Keywords:
PMID:24980894 Category:BMC Bioinformatics Date Added:2019-06-07
Dept Affiliation: CSFG
1 Centre for Structural and Functional Genomics, Concordia University, 7141 Sherbrooke St, W, Montreal, QC H4B 1R6, Canada. ian.reid@concordia.ca.

Description:

SnowyOwl: accurate prediction of fungal genes by using RNA-Seq and homology information to select among ab initio models.

BMC Bioinformatics. 2014 Jul 01;15:229

Authors: Reid I, O'Toole N, Zabaneh O, Nourzadeh R, Dahdouli M, Abdellateef M, Gordon PM, Soh J, Butler G, Sensen CW, Tsang A

Abstract

BACKGROUND: Locating the protein-coding genes in novel genomes is essential to understanding and exploiting the genomic information but it is still difficult to accurately predict all the genes. The recent availability of detailed information about transcript structure from high-throughput sequencing of messenger RNA (RNA-Seq) delineates many expressed genes and promises increased accuracy in gene prediction. Computational gene predictors have been intensively developed for and tested in well-studied animal genomes. Hundreds of fungal genomes are now or will soon be sequenced. The differences of fungal genomes from animal genomes and the phylogenetic sparsity of well-studied fungi call for gene-prediction tools tailored to them.

RESULTS: SnowyOwl is a new gene prediction pipeline that uses RNA-Seq data to train and provide hints for the generation of Hidden Markov Model (HMM)-based gene predictions and to evaluate the resulting models. The pipeline has been developed and streamlined by comparing its predictions to manually curated gene models in three fungal genomes and validated against the high-quality gene annotation of Neurospora crassa; SnowyOwl predicted N. crassa genes with 83% sensitivity and 65% specificity. SnowyOwl gains sensitivity by repeatedly running the HMM gene predictor Augustus with varied input parameters and selectivity by choosing the models with best homology to known proteins and best agreement with the RNA-Seq data.

CONCLUSIONS: SnowyOwl efficiently uses RNA-Seq data to produce accurate gene models in both well-studied and novel fungal genomes. The source code for the SnowyOwl pipeline (in Python) and a web interface (in PHP) is freely available from http://sourceforge.net/projects/snowyowl/.

PMID: 24980894 [PubMed - indexed for MEDLINE]





BookR developed by Sriram Narayanan
for the Concordia University School of Health
Copyright © 2011-2026
Cookie settings
Concordia University