Overview
-
深度学习在蛋白质功能预测中的应用.
合成生物学, 2023.
[PDF]
-
A comprehensive computational benchmark for evaluating deep learning-based protein function prediction approaches.
Briefings in Bioinformatics, 2024.
[PDF]
-
Deep learning methods for protein function prediction.
Proteomics, 2024.
[PDF]
-
Protein function prediction with gene ontology: from traditional to deep learning models.
PeerJ, 2021.
[PDF]
-
The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.
Genome Biology, 2019.
[PDF]
-
Protein function annotation using protein domain family resources.
Methods, 2016.
[PDF]
Evaluation Metrics
-
A large-scale assessment of sequence database search tools for homology-based protein function prediction.
bioRxiv, 2023.
[PDF]
-
Evaluation: A large-scale evaluation of computational protein function prediction.
Nature Methods, 2013.
[PDF]
Toolbars
-
InterPro in 2022.
Nucleic Acids Research, 2022.
[PDF]
[Web Server]
Public Database
-
Protein sequence database:
UniProt.
-
Protein structure database:
PDB,
AlphaFold database.
-
Protein function database:
Gene Ontology,
GOA.
-
Protein-ligand strcture database:
BioLip.
-
Gene co-expression database:
COXPRESdb,
ATTED-II.
Template-Based Methods
-
QAUST: Protein function prediction using structure similarity, protein interaction, and functional motifs.
Genomics Proteomics Bioinformatics, 2021.
Source: structure, protein-protein network, and functional motifs.
[PDF]
-
MLC: Metric learning on expression data for gene function prediction.
Bioinformatics, 2020.
Source: gene expression.
[PDF]
[Code]
-
INGA 2.0: Improving protein function prediction for the dark proteome.
Nucleic Acids Research, 2019.
Source: sequenc,protein-protein network, domain.
[PDF]
[Web Server]
-
MetaGO: Predicting gene ontology of non-homologous proteins through low-resolution protein structure prediction and protein-protein network mapping.
Journal of Molecular Biology, 2018.
Source: sequence, structure, and protein-protein network.
[PDF]
[Web Server]
-
FunFams: Functional classification of CATH superfamilies: a domain-based approach for protein function annotation.
Bioinformatics, 2017.
Source: family and domain.
[PDF]
[Web Server]
-
COFACTOR 2.0: Improved protein function prediction by combining structure, sequence and protein-protein interaction information.
Nucleic Acids Research, 2017.
Source: sequence, structure, and protein-protein network.
[PDF]
[Web Server]
-
GoFDR: A sequence alignment based method for predicting protein functions.
Methods, 2017.
Source: sequence.
[PDF]
-
INGA: Protein function prediction combining interaction networks, domain assignments and sequence similarity.
Nucleic Acids Research, 2015.
Source: sequenc,protein-protein network, domain.
[PDF]
-
MS-kNN: Protein function prediction by integrating multiple data sources.
BMC Bioinformatics, 2013.
Source: sequence, protein-protein network, and gene expression.
[PDF]
-
dcGO: A domain-centric solution to functional genomics via dcGO predictor.
BMC Bioinformatics, 2013.
Source: domain.
[PDF]
[Web Server]
-
COFACTOR 1.0: An accurate comparative algorithm for structure-based protein function annotation.
Nucleic Acids Research, 2012.
Source: structure.
[PDF]
-
FINDSITE: A combined evolution/structure-based approach to protein function prediction.
Briefings in Bioinformatics, 2009.
Source: structure.
[PDF]
-
MultiPfam2GO: Predicting protein function from domain content.
Bioinformatics, 2009.
Source: domain.
[PDF]
-
Exploiting indirect neighbours and topological weight to predict protein function from protein–protein interactions.
Bioinformatics, 2006.
Source: protein-protein network.
[PDF]
-
ProFunc: A server for predicting protein function from 3D structure.
Nucleic Acids Research, 2005.
Source: structure.
[PDF]
-
Blast2GO: A universal tool for annotation, visualization and analysis in functional genomics research.
Bioinformatics, 2005.
Source: sequence.
[PDF]
-
GOtcha: A new method for prediction of protein function assessed by the annotation of seven genomes.
BMC Bioinformatics, 2004.
Source: sequence.
[PDF]
-
Global protein function prediction from protein-protein interaction networks.
Nature Biotechnology, 2003.
Source: protein-protein network.
[PDF]
-
Predicting protein function from protein/protein interaction data: a probabilistic approach.
Bioinformatics, 2003.
Source: protein-protein network.
[PDF]
Machine Learning-Based Methods
-
NetGO: Improving large-scale protein function prediction with massive network information.
Nucleic Acids Research, 2019.
[PDF]
-
GOLabeler: Improving sequence-based large-scale protein function prediction by learning to rank.
Bioinformatics, 2018.
[PDF]
-
FFPred: an integrated feature-based function prediction server for vertebrate proteomes.
Nucleic Acids Research, 2008.
[PDF]
Deep Learning-Based Methods
-
DeepSS2GO: Protein function prediction from secondary structure.
Briefings in Bioinformatics, 2024.
[PDF]
[Code]
-
MSF-PFP: A novel multisource feature fusion model for protein function prediction.
Journal of Chemical Information and Modeling, 2024.
[PDF]
[Code]
-
Chemical-SA-BiLSTM: Grain protein function prediction based on self-attention mechanism and bidirectional LSTM.
Briefings in Bioinformatics, 2023.
[PDF]
[Code]
-
ProteInfer: Deep neural networks for protein functional inference.
eLife, 2023.
[PDF]
[Code]
-
DeepPFP-CO: A deep learning framework for predicting protein functions with co-occurrence of GO terms.
IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2023.
[PDF]
[Web Server]
-
TripletGO: Integrating transcript expression profiles with protein homology inferences for gene function prediction.
Genomics Proteomics Bioinformatics, 2022.
[PDF]
[Web Server]
-
DeepGOZero: Improving protein function prediction from sequence and zero-shot learning based on ontology axioms.
Bioinformatics, 2022.
[PDF]
[Code]
-
NetGO 2.0: Improving large-scale protein function prediction with massive sequence, text, domain, family and network information.
Nucleic Acids Research, 2021.
[PDF]
-
DeepGOA: A deep learning framework for gene ontology annotations with sequence- and network-based information.
IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2021.
[PDF]
[Code]
-
NetQuilt: Deep multispecies network-based protein function prediction using homology-informed network similarity.
Bioinformatics , 2021.
[PDF]
[Code]
-
MultiPredGO: Deep multi-modal protein function prediction by amalgamating protein structure, sequence, and interaction information.
IEEE Journal of Biomedical and Health Informatics , 2021.
[PDF]
[Code]
-
TALE: Transformer-based protein function Annotation with joint sequence-Label Embedding.
Bioinformatics, 2021.
[PDF]
[Code]
-
DeepGraphGO: graph neural network for large-scale, multispecies protein function prediction.
Bioinformatics, 2021.
[PDF]
[Code]
-
Deep_CNN_LSTM_GO: Protein function prediction from amino-acid sequences.
Computational Biology and Chemistry, 2021.
[PDF]
-
FFPred-GAN: Protein function prediction is improved by creating synthetic feature samples with generative adversarial networks.
Nature Machine Intelligence, 2020.
[PDF]
[Code]
-
DeepAdd: Protein function prediction from k-mer embedding and additional features.
Computational Biology and Chemistry, 2020.
[PDF]
-
DeepGOPlus: Improved protein function prediction from sequence.
Bioinformatics, 2020.
[PDF]
[Web Server]
-
DeepGO: Predicting protein functions from sequence and interactions using a deep ontology-aware classifier.
Bioinformatics, 2018.
[PDF]
[Code]
Pre-Trained Model-Based Methods
-
AnnoPRO: A strategy for protein function annotation based on multi-scale protein representation and a hybrid deep learning of dual-path encoding.
Genome Biology, 2024.
[PDF]
[Code]
-
DualNetGO: A dual network model for protein function prediction via effective feature selection.
Bioinformatics, 2024.
[PDF]
[Code]
-
Domain-PFP: Protein function prediction using function-aware domain embedding representations.
Communications Biology, 2023.
[PDF]
[Code]
-
CFAGO: Cross-fusion of network and attributes based on attention mechanism for protein function prediction.
Bioinformatics, 2023.
[PDF]
[Code]
-
MELISSA: Semi-supervised embedding for protein function prediction across multiple networks
bioRxiv, 2023.
[PDF]
[Code]
-
HiFun: Homology independent protein function prediction by a novel protein-language self-attention model.
Briefings in Bioinformatics, 2023.
[PDF]
[Code]
-
PredGO: Large-scale predicting protein functions through heterogeneous feature fusion.
Briefings in Bioinformatics, 2023.
[PDF]
[Code]
-
MGEGFP: A multi-view graph embedding method for gene function prediction based on adaptive estimation with GCN.
Briefings in Bioinformatics, 2023.
[PDF]
[Code]
-
MMSMAPlus: A multi-view multi-scale multi-attention embedding model for protein function prediction.
Briefings in Bioinformatics, 2023.
[PDF]
[Code]
-
PFmulDL: A novel strategy enabling multi-class and multi-label protein function annotation by integrating diverse deep learning methods.
Computers in Biology and Medicine, 2022.
[PDF]
[Code]
-
DeepFRI: Structure-based protein function prediction using graph convolutional networks.
Nature Communications, 2021.
[PDF]
[Web Server]
[Code]
-
deepNF: Deep network fusion for protein function prediction.
Bioinformatics, 2018.
[PDF]
[Code]
Large Language Model-Based Methods
-
DeepGO-SE: Protein function prediction as approximate semantic entailment.
Nature Machine Intelligence, 2024.
Model: ESM2.
[PDF]
[Code]
-
PhiGnet: Accurate prediction of protein function using statistics-informed graph networks.
Nature Communications, 2024.
Model: ESM-1b.
[PDF]
[Code]
-
GPSFun: Geometry-aware protein sequence function predictions with language models.
Nucleic Acids Research, 2024.
Model: ESM2.
[PDF]
[Web Server]
-
GO-LTR: Protein function prediction through multi‑view multi‑label latent tensor reconstruction.
BMC Bioinformatics, 2024.
Model: ProtTrans.
[PDF]
[Code]
-
TransFew: Improving protein function prediction by learning and integrating representations of protein sequences and function labels.
bioRxiv, 2024.
Model: ESM2.
[PDF]
[Code]
-
DeepGOMeta: Predicting functions for microbes.
bioRxiv, 2024.
Model: ESM2.
[PDF]
[Code]
-
GNNGO3D: Protein function prediction based on 3D structure and functional hierarchy learning.
IEEE Transactions on Knowledge and Data Engineering, 2023.
Model: ESM-1b.
[PDF]
-
Struct2GO: Protein function prediction based on graph pooling algorithm and AlphaFold2 structure information.
Bioinformatics, 2023.
Model: SeqVec.
[PDF]
[Code]
-
SPROF-GO: Fast and accurate protein function prediction from sequence through pretrained language model and homology-based label diffusion.
Briefings in Bioinformatics, 2023.
Model: ProtTrans.
[PDF]
[Code]
-
HNetGO: Protein function prediction via heterogeneous network transformer.
Briefings in Bioinformatics, 2023.
Model: SeqVec.
[PDF]
[Code]
-
NetGO 3.0: A protein language model improves large-scale functional Annotations.
Genomics Proteomics Bioinformatics, 2023.
Model: ESM-1b.
[PDF]
[Web Server]
-
HEAL: Hierarchical graph transformer with contrastive learning for protein function prediction.
Bioinformatics, 2023.
Model: ESM-1b.
[PDF]
[Code]
-
TransFun: Combining protein sequences and structures with transformers and equivariant graph neural networks to predict protein function.
Bioinformatics, 2023.
Model: ESM-1b.
[PDF]
[Web Server]
-
TEMPROT: Protein function annotation using transformers embeddings and homology search.
BMC Bioinformatics, 2023.
Model: ProtBERT-BFD.
[PDF]
[Web Server]
-
PANDA2: Protein function prediction using graph neural networks.
NAR Genomics and Bioinformatics, 2022.
Model: ESM-1b.
[PDF]
[Web Server]
-
ATGO: Integrating unsupervised language model with triplet neural networks for protein gene ontology prediction.
PLoS Computational Biology, 2022.
Model: ESM-1b.
[PDF]
[Web Server]
-
GAT-GO: Accurate protein function prediction via graph attention networks with predicted structure information.
Briefings in Bioinformatics, 2022.
Model: ESM-1b.
[PDF]
-
GOPredSim: Embeddings from deep learning transfer GO annotations beyond homology.
Scientific Reports, 2021.
Model: SeqVec.
[PDF]
[Web Server]
[Code]
|