Amazing technological breakthrough possible @S-Logix pro@slogix.in

Office Address

  • #5, First Floor, 4th Street Dr. Subbarayan Nagar Kodambakkam, Chennai-600 024 Landmark : Samiyar Madam
  • pro@slogix.in
  • +91- 81240 01111

Social List

Machine Learning Approach to Gene Essentiality Prediction: A Review - 2021

machine-learning-approach-to-gene-essentiality-prediction-a-review.jpg

Machine Learning Approach to Gene Essentiality Prediction: A Review | S-Logix

Research Area:  Machine Learning

Abstract:

Essential genes are critical for the growth and survival of any organism. The machine learning approach complements the experimental methods to minimize the resources required for essentiality assays. Previous studies revealed the need to discover relevant features that significantly classify essential genes, improve on the generalizability of prediction models across organisms, and construct a robust gold standard as the class label for the train data to enhance prediction. Findings also show that a significant limitation of the machine learning approach is predicting conditionally essential genes. The essentiality status of a gene can change due to a specific condition of the organism. This review examines various methods applied to essential gene prediction task, their strengths, limitations and the factors responsible for effective computational prediction of essential genes. We discussed categories of features and how they contribute to the classification performance of essentiality prediction models. Five categories of features, namely, gene sequence, protein sequence, network topology, homology and gene ontology-based features, were generated for Caenorhabditis elegans to perform a comparative analysis of their essentiality prediction capacity. Gene ontology-based feature category outperformed other categories of features majorly due to its high correlation with the genes biological functions. However, the topology feature category provided the highest discriminatory power making it more suitable for essentiality prediction. The major limiting factor of machine learning to predict essential genes conditionality is the unavailability of labeled data for interest conditions that can train a classifier. Therefore, cooperative machine learning could further exploit models that can perform well in conditional essentiality predictions.

Keywords:  
Essential genes
Essential proteins
Feature selection
Supervised learning
Conditional essentiality
Conditionally essential genes

Author(s) Name:  Olufemi Aromolaran, Damilare Aromolaran , Itunuoluwa Isewon

Journal name:  Briefings in Bioinformatics

Conferrence name:  

Publisher name:  Oxford Academic

DOI:  10.1093/bib/bbab128

Volume Information:  Volume 22