Infolitika Journal of Data Science
Vol. 1 No. 1 (2023): September 2023

Ensemble Machine Learning Approach for Quantitative Structure Activity Relationship Based Drug Discovery: A Review

Teuku Rizky Noviandy (Department of Informatics, Faculty of Mathematics and Natural Sciences, Universitas Syiah Kuala, Banda Aceh 23111, Indonesia)
Aga Maulana (Department of Informatics, Faculty of Mathematics and Natural Sciences, Universitas Syiah Kuala, Banda Aceh 23111, Indonesia)
Ghazi Mauer Idroes (Graduate School of Mathematics and Applied Sciences, Universitas Syiah Kuala, Banda Aceh 23111, Indonesia
Department of Occupational Health and Safety, Faculty of Health Sciences, Universitas Abulyatama, Aceh Besar 23372, Indonesia)

Talha Bin Emran (Department of Pharmacy, BGC Trust University Bangladesh, Chittagong 4381, Bangladesh)
Trina Ekawati Tallei (Department of Biology, Faculty of Mathematics and Natural Sciences, Sam Ratulangi University, Manado 95115, North Sulawesi, Indonesia)
Zuchra Helwani (Department of Chemical Engineering, Universitas Riau Pekanbaru 28293, Indonesia)
Rinaldi Idroes (Department of Chemistry, Faculty of Mathematics and Natural Sciences, Universitas Syiah Kuala, Banda Aceh 23111, Indonesia)



Article Info

Publish Date
25 Sep 2023

Abstract

This comprehensive review explores the pivotal role of ensemble machine learning techniques in Quantitative Structure-Activity Relationship (QSAR) modeling for drug discovery. It emphasizes the significance of accurate QSAR models in streamlining candidate compound selection and highlights how ensemble methods, including AdaBoost, Gradient Boosting, Random Forest, Extra Trees, XGBoost, LightGBM, and CatBoost, effectively address challenges such as overfitting and noisy data. The review presents recent applications of ensemble learning in both classification and regression tasks within QSAR, showcasing the exceptional predictive accuracy of these techniques across diverse datasets and target properties. It also discusses the key challenges and considerations in ensemble QSAR modeling, including data quality, model selection, computational resources, and overfitting. The review outlines future directions in ensemble QSAR modeling, including the integration of multi-modal data, explainability, handling imbalanced data, automation, and personalized medicine applications while emphasizing the need for ethical and regulatory guidelines in this evolving field.

Copyrights © 2023






Journal Info

Abbrev

ijds

Publisher

Subject

Computer Science & IT Decision Sciences, Operations Research & Management Engineering

Description

Infolitika Journal of Data Science is a distinguished international scientific journal that showcases high caliber original research articles and comprehensive review papers in the field of data science. The journals core mission is to stimulate interdisciplinary research collaboration, facilitate ...