A probabilistic data analytics methodology based on Bayesian Belief network for predicting and understanding breast cancer survival
dc.contributor.author
Dağ, Aslı Z.
dc.contributor.author
Akcam, Zümrüt
dc.contributor.author
Kibis, Eyyub
dc.contributor.author
Şimşek, Serhat
dc.contributor.author
Delen, Dursun
dc.date.accessioned
2022-03-07T12:49:05Z
dc.date.available
2022-03-07T12:49:05Z
dc.date.issued
2022
en_US
dc.identifier.citation
Dağ, A. Z., Akcam, Z., Kibis, E., Şimşek, S. ve Delen, D. (2022). A probabilistic data analytics methodology based on Bayesian Belief network for predicting and understanding breast cancer survival. Knowledge-Based Systems, 242(108407). https://doi.org/10.1016/j.knosys.2022.108407
en_US
dc.identifier.issn
0950-7051
dc.identifier.uri
https://doi.org/10.1016/j.knosys.2022.108407
dc.identifier.uri
https://hdl.handle.net/20.500.12154/1716
dc.description.abstract
Understanding breast cancer survival has proven to be a challenging problem for practitioners and researchers. Identifying the factors affecting cancer progression, their interrelationships, and their influence on patients’ long-term survival helps make timely treatment decisions. The current study addresses this problem by proposing a Tree-Augmented Bayesian Belief Network (TAN)-based data analytics methodology comprising of four steps: data acquisition and preprocessing, variable selection via Genetic Algorithm (GA), data balancing with synthetic minority over-sampling and random undersampling methods, and finally the development of the TAN model to determine the probabilistic inter-conditional dependency structure among breast cancer-related variables along with the posterior survival probabilities The proposed model is compared to well-known machine learning models. A what-if analysis has also been conducted to verify the associations among the variables in the TAN model. The relative importance of each variable has been investigated via sensitivity analysis. Finally, a decision support tool is developed to further explore the conditional dependency structure among the cancer-related factors. The results produced by the proposed methodology, namely the patientspecific posterior survival probabilities and the conditional relationships among the variables, can be used by healthcare professionals and physicians to improve the decision-making process in planning and managing breast cancer treatments. Our generic methodology can also accommodate other types of cancer and be applied to manage various medical procedures.
en_US
dc.language.iso
eng
en_US
dc.publisher
Elsevier
en_US
dc.relation.ispartof
Knowledge-Based Systems
en_US
dc.rights
info:eu-repo/semantics/openAccess
en_US
dc.subject
Breast Cancer
en_US
dc.subject
Data Mining
en_US
dc.subject
Genetic Algorithm
en_US
dc.subject
Machine Learning
en_US
dc.subject
Sensitivity Analysis
en_US
dc.title
A probabilistic data analytics methodology based on Bayesian Belief network for predicting and understanding breast cancer survival
en_US
dc.type
article
en_US
dc.department
İHÜ, Yönetim Bilimleri Fakültesi, İşletme Bölümü
en_US
dc.authorid
0000-0001-8857-5148
en_US
dc.identifier.volume
242
en_US
dc.ihuauthorid
0000-0001-8857-5148
en_US
dc.relation.ihupublicationcategory
114
en_US
dc.relation.publicationcategory
Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı