A Novel SHAP-GAN Network for Interpretable Ovarian Cancer Diagnosis

Authors:

Cai, Jingxun ^[1] | Lee, Zne-Jung ^[2] | Lin, Zhihxian ^[3] | Yang, Ming-Ren ^[4]

Indexed by:

Scopus; SCIE

Abstract:

Ovarian cancer stands out as one of the most formidable adversaries in women's health, largely due to its typically subtle and nonspecific early symptoms, which pose significant challenges to early detection and diagnosis. Although existing diagnostic methods, such as biomarker testing and imaging, can help with early diagnosis to some extent, these methods still have limitations in sensitivity and accuracy, often leading to misdiagnosis or missed diagnosis. Ovarian cancer's high heterogeneity and complexity increase diagnostic challenges, especially in disease progression prediction and patient classification. Machine learning (ML) has outperformed traditional methods in cancer detection by processing large datasets to identify patterns missed by conventional techniques. However, existing AI models still struggle with accuracy in handling imbalanced and high-dimensional data, and their "black-box" nature limits clinical interpretability. To address these issues, this study proposes SHAP-GAN, an innovative diagnostic model for ovarian cancer that integrates Shapley Additive exPlanations (SHAP) with Generative Adversarial Networks (GANs). The SHAP module quantifies each biomarker's contribution to the diagnosis, while the GAN component optimizes medical data generation. This approach tackles three key challenges in medical diagnosis: data scarcity, model interpretability, and diagnostic accuracy. Results show that SHAP-GAN outperforms traditional methods in sensitivity, accuracy, and interpretability, particularly with high-dimensional and imbalanced ovarian cancer datasets. The top three influential features identified are PRR11, CIAO1, and SMPD3, which exhibit wide SHAP value distributions, highlighting their significant impact on model predictions.
The SHAP-GAN network has demonstrated an impressive accuracy rate of 99.34% on the ovarian cancer dataset, significantly outperforming baseline algorithms, including Support Vector Machines (SVM), Logistic Regression (LR), and XGBoost. Specifically, SVM achieved an accuracy of 72.78%, LR achieved 86.09%, and XGBoost achieved 96.69%. These results highlight the superior performance of SHAP-GAN in handling high-dimensional and imbalanced datasets. Furthermore, SHAP-GAN significantly alleviates the challenges associated with intricate genetic data analysis, empowering medical professionals to tailor personalized treatment strategies for individual patients.
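
The abstract states that the SHAP module quantifies each biomarker's contribution to a prediction. As a rough illustration of the underlying idea only (not the authors' implementation), the sketch below computes exact Shapley values for a single sample by enumerating feature coalitions. The function `shapley_values`, the scoring function `toy_model`, its weights, and the sample and baseline values are all hypothetical and chosen purely for illustration; only the three feature names come from the abstract.

```python
from itertools import combinations
from math import factorial


def shapley_values(model, x, baseline):
    """Exact Shapley values for one sample.

    Features outside a coalition are replaced by their baseline value.
    Cost grows exponentially with the number of features, so this is
    only practical for a handful of features.
    """
    names = list(x)
    n = len(names)

    def value(coalition):
        # Evaluate the model with coalition features taken from x
        # and all other features taken from the baseline.
        mixed = {k: (x[k] if k in coalition else baseline[k]) for k in names}
        return model(mixed)

    phi = {}
    for name in names:
        others = [k for k in names if k != name]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (value(s | {name}) - value(s))
        phi[name] = total
    return phi


def toy_model(f):
    # Hypothetical scoring function; the weights are invented, not fitted.
    return 0.8 * f["PRR11"] - 0.5 * f["CIAO1"] + 0.3 * f["PRR11"] * f["SMPD3"]


# Made-up expression values for one sample and an all-zero baseline.
sample = {"PRR11": 2.0, "CIAO1": 1.0, "SMPD3": 1.5}
baseline = {"PRR11": 0.0, "CIAO1": 0.0, "SMPD3": 0.0}

phi = shapley_values(toy_model, sample, baseline)
for name, contribution in phi.items():
    print(f"{name}: {contribution:+.3f}")
```

The contributions sum to the difference between the model output for the sample and for the baseline, which is the additivity property that gives SHAP its name. Practical SHAP tooling approximates these values rather than enumerating every coalition.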

Keywords:

extreme gradient boosting algorithm; feature selection; generative adversarial networks; ovarian cancer; SHAP

Affiliations:

[1] [Cai, Jingxun] Fuzhou Univ, Grad Sch New Generat Elect Informat Engineer, Sch Adv Mfg, Quanzhou 362200, Peoples R China
[2] [Lee, Zne-Jung] Fuzhou Univ, Sch Adv Mfg, Dept Elect & Informat Engn, Quanzhou 362200, Peoples R China
[3] [Lin, Zhihxian] Fuzhou Univ, Sch Adv Mfg, Dept Elect & Informat Engn, Quanzhou 362200, Peoples R China
[4] [Yang, Ming-Ren] Taipei Med Univ, Grad Inst Biomed Informat, Coll Med Sci & Technol, Taipei 235, Taiwan

Reprint Address:

[Lee, Zne-Jung] Fuzhou Univ, Sch Adv Mfg, Dept Elect & Informat Engn, Quanzhou 362200, Peoples R China
[Lin, Zhihxian] Fuzhou Univ, Sch Adv Mfg, Dept Elect & Informat Engn, Quanzhou 362200, Peoples R China

Email:

lemoonwwc@gmail.com
johnlee@fzu.edu.cn
lzx2005000@163.com
tmu_ymz30@tmu.edu.tw


Version: