기본 정보
연구 분야
프로젝트
논문
구성원
article|
인용수 0
·2026
SGA-DT: An adaptive fusion framework for missing data imputation and interpretable healthcare classification
Monalisa Jena, Satchidananda Dehuri, Sung-Bae Cho
IF 2.6PLoS ONE
초록

Despite advances in machine learning and medical data processing, handling missing values remains a critical and complex challenge in healthcare analytics. Missing data, especially in non-class attributes can severely compromise model accuracy, clinical reliability, and interpretability. In sensitive domains such as healthcare, improper imputation may lead to biased outcomes or delayed interventions. To address this challenge, we propose SGA-DT, an adaptive and interpretable learning framework that combines the best features of genetically optimized support vector regression (SVR) with a decision tree (DT) classifier for robust healthcare prediction. The framework adaptively selects an imputation strategy based on the level of missingness. It uses standard SVR for low, iterative SVR for moderate, and k-Nearest Neighbor (KNN) followed by SVR refinement for high missingness. Genetic algorithm (GA) is used to select the best SVR kernel and tune its hyperparameters, enhancing imputation accuracy across different data patterns. The complete dataset is then classified using DT, providing both robustness and transparency in prediction. The SGA-DT framework is evaluated on three healthcare datasets, Breast Cancer, Mammographic, and Hepatitis, along with other real-world and synthetic datasets. For interpretability analysis, decision trees are generated under varying missingness levels to support clinical transparency. Comparative results show that SGA-DT consistently outperforms multiple integrated frameworks across accuracy, precision, recall, and F-measure, demonstrating its robustness, interpretability, and generalizability in healthcare prediction tasks.

키워드
InterpretabilityImputation (statistics)Missing dataGeneralizability theorySupport vector machineDecision treeRobustness (evolution)Overfitting
타입
article
IF / 인용수
2.6 / 0
게재 연도
2026

주식회사 디써클

대표 장재우,이윤구서울특별시 강남구 역삼로 169, 명우빌딩 2층 (TIPS타운 S2)대표 전화 0507-1312-6417이메일 info@rndcircle.io사업자등록번호 458-87-03380호스팅제공자 구글 클라우드 플랫폼(GCP)

© 2026 RnDcircle. All Rights Reserved.