- Kim, Ungsik;
- Bae, Jiho;
- Choi, Sang-Min;
- Lee, Suwon
Abstract
Argumentative explainable artificial intelligence employs argumentation theory to explain the mechanisms of machine learning. Previous approaches for explaining deep learning models collectively compressed layers via clustering. However, this resulted in accumulated information loss across layers, thereby degrading the fidelity of explanations. We propose online activation value-aware clustering and aggregation, a compression algorithm that preserves the inference structure of the original neural network with greater fidelity. The proposed method sequentially compresses each layer, immediately recalculates activation values following compression, and rectifies inter-layer information loss using a singular-value-scaled ridge alignment approach. To evaluate the effectiveness of the proposed method, we introduce four novel quantitative metrics. Input-output fidelity and structural fidelity measure how accurately the compressed model preserves the original model predictions and internal activations. Input-output perturbation consistency and structural perturbation consistency assess the similarity of the changes induced by Gaussian-perturbed input data. Experiments on three benchmark datasets (Breast Cancer, California Housing, and HIGGS) show that our method achieves improvements ranging from 12.9% to 53.7% across the four metrics, indicating significantly higher explanation fidelity than existing approaches.
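The layer-by-layer procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a small ReLU multilayer perceptron, a hand-rolled k-means over neuron activation vectors as the clustering step, and plain ridge regression as a stand-in for the singular-value-scaled ridge alignment, whose details the abstract does not give. All function names (`kmeans_labels`, `compress_sequentially`, `io_fidelity`) and the fidelity formula are hypothetical.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def forward(weights, biases, X):
    """Forward pass: ReLU hidden layers, linear output layer."""
    h = X
    for W, b in zip(weights[:-1], biases[:-1]):
        h = relu(h @ W + b)
    return h @ weights[-1] + biases[-1]

def kmeans_labels(points, k, iters=20, seed=0):
    """Tiny k-means (assumed clustering step); one label per row of `points`."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        dists = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for c in range(k):
            if np.any(labels == c):
                centers[c] = points[labels == c].mean(axis=0)
    return labels

def compress_sequentially(weights, biases, X, k, lam=1e-2):
    """Compress hidden layers one at a time, recomputing activations after each."""
    # Hidden activations of the original network, used as alignment targets.
    orig_acts, h = [], X
    for W, b in zip(weights[:-1], biases[:-1]):
        h = relu(h @ W + b)
        orig_acts.append(h)

    new_W, new_b = [], []
    h_c = X              # activations of the compressed network so far
    W_in = weights[0]    # weights into the layer currently being compressed
    for i in range(len(weights) - 1):
        # Activations of layer i as seen from the already-compressed prefix.
        a = relu(h_c @ W_in + biases[i])
        labels = kmeans_labels(a.T, k)  # cluster neurons by activation pattern
        groups = [np.flatnonzero(labels == c) for c in np.unique(labels)]
        # Aggregate each cluster into one neuron by averaging its parameters.
        W_c = np.stack([W_in[:, g].mean(axis=1) for g in groups], axis=1)
        b_c = np.array([biases[i][g].mean() for g in groups])
        new_W.append(W_c)
        new_b.append(b_c)
        # Recompute activations immediately after compressing this layer.
        h_c = relu(h_c @ W_c + b_c)
        # Plain ridge fit (stand-in for the paper's alignment step): refit the
        # outgoing weights so the compressed activations reproduce the original
        # next-layer pre-activations (bias excluded).
        target = orig_acts[i] @ weights[i + 1]
        gram = h_c.T @ h_c + lam * np.eye(h_c.shape[1])
        W_in = np.linalg.solve(gram, h_c.T @ target)
    new_W.append(W_in)
    new_b.append(biases[-1])
    return new_W, new_b

def io_fidelity(y_orig, y_comp):
    """Hypothetical score: 1 - relative squared error (1.0 means identical outputs)."""
    num = np.sum((y_orig - y_comp) ** 2)
    den = np.sum((y_orig - y_orig.mean(axis=0)) ** 2)
    return 1.0 - num / den

# Usage on a random 8-16-16-1 network with synthetic inputs.
rng = np.random.default_rng(0)
sizes = [8, 16, 16, 1]
weights = [rng.normal(size=(m, n)) / np.sqrt(m) for m, n in zip(sizes[:-1], sizes[1:])]
biases = [rng.normal(scale=0.1, size=n) for n in sizes[1:]]
X = rng.normal(size=(200, 8))

new_W, new_b = compress_sequentially(weights, biases, X, k=6)
y_orig = forward(weights, biases, X)
y_comp = forward(new_W, new_b, X)
score = io_fidelity(y_orig, y_comp)
```

Because each layer is clustered on activations recomputed from the already-compressed prefix, errors introduced upstream are visible to the next clustering and alignment step rather than accumulating unseen. That is the motivation the abstract gives for sequential compression over compressing all layers at once.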
- Title
- Online Activation Value-aware Clustering and Aggregation for Faithful Argumentative Explanations
- Authors
- Kim, Ungsik; Bae, Jiho; Choi, Sang-Min; Lee, Suwon
- Issue Date
- 2025-11
- Type
- Conference Paper
- Journal
- CIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management
- Pages
- 1386-1395