CVPR2025

Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models

Itay Benou, Tammy Riklin Raviv

Abstract

Input Image "a hat" "long, shaggy hair" "an intelligent expression" SALF-CBM SALF-CBM "a small, dainty dog" "a ball" "a pot" Figure 1. Concept maps generated by our SALF-CBM. Inspired by human visual interpretation, our method first decomposes input images into spatially-localized structures, associated with familiar concepts, independent of a specific task. Explainability of task-specific outputs is obtained by training a final task layer on-top of these maps.