ICLR2025

Rethinking Shapley Value for Negative Interactions in Non-convex Games

Wonjoon Chang, Myeongjin Lee, Jaesik Choi

Abstract

An axiom-based solution in cooperative games. -The Shapley value ๐œ™ ! ๐‘ฃ calculates the average change in the model output ๐‘ฃ(โ‹…) according to the participation of the target feature ๐‘–. Motivation Feature Attribution & Shapley value โ€ข Theoretically, most feature attributions are grounded in the Shapley value. 3 [๐‘ฃ ๐‘† โˆช ๐‘– -๐‘ฃ ๐‘† ] โ€ข ๐‘ฃ : game (or model output) โ€ข ๐‘ : a set of the entire players (or features) โ€ข ๐‘›, ๐‘  : the size of ๐‘, ๐‘†