VLDB 2021

How Divergent Is Your Data?

Eliana Pastor, Andrew Gavgavian, Elena Baralis, Luca de Alfaro

Cited 21 times

Abstract

We present DivExplorer, a tool that enables users to explore datasets and find subgroups of data for which a classifier behaves anomalously. These subgroups, called divergent subgroups, may exhibit, for example, higher-than-normal false positive or false negative rates. DivExplorer can be used to analyze and debug classifiers. If the data has ethical or social implications, DivExplorer can also be used to identify bias in classifiers.
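
To make the idea of a divergent subgroup concrete, the following is a minimal sketch in plain Python. It is not DivExplorer's API or implementation. It assumes that a subgroup is a conjunction of attribute=value pairs, that divergence is the subgroup's false positive rate minus the false positive rate on the whole dataset, and that subgroups below a minimum support are ignored. The function names, the `min_support` and `max_len` parameters, and the toy records are all hypothetical.

```python
from itertools import combinations


def false_positive_rate(rows):
    """FPR = FP / (FP + TN), computed over rows whose true label is 0."""
    negatives = [r for r in rows if r["label"] == 0]
    if not negatives:
        return None  # undefined when the group has no true negatives
    false_positives = sum(1 for r in negatives if r["pred"] == 1)
    return false_positives / len(negatives)


def subgroup_divergence(rows, attributes, min_support=0.1, max_len=2):
    """Rank attribute=value subgroups by FPR divergence (assumed definition:
    subgroup FPR minus overall FPR). Brute-force enumeration, for illustration."""
    overall = false_positive_rate(rows)
    results = []
    for k in range(1, max_len + 1):
        for attrs in combinations(attributes, k):
            # Group rows by the values they take on the chosen attributes.
            groups = {}
            for r in rows:
                key = tuple((a, r[a]) for a in attrs)
                groups.setdefault(key, []).append(r)
            for key, members in groups.items():
                support = len(members) / len(rows)
                if support < min_support:
                    continue  # skip subgroups that are too small
                fpr = false_positive_rate(members)
                if fpr is None:
                    continue
                results.append((key, support, fpr - overall))
    # Largest positive divergence first.
    results.sort(key=lambda item: item[2], reverse=True)
    return results


# Hypothetical toy data: "label" is the ground truth, "pred" the classifier output.
rows = [
    {"sex": "F", "age": "<30", "label": 0, "pred": 1},
    {"sex": "F", "age": "<30", "label": 0, "pred": 1},
    {"sex": "F", "age": ">=30", "label": 0, "pred": 0},
    {"sex": "F", "age": ">=30", "label": 1, "pred": 1},
    {"sex": "M", "age": "<30", "label": 0, "pred": 0},
    {"sex": "M", "age": "<30", "label": 0, "pred": 0},
    {"sex": "M", "age": ">=30", "label": 0, "pred": 1},
    {"sex": "M", "age": ">=30", "label": 1, "pred": 1},
]

ranked = subgroup_divergence(rows, ["sex", "age"])
for itemset, support, divergence in ranked:
    print(itemset, f"support={support:.2f}", f"divergence={divergence:+.2f}")
```

The brute-force enumeration is only practical for a handful of attributes; the sketch is meant to show what is being measured, not how to compute it at scale.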