NeurIPS2020

Active Structure Learning of Causal DAGs via Directed Clique Trees

Chandler Squires, Sara Magliacane, Kristjan H. Greenewald, Dmitriy Katz, Murat Kocaoglu, Karthikeyan Shanmugam

被引用 41 次

摘要

A growing body of work has begun to study intervention design for efficient structure learning of causal directed acyclic graphs (DAGs). A typical setting is a causally sufficient setting, i.e. a system with no latent confounders, selection bias, or feedback, when the essential graph of the observational equivalence class (EC) is given as an input and interventions are assumed to be noiseless. Most existing works focus on worst-case or average-case lower bounds for the number of interventions required to orient a DAG. These worst-case lower bounds only establish that the largest clique in the essential graph could make it difficult to learn the true DAG. In this work, we develop a universal lower bound for singlenode interventions that establishes that the largest clique is always a fundamental impediment to structure learning. Specifically, we present a decomposition of a DAG into independently orientable components through directed clique trees and use it to prove that the number of single-node interventions necessary to orient any DAG in an EC is at least the sum of half the size of the largest cliques in each chain component of the essential graph. Moreover, we present a two-phase intervention design algorithm that, under certain conditions on the chordal skeleton, matches the optimal number of interventions up to a multiplicative logarithmic factor in the number of maximal cliques. We show via synthetic experiments that our algorithm can scale to much larger graphs than most of the related work and achieves better worst-case performance than other scalable approaches. 1 Preliminaries We briefly review our notation and terminology for graphs. A mixed graph G is a tuple of vertices V (G), directed edges D(G), bidirected edges B(G), and undirected edges U (G). Directed, bidirected, and undirected edges between vertices i and j in G are denoted i → G j, i ↔ G j, and i -G j, respectively. We use asterisks as wildcards for edge endpoints, e.g., i * → G j denotes either i → G j or i ↔ G j. A directed cycle in a mixed graph is a sequence of edges i * → G . . . * → G i with at least one directed edge. A mixed graph is a chain graph if it has no directed cycles and B(G) = ∅, and a chain graph is called a directed acyclic graph (DAG) if we also have U (G) = ∅. An undirected graph is a mixed graph with B(G) = ∅ and D(G) = ∅. DAGs and (I-)Markov equivalence. DAGs are used to represent causal models (Pearl, 2009) . Each vertex i is associated with a random variable X i . The skeleton of graph D, skel(D), is the undirected graph with the same vertices and adjacencies as D. A distribution f is Markov w.r.t. a DAG D if it