Canonical Causal DAGs: Node Contraction as Edge Addition

Summary

The canonical causal DAG $G_{H}$ of a summary DAG $H$ is obtained by decomposing the summary DAG’s cluster nodes back into distinct nodes connected by edges. Theorem 4.1 shows that the RB of $H$ is equivalent to that of $G_{H}$ . This means node contraction is precisely equivalent to adding edges to the input DAG — the number of added edges quantifies information loss. This connection enables CaGReS to optimize summarization by minimizing edge additions.

Overview

Section 4 establishes the key theoretical connection between summary DAGs and edge additions. This connection is what makes the CaGReS algorithm tractable: instead of searching over all possible node partitions, CaGReS can evaluate the cost of each contraction by counting the edges that would be added to the canonical DAG.

Main Content

4.1 The Canonical Causal DAG

CI Sets Equivalence

Let $S$ and $T$ denote two sets of CIs over variables ${X_{1}, \dots, X_{n}}$ . We say $S \equiv T$ if every CI $σ \in S$ can be derived from $T$ using the semi-graphoid axioms, and vice versa. We say $T$ is equivalent to $S$ , written $S \Leftrightarrow T$ .

Canonical Causal DAG (Definition 5)

Let $(H, f)$ be a summary DAG for a causal DAG $G$ . Let $H$ denote a complete topological order over $V (H)$ .

The canonical causal DAG $G_{H}$ associated with $(H, f)$ is defined as:

$V (G_{H}) = V (G)$ (same nodes as original)

$(X_{i}, X_{j}) \in E (G_{H})$ if and only if:

$(X_{i}, X_{j}) \in E (G)$ , or

$(f (X_{i}), f (X_{j})) \in E (H)$ , or

$f (X_{i}) = f (X_{j})$ and $i < j$

In other words: $G_{H}$ contains all original edges, plus edges induced by the summary structure, plus a total order within each cluster (the second and third conditions).

Intuition: The canonical DAG $G_{H}$ is a supergraph of the original $G$ . All edges of $G$ are preserved; additional edges are added between nodes that now belong to the same cluster or whose clusters are connected in $H$ . The canonical DAG is always compatible with $H$ .

Example: Canonical DAG Construction (Fig. 5)

Consider a causal DAG $G_{1}$ (Fig. 3a, from the paper’s Fig. 3) and a 3-node summary $H_{1}$ . After contracting nodes $B$ and $C$ into cluster $H_{1} = BC$ :

The canonical causal DAG $G_{H_{1}}$ keeps all original edges and adds:

$B \to C$ (within-cluster order, since $B < C$ topologically)

Any edges needed to make the cluster’s neighborhood consistent

Note: Fig. 5c shows the canonical DAG has more edges than Fig. 5a (original) — the difference $∣ E (G_{H}) ∣ - ∣ E (G) ∣$ is the information loss measure.

4.2 Key Theorem: RB Equivalence

Theorem 4.1 — Node Contraction ≡ Edge Addition

Let $(H, f)$ be a summary DAG for causal DAG $G$ , and let $G_{H}$ be its corresponding canonical causal DAG.

The Recursive Basis of $H$ equals the Recursive Basis of $G_{H}$ :
$RB_{XRB} (H) = RB_{G_{H}} (G)$
Equivalently, the set of CIs encoded by $H$ is equivalent to the set of CIs encoded by $G_{H}$ .

Corollary: Optimizing over summary DAGs $H$ is equivalent to optimizing over edge additions to $G$ . The number of added edges $∣ E (G_{H}) ∣ - ∣ E (G) ∣$ is a valid proxy for information loss in causal inference.

Proof intuition: The RB of $H$ (a DAG over clusters) says each cluster-node is independent of its non-descendant clusters given its parent clusters. When expanded back to individual nodes via the canonical DAG, this is precisely captured by the within-cluster and between-cluster edges added to form $G_{H}$ .

Significance: This theorem is the theoretical foundation of CaGReS. It transforms the summarization problem from searching over exponentially many node partitions to the more tractable problem of greedily minimizing edge additions — since each contraction’s cost can be computed directly on the canonical DAG.

Why Adding Edges ≠ Destroying Causal Information

An important asymmetry in causal DAGs:

Adding edges to a causal DAG indicates potential causal dependence — never asserts false independence. The identified causal effects may be more conservative (larger adjustment sets) but remain valid.
Removing edges from a causal DAG incorrectly implies conditional independence — can lead to biased estimates if a true confounder is excluded.

Pearl (2009) already observed: “The addition of arcs to a causal diagram can never assist, the identification of causal effects in nonparametric models.” Adding edges leads to a more conservative but never incorrect causal model.

Therefore, the canonical DAG $G_{H}$ (a supergraph of $G$ ) is a valid causal model — it never introduces incorrect CI assumptions, only potential new dependencies.

Connections

Directly extends Summary Causal DAGs — the canonical DAG operationalizes the abstract definition of summary DAG compatibility.
The RB equivalence connects to Directed Acyclic Graphs — the RB is a standard concept in graphical models, and this theorem extends it to the summarization setting.
The edge-addition cost measure feeds into CaGReS Algorithm — the GetCost procedure computes exactly $∣ E (G_{H}) ∣ - ∣ E (G) ∣$ for a proposed contraction.

Second Brain

Explorer

Canonical Causal DAGs: Node Contraction as Edge Addition

Canonical Causal DAGs: Node Contraction as Edge Addition

Overview

Main Content

4.1 The Canonical Causal DAG

4.2 Key Theorem: RB Equivalence

Why Adding Edges ≠ Destroying Causal Information

Connections

See Also

Graph View

Table of Contents

Backlinks