Canonical Causal DAGs: Node Contraction as Edge Addition
Summary
The canonical causal DAG of a summary DAG is obtained by decomposing the summary DAG’s cluster nodes back into distinct nodes connected by edges. Theorem 4.1 shows that the RB of is equivalent to that of . This means node contraction is precisely equivalent to adding edges to the input DAG — the number of added edges quantifies information loss. This connection enables CaGReS to optimize summarization by minimizing edge additions.
Overview
Section 4 establishes the key theoretical connection between summary DAGs and edge additions. This connection is what makes the CaGReS algorithm tractable: instead of searching over all possible node partitions, CaGReS can evaluate the cost of each contraction by counting the edges that would be added to the canonical DAG.
Main Content
4.1 The Canonical Causal DAG
CI Sets Equivalence
Let and denote two sets of CIs over variables . We say if every CI can be derived from using the semi-graphoid axioms, and vice versa. We say is equivalent to , written .
Canonical Causal DAG (Definition 5)
Let be a summary DAG for a causal DAG . Let denote a complete topological order over .
The canonical causal DAG associated with is defined as:
- (same nodes as original)
- if and only if:
- , or
- , or
- and
In other words: contains all original edges, plus edges induced by the summary structure, plus a total order within each cluster (the second and third conditions).
Intuition: The canonical DAG is a supergraph of the original . All edges of are preserved; additional edges are added between nodes that now belong to the same cluster or whose clusters are connected in . The canonical DAG is always compatible with .
Example: Canonical DAG Construction (Fig. 5)
Consider a causal DAG (Fig. 3a, from the paper’s Fig. 3) and a 3-node summary . After contracting nodes and into cluster :
The canonical causal DAG keeps all original edges and adds:
- (within-cluster order, since topologically)
- Any edges needed to make the cluster’s neighborhood consistent
Note: Fig. 5c shows the canonical DAG has more edges than Fig. 5a (original) — the difference is the information loss measure.
4.2 Key Theorem: RB Equivalence
Theorem 4.1 — Node Contraction ≡ Edge Addition
Let be a summary DAG for causal DAG , and let be its corresponding canonical causal DAG.
The Recursive Basis of equals the Recursive Basis of :
Equivalently, the set of CIs encoded by is equivalent to the set of CIs encoded by .
Corollary: Optimizing over summary DAGs is equivalent to optimizing over edge additions to . The number of added edges is a valid proxy for information loss in causal inference.
Proof intuition: The RB of (a DAG over clusters) says each cluster-node is independent of its non-descendant clusters given its parent clusters. When expanded back to individual nodes via the canonical DAG, this is precisely captured by the within-cluster and between-cluster edges added to form .
Significance: This theorem is the theoretical foundation of CaGReS. It transforms the summarization problem from searching over exponentially many node partitions to the more tractable problem of greedily minimizing edge additions — since each contraction’s cost can be computed directly on the canonical DAG.
Why Adding Edges ≠ Destroying Causal Information
An important asymmetry in causal DAGs:
- Adding edges to a causal DAG indicates potential causal dependence — never asserts false independence. The identified causal effects may be more conservative (larger adjustment sets) but remain valid.
- Removing edges from a causal DAG incorrectly implies conditional independence — can lead to biased estimates if a true confounder is excluded.
Pearl (2009) already observed: “The addition of arcs to a causal diagram can never assist, the identification of causal effects in nonparametric models.” Adding edges leads to a more conservative but never incorrect causal model.
Therefore, the canonical DAG (a supergraph of ) is a valid causal model — it never introduces incorrect CI assumptions, only potential new dependencies.
Connections
- Directly extends Summary Causal DAGs — the canonical DAG operationalizes the abstract definition of summary DAG compatibility.
- The RB equivalence connects to Directed Acyclic Graphs — the RB is a standard concept in graphical models, and this theorem extends it to the summarization setting.
- The edge-addition cost measure feeds into CaGReS Algorithm — the GetCost procedure computes exactly for a proposed contraction.
See Also
- Summary Causal DAGs — the summary DAG definition and constraints
- CaGReS Algorithm — exploits Theorem 4.1 for efficient greedy search
- s-Separation in Summary DAGs — builds on the canonical DAG for CI identification
- Do-Calculus in Summary Causal DAGs — interventional identification in summary DAGs; relies on canonical DAG validity
- Directed Acyclic Graphs — foundational d-separation and back-door criterion that canonical DAGs must preserve