Do-Calculus in Summary Causal DAGs: Soundness, Completeness, and ATE Computation

Summary

Section 6 proves that Pearl’s do-calculus rules remain sound (Theorem 6.1) and complete (Theorem 6.2) when applied to summary causal DAGs. This means causal effect identification — including ATE computation — can be performed directly on the summary DAG rather than the original. The proof relies on the equivalence between the RB of the summary DAG and its canonical DAG (Theorem 4.1). ATE computation on summary DAGs uses the same backdoor/do-calculus machinery as on original DAGs, with adjustment sets derived from s-separation.

Overview

The fundamental utility of causal DAGs is enabling causal effect identification via do-calculus. If summarization invalidated do-calculus, summary DAGs would be useless for inference. Section 6 shows this does not happen: the do-calculus rules remain sound and complete for summary causal DAGs, making them directly usable for causal inference.

Main Content

Background: Do-Calculus

Pearl’s do-calculus consists of three rules for manipulating interventional distributions . These rules allow deriving interventional distributions from observational data when identification is possible.

ATE via Do-Calculus

The Average Treatment Effect (ATE) of treatment on outcome is:

To estimate ATE from observational data, one must control for confounding variables — variables that affect both and — using the backdoor criterion or do-calculus.

The backdoor criterion requires finding a set of variables that blocks all backdoor paths from to (paths that start with an edge into ). If such exists:

6.1 Do-Calculus Soundness in Summary Causal DAGs

Theorem 6.1 — Soundness of Do-Calculus for Summary Causal DAGs

Let be a summary causal DAG for , and let be the corresponding canonical causal DAG. Let denote the interventional distribution compatible with the summary causal DAG .

Pearl’s do-calculus rules are sound for summary causal DAGs: any causal relationship derived by applying the three do-calculus rules to is valid — i.e., it holds in the interventional distribution compatible with .

Proof sketch: The do-calculus rules are defined in terms of d-separation in the manipulated graph. By s-separation (Theorem 4.2), all CIs derived from hold in every compatible DAG . Therefore, applying do-calculus to produces results that are valid across all compatible DAGs.

6.2 Do-Calculus Completeness in Summary Causal DAGs

Theorem 6.2 — Completeness of Do-Calculus for Summary Causal DAGs

Let be a summary causal DAG for , and let be disjoint sets. Let be the set of nodes that are not ancestors of any node in .

If is d-connected to in given in (the manipulated graph with edges into removed), then there exists a causal DAG compatible with , such that is d-connected to in . More formally:

Significance: Do-calculus is also complete for summary DAGs — any identifiable causal effect in the summary DAG is also identifiable in the canonical causal DAG (and vice versa). This means the summary DAG does not lose identifiability relative to the canonical representation.

Proof sketch: Relies on the equivalence between the RB of and (Theorem 4.1) and the Shpitser-Pearl completeness of do-calculus for DAGs.

ATE Computation on Summary Causal DAGs

Practical causal inference using a summary DAG follows the same steps as on the original DAG:

ATE from Summary DAG (REDSHIFT Example)

Original DAG: 12 nodes, 23 edges (GPT-4 constructed, with 21 correct + 1 inverted + 1 missed edge).

Query: What is the causal effect of Query Template on Elapsed Time?

Using CaGReS summary ():

  1. Compute the summary DAG with CaGReS — 5 cluster nodes, ~9 edges.
  2. Identify the treatment cluster containing Query Template and outcome cluster containing Elapsed Time.
  3. Find the adjustment set using the backdoor criterion on the summary DAG (via s-separation).
  4. Estimate .

Robustness advantage: The summary DAG subsumes the extraneous edges added by GPT-4 (5 random edges). Estimating ATE on the 28-edge erroneous DAG would require adjusting for incorrect confounders. The summary DAG filters this noise: the erroneous edges from the original DAG’s monitoring view (e.g., Plan Time → Lock Wait Time) are subsumed by groupings that prioritize the core causal paths.

The Minimization Principle for ATE

ATE Minimization (Robustness)

To minimize the adjustment set in causal estimations from the summary DAG , the adjustment set should be ordered before all nodes in the treatment cluster in . Alternatively, an upper and lower bound on the adjustment set can be derived by considering all subsets of ‘s cluster in .

Connections

  • The soundness result validates CaGReS as a practical tool for causal inference: the summary DAG it produces is guaranteed to give valid causal conclusions.
  • Directly applies Frequentist Causal Estimation adjustment methods (backdoor criterion, IPW, outcome modeling) — these work identically on summary DAGs by Theorems 6.1–6.2.
  • The completeness result implies that if you can identify a causal effect in the original DAG, you can identify it in the summary DAG — no loss of identifiability from summarization.
  • Connects to Potential Outcomes Framework — the ATE formula is the do-calculus version of the potential outcomes ATE .

See Also