s-Separation in Summary DAGs: Sound and Complete CI Identification

Summary

s-Separation extends d-separation to summary causal DAGs. A CI statement $X ⊥ ⊥ Y ∣ Z$ holds in a summary DAG $(H, f)$ if $X$ and $Y$ are d-separated in every causal DAG compatible with $H$ . This is equivalently characterized by d-separation in the canonical causal DAG $G_{H}$ . Theorem 4.2 establishes that s-separation is sound and complete: the s-separation algorithm correctly identifies exactly those CIs that are valid across all compatible DAGs.

Overview

In a standard causal DAG, d-separation identifies all conditional independence relationships encoded in the graph. For a summary DAG — which represents a set of compatible causal DAGs — we need a stricter notion: a CI should only be claimed if it holds in all compatible DAGs (conservative inference). s-Separation provides this.

Main Content

CI Validity in Summary DAGs

Valid CI in a Summary Causal DAG (Definition 6)

A CI statement $(X ⊥ ⊥ Y ∣ Z)$ is valid in a summary causal DAG $(H, f)$ if and only if it holds in every causal DAG $G \in {G_{i}}_{H}$ (every DAG compatible with $H$ ).

Equivalently: $X$ and $Y$ are d-separated given $Z$ in every compatible $G$ .

This is stricter than d-separation in $H$ alone, because $H$ might encode a CI that not all compatible DAGs share.

s-Separation

s-Separation (Definition 7)

Given a summary causal DAG $(H, f)$ and disjoint sets $X, Y, Z \subseteq V (H)$ , nodes $X$ and $Y$ are s-separated given $Z$ in $(H, f)$ , denoted:
$X ⊥ ⊥_{s} Y ∣ Z_{(H, f)}$
if and only if $f^{- 1} (X)$ and $f^{- 1} (Y)$ are d-separated given $f^{- 1} (Z)$ in the canonical causal DAG $G_{H}$ .

That is: expand the summary DAG nodes back to their original variable sets, then apply standard d-separation in the canonical DAG.

Key property: Because $G_{H}$ is a supergraph of any compatible $G$ (it has the most edges), d-separation in $G_{H}$ is the strictest criterion — it identifies only CIs present in all compatible DAGs, not just some.

Algorithm for s-Separation

s-Separation Algorithm

Given summary DAG $(H, f)$ :

Construct the canonical causal DAG $G_{H}$ (using Definition 5 in Canonical Causal DAGs).

For query $(X ⊥ ⊥ Y ∣ Z)$ in the summary: expand to $(f^{- 1} (X) ⊥ ⊥ f^{- 1} (Y) ∣ f^{- 1} (Z))$ in $G_{H}$ .

Apply any standard d-separation algorithm on $G_{H}$ .

Result: the CI holds in the summary DAG iff it holds in this expanded query.

Why naive d-separation on $H$ fails: Consider a 5-node summary. Two nodes $A$ and $C$ may appear d-separated in $H$ , but in the canonical DAG the within-cluster edges create a path between $f^{- 1} (A)$ and $f^{- 1} (C)$ — so the CI does not actually hold in all compatible DAGs.

s-Separation Example (from §4.2.1)

Referring to Fig. 3, consider summary $H_{1}$ (Fig. 4a, contracting $B$ and $C$ into cluster $BC$ ).

Query: $(B ⊥ ⊥_{d} E ∣ D)$ in $H_{1}$ — does this hold?

In the canonical DAG $G_{H_{1}}$ : the cluster $BC$ is expanded; check d-separation of $f^{- 1} (B) = {B, C}$ and $f^{- 1} (E) = {E}$ given $f^{- 1} (D) = {D}$ in $G_{H_{1}}$ .

Result: $(B ⊥ ⊥ E ∣ D)$ and $(C ⊥ ⊥ E ∣ D)$ hold in $H_{1}$ (established in the paper), but $(BC ⊥ ⊥ E ∣ D)$ with the canonical expansion shows this holds only if the within-cluster $B \to C$ edge doesn’t create a path — which must be checked explicitly.

Soundness and Completeness

Theorem 4.2 — Soundness and Completeness of s-Separation

Let $(H, f)$ be a summary causal DAG for $G$ , and let $X, Y, Z \subseteq V (H)$ be disjoint sets.

$X$ and $Y$ are s-separated given $Z$ in $(H, f)$ if and only if $X$ and $Y$ are d-separated given $Z$ in every causal DAG $G \in {G_{i}}_{H}$ compatible with $H$ .

Formally:
$(X ⊥ ⊥_{s} Y ∣ Z)_{(H, f)} ⟺ \forall G \in {G_{i}}_{H} : (f^{- 1} (X) ⊥ ⊥_{d} f^{- 1} (Y) ∣ f^{- 1} (Z))_{G}$
Soundness: If s-separation says $X ⊥ ⊥ Y ∣ Z$ , then this CI holds in all compatible DAGs (no false claims of independence).

Completeness: If $X ⊥ ⊥ Y ∣ Z$ holds in all compatible DAGs, s-separation will identify it (no missed valid CIs).

Proof: Uses the equivalence of RBs (Theorem 4.1): the canonical DAG $G_{H}$ is compatible with $H$ and is a supergraph of any other compatible DAG. Therefore d-separation in $G_{H}$ is equivalent to d-separation in all compatible DAGs.

Practical Implication

s-Separation provides a conservative but correct inference tool:

It may identify fewer CIs than the true original DAG (because it uses the strictest compatible DAG).
But every CI it identifies is guaranteed to hold — no spurious independence assumptions.
This is the correct tradeoff for summarization: we lose some precision but never introduce incorrect assumptions into a causal analysis.

Connections

Extends Directed Acyclic Graphs’s d-separation to the summary context.
The soundness guarantee is critical for Do-Calculus in Summary Causal DAGs — do-calculus requires valid CI statements as its core primitive.
The conservative nature parallels the approach in Frequentist Causal Estimation — using larger adjustment sets (more confounders) is conservative but unbiased.

Second Brain

Explorer

s-Separation in Summary DAGs: Sound and Complete CI Identification

s-Separation in Summary DAGs: Sound and Complete CI Identification

Overview

Main Content

CI Validity in Summary DAGs

s-Separation

Algorithm for s-Separation

Soundness and Completeness

Practical Implication

Connections

See Also

Graph View

Table of Contents

Backlinks