s-Separation in Summary DAGs: Sound and Complete CI Identification

Summary

s-Separation extends d-separation to summary causal DAGs. A CI statement holds in a summary DAG if and are d-separated in every causal DAG compatible with . This is equivalently characterized by d-separation in the canonical causal DAG . Theorem 4.2 establishes that s-separation is sound and complete: the s-separation algorithm correctly identifies exactly those CIs that are valid across all compatible DAGs.

Overview

In a standard causal DAG, d-separation identifies all conditional independence relationships encoded in the graph. For a summary DAG — which represents a set of compatible causal DAGs — we need a stricter notion: a CI should only be claimed if it holds in all compatible DAGs (conservative inference). s-Separation provides this.

Main Content

CI Validity in Summary DAGs

Valid CI in a Summary Causal DAG (Definition 6)

A CI statement is valid in a summary causal DAG if and only if it holds in every causal DAG (every DAG compatible with ).

Equivalently: and are d-separated given in every compatible .

This is stricter than d-separation in alone, because might encode a CI that not all compatible DAGs share.

s-Separation

s-Separation (Definition 7)

Given a summary causal DAG and disjoint sets , nodes and are s-separated given in , denoted:

if and only if and are d-separated given in the canonical causal DAG .

That is: expand the summary DAG nodes back to their original variable sets, then apply standard d-separation in the canonical DAG.

Key property: Because is a supergraph of any compatible (it has the most edges), d-separation in is the strictest criterion — it identifies only CIs present in all compatible DAGs, not just some.

Algorithm for s-Separation

s-Separation Algorithm

Given summary DAG :

  1. Construct the canonical causal DAG (using Definition 5 in Canonical Causal DAGs).
  2. For query in the summary: expand to in .
  3. Apply any standard d-separation algorithm on .

Result: the CI holds in the summary DAG iff it holds in this expanded query.

Why naive d-separation on fails: Consider a 5-node summary. Two nodes and may appear d-separated in , but in the canonical DAG the within-cluster edges create a path between and — so the CI does not actually hold in all compatible DAGs.

s-Separation Example (from §4.2.1)

Referring to Fig. 3, consider summary (Fig. 4a, contracting and into cluster ).

Query: in — does this hold?

In the canonical DAG : the cluster is expanded; check d-separation of and given in .

Result: and hold in (established in the paper), but with the canonical expansion shows this holds only if the within-cluster edge doesn’t create a path — which must be checked explicitly.

Soundness and Completeness

Theorem 4.2 — Soundness and Completeness of s-Separation

Let be a summary causal DAG for , and let be disjoint sets.

and are s-separated given in if and only if and are d-separated given in every causal DAG compatible with .

Formally:

Soundness: If s-separation says , then this CI holds in all compatible DAGs (no false claims of independence).

Completeness: If holds in all compatible DAGs, s-separation will identify it (no missed valid CIs).

Proof: Uses the equivalence of RBs (Theorem 4.1): the canonical DAG is compatible with and is a supergraph of any other compatible DAG. Therefore d-separation in is equivalent to d-separation in all compatible DAGs.

Practical Implication

s-Separation provides a conservative but correct inference tool:

  • It may identify fewer CIs than the true original DAG (because it uses the strictest compatible DAG).
  • But every CI it identifies is guaranteed to hold — no spurious independence assumptions.
  • This is the correct tradeoff for summarization: we lose some precision but never introduce incorrect assumptions into a causal analysis.

Connections

See Also