s-Separation in Summary DAGs: Sound and Complete CI Identification
Summary
s-Separation extends d-separation to summary causal DAGs. A CI statement holds in a summary DAG if and are d-separated in every causal DAG compatible with . This is equivalently characterized by d-separation in the canonical causal DAG . Theorem 4.2 establishes that s-separation is sound and complete: the s-separation algorithm correctly identifies exactly those CIs that are valid across all compatible DAGs.
Overview
In a standard causal DAG, d-separation identifies all conditional independence relationships encoded in the graph. For a summary DAG — which represents a set of compatible causal DAGs — we need a stricter notion: a CI should only be claimed if it holds in all compatible DAGs (conservative inference). s-Separation provides this.
Main Content
CI Validity in Summary DAGs
Valid CI in a Summary Causal DAG (Definition 6)
A CI statement is valid in a summary causal DAG if and only if it holds in every causal DAG (every DAG compatible with ).
Equivalently: and are d-separated given in every compatible .
This is stricter than d-separation in alone, because might encode a CI that not all compatible DAGs share.
s-Separation
s-Separation (Definition 7)
Given a summary causal DAG and disjoint sets , nodes and are s-separated given in , denoted:
if and only if and are d-separated given in the canonical causal DAG .
That is: expand the summary DAG nodes back to their original variable sets, then apply standard d-separation in the canonical DAG.
Key property: Because is a supergraph of any compatible (it has the most edges), d-separation in is the strictest criterion — it identifies only CIs present in all compatible DAGs, not just some.
Algorithm for s-Separation
s-Separation Algorithm
Given summary DAG :
- Construct the canonical causal DAG (using Definition 5 in Canonical Causal DAGs).
- For query in the summary: expand to in .
- Apply any standard d-separation algorithm on .
Result: the CI holds in the summary DAG iff it holds in this expanded query.
Why naive d-separation on fails: Consider a 5-node summary. Two nodes and may appear d-separated in , but in the canonical DAG the within-cluster edges create a path between and — so the CI does not actually hold in all compatible DAGs.
s-Separation Example (from §4.2.1)
Referring to Fig. 3, consider summary (Fig. 4a, contracting and into cluster ).
Query: in — does this hold?
In the canonical DAG : the cluster is expanded; check d-separation of and given in .
Result: and hold in (established in the paper), but with the canonical expansion shows this holds only if the within-cluster edge doesn’t create a path — which must be checked explicitly.
Soundness and Completeness
Theorem 4.2 — Soundness and Completeness of s-Separation
Let be a summary causal DAG for , and let be disjoint sets.
and are s-separated given in if and only if and are d-separated given in every causal DAG compatible with .
Formally:
Soundness: If s-separation says , then this CI holds in all compatible DAGs (no false claims of independence).
Completeness: If holds in all compatible DAGs, s-separation will identify it (no missed valid CIs).
Proof: Uses the equivalence of RBs (Theorem 4.1): the canonical DAG is compatible with and is a supergraph of any other compatible DAG. Therefore d-separation in is equivalent to d-separation in all compatible DAGs.
Practical Implication
s-Separation provides a conservative but correct inference tool:
- It may identify fewer CIs than the true original DAG (because it uses the strictest compatible DAG).
- But every CI it identifies is guaranteed to hold — no spurious independence assumptions.
- This is the correct tradeoff for summarization: we lose some precision but never introduce incorrect assumptions into a causal analysis.
Connections
- Extends Directed Acyclic Graphs’s d-separation to the summary context.
- The soundness guarantee is critical for Do-Calculus in Summary Causal DAGs — do-calculus requires valid CI statements as its core primitive.
- The conservative nature parallels the approach in Frequentist Causal Estimation — using larger adjustment sets (more confounders) is conservative but unbiased.
See Also
- Canonical Causal DAGs — provides the mathematical basis for s-separation
- Summary Causal DAGs — the object s-separation operates on
- Do-Calculus in Summary Causal DAGs — uses s-separation for causal effect identification
- Directed Acyclic Graphs — foundational d-separation from which s-separation is built