Class 7

The Intervention Calculus

From probabilistic insight to defensible action - learning the algebra of workplace change

The psychosocial risk committee at Meridian Regional Hospital has spent three months building and calibrating their Bayesian network. They now know, with quantified uncertainty, that the probability of clinically significant psychological injury among their emergency department staff sits at 34%. They can see the web of causal pathways - chronic understaffing feeding into workload, workload eroding job control, low supervisor support amplifying the damage. The network is beautiful, rigorous, and entirely useless - until someone at the table asks the question that matters: "So what do we actually do?"

The chief nursing officer suggests increasing supervisor support, pointing out its strong statistical association with lower distress. The HR director counters that workload reduction would be more effective. The finance manager wants to know which option gives them the most risk reduction per dollar. And a junior analyst, who has been reading ahead, asks a question that silences the room: "Are we sure that making supervisor support high will have the same effect as observing that it's high? Because I don't think it will." That question - the difference between seeing and doing - is where this chapter begins.

The Gap Between Knowing and Doing

Everything we have built across the first six chapters of this course - the directed acyclic graphs, the conditional probability tables, the diagnostic reasoning, the calibrated assessments - has operated on what Judea Pearl calls the first rung of the Ladder of Causation: the level of association, of observing and updating beliefs (Pearl & Mackenzie, 2018). When we entered evidence that a nurse reported high workload and watched beliefs about psychological distress update via Bayes' theorem, we were asking an associational question: given what I see, what should I believe?

But the committee at Meridian Hospital is not asking what to believe. They are asking what to do. And this seemingly small shift - from passive observation to active intervention - requires us to climb to the second rung of the Ladder: the rung of intervention. Pearl's fundamental insight, developed formally in Causality (Pearl, 2009a), is that no amount of observational data alone can answer interventional questions. The mathematics of "seeing" and the mathematics of "doing" are fundamentally different operations, and confusing them is one of the most consequential errors in applied risk management.

This chapter introduces what we will call the intervention calculus - not Pearl's full formal do-calculus (which requires notation most practitioners will never need), but the intuitive reasoning framework that allows psychosocial risk professionals to use Bayesian networks as intervention planning tools. By the end of this chapter, you will be able to take a calibrated BN, simulate the effects of proposed interventions, rank those interventions by cost-effectiveness, and determine whether you should act now or gather more information first.

Observation versus Intervention: The See/Do Distinction

Let us return to the Meridian Hospital network. One of the nodes is Supervisor Support, which has two parent nodes: Management Commitment and Supervisor Workload. When management is committed to staff wellbeing and supervisors themselves are not overwhelmed, supervisor support tends to be high. Supervisor Support, in turn, causally influences Psychological Distress and Team Cohesion downstream.

Now consider two scenarios. In the first, we observe that Supervisor Support is high - perhaps through a staff survey. In the second, we intervene to make Supervisor Support high - perhaps by implementing a structured supervisor training program with protected time for staff check-ins. In both cases, the value of the Supervisor Support node is set to "High." So why would the downstream effects differ?

Why observation and intervention diverge

When we observe that Supervisor Support is high, Bayes' theorem updates our beliefs in all directions - not just downstream but also upstream. Observing high support is evidence about the parents of that node: it becomes more likely that Management Commitment is strong (because committed management tends to produce supportive supervisors) and less likely that Supervisor Workload is crushing (because overwhelmed supervisors struggle to provide support). These updated beliefs about parent nodes then propagate through their own downstream pathways, creating a cascade of revised probabilities throughout the network.

When we intervene to set Supervisor Support to high, something categorically different happens. Our intervention is an external force acting on the system. We are not learning that the natural dynamics of the workplace produced high support; we are forcing it. The intervention tells us nothing about Management Commitment or Supervisor Workload - those variables could be in any state. As Pearl (2009a) formalised, intervention on a variable severs the causal arrows coming into that variable. The node's value is no longer determined by its parents; it is determined by us.

Pearl (2009b) emphasised this point as foundational: "causal relations cannot be expressed in the language of probability alone - any mathematical approach to causal analysis must acquire new notation" (p. 99). That new notation is the do-operator. The probability of psychological distress given that we observe high supervisor support is written P(Distress | Support = High). The probability of distress given that we intervene to make support high is written P(Distress | do(Support = High)). These are different quantities, computed by different procedures, and they can yield very different numbers.

Think About It

Imagine a workplace where Supervisor Support is high only when Management Commitment is high, and Management Commitment independently reduces Psychological Distress through other pathways (e.g., resource allocation, policy quality). If you observe high Supervisor Support, you also gain evidence that Management Commitment is high, which reduces Distress via those other pathways. But if you intervene to raise Supervisor Support without changing Management Commitment, you only get the direct effect. Before reading on, predict: will the observation or the intervention show a larger reduction in Distress? Why?

Graph surgery: the mechanics of simulating interventions

The computational procedure for simulating an intervention in a Bayesian network is elegantly simple - so simple that it has earned the vivid name graph surgery (Pearl, 2009a). To compute the effect of intervening on variable X:

Delete all incoming arrows to node X. This "mutilates" the graph, severing X from its parents. The node is no longer a function of its natural causes; it is now under external control.
Set X to the desired value. Fix the node's probability table so that it takes the intervention value with certainty (e.g., Supervisor Support = High with probability 1.0).
Propagate beliefs through the modified network using standard Bayesian updating. Because the incoming arrows to X have been removed, information does not flow backward through X to its former parents.

The result is the post-interventional distribution - the probability distribution over all variables in the network that would obtain if we performed the intervention. This is what Pearl calls the "truncated factorization" of the joint distribution: we take the original factorization of the joint probability (the product of all conditional probability tables) and simply delete the factor for X, replacing it with a point mass at the intervention value (Pearl, 2009a).

Gonzalez-Jimenez et al. (2022) demonstrated this approach in a risk-assessment context, constructing a BN with 41 causal factors and computing intervention effects using graph surgery. Their key finding echoes our pedagogical point: "associative reasoning answers 'how does new evidence change beliefs about Y?' while intervention reasoning answers 'how does doing X change Y?'" (p. 4). The difference, they showed, was not merely academic - it produced materially different risk estimates and therefore different intervention recommendations.

The widget below lets you experience this distinction directly. Use the Meridian Hospital network to compare what happens when you observe versus intervene on Supervisor Support.

See vs. Do Comparator

Compare the effects of observing that Supervisor Support is High versus intervening to set it High. The hospital BN has confounding parents (Management Commitment → Supervisor Support ← Supervisor Workload), and Management Commitment also directly affects other downstream nodes.

From Single Interventions to Ranked Priorities

The see/do distinction is critical, but it is only the first step. A psychosocial risk committee rarely faces a binary choice. The Meridian Hospital team has at least six modifiable hazard nodes in their network: Workload, Supervisor Support, Role Clarity, Job Control, Social Support, and Management Commitment. Each could be the target of an intervention. The committee needs to know: which intervention produces the greatest reduction in our target outcome?

The answer requires systematically applying graph surgery to each modifiable node, computing the post-interventional probability of the target outcome (say, Psychological Injury), and comparing the results. This procedure - what we might call an intervention sweep - yields a ranked list of intervention targets ordered by their causal impact on the outcome of interest.

The intervention sweep procedure

For each modifiable node X_i in the network:

Perform graph surgery: remove incoming arrows to X_i.
Set X_i to its best feasible state (e.g., Workload = Manageable).
Propagate and record the post-interventional probability of the target outcome.
Compute the absolute risk reduction (ARR): baseline probability minus post-interventional probability.
Restore the original network and repeat for the next node.

This is exactly the approach demonstrated by Mohammadfam et al. (2017), who constructed a workplace safety BN and systematically computed the impact of improving each modifiable factor on overall safety behaviour. Their results showed that the nodes with the strongest statistical associations were not always the best intervention targets - a finding that only becomes visible when you use intervention reasoning rather than associational reasoning. Similarly, García-Herrero et al. (2013) used a BN modelling occupational stress to quantify how social support interventions reduce stress caused by high demands, demonstrating the practical value of intervention simulation in psychosocial risk contexts.

Introducing cost: from effectiveness to cost-effectiveness

A ranked list of interventions by risk reduction is valuable, but it is incomplete. In every real organisation, interventions compete for finite resources. A program that reduces psychological injury risk by 12 percentage points but costs $500,000 may be less attractive than one that reduces risk by 8 percentage points at $50,000. This is where cost-effectiveness analysis enters the intervention calculus.

Gavious et al. (2018), in their systematic review of the economic evaluation of occupational safety and health interventions, found that economic evidence for evaluating psychosocial interventions "remains thin" but that "economic incentives are a key decision factor for employers" (p. 226). The gap they identified - between the recognised need for cost-effectiveness data and the scarcity of such data - is precisely the gap that BN-based intervention simulation can help fill.

The cost-effectiveness ratio for an intervention is computed as:

Cost-Effectiveness Ratio = Intervention Cost ÷ Absolute Risk Reduction

Lower ratios indicate better value: less money per unit of risk reduction. When we plot each intervention as a point in cost–effectiveness space (cost on the x-axis, risk reduction on the y-axis), a Pareto frontier emerges - the set of interventions for which no other single intervention is both cheaper and more effective. Interventions not on the frontier are said to be dominated: there exists at least one alternative that is superior on both dimensions. Rational resource allocation should focus on the frontier.

Think About It

If Intervention A costs $100,000 and reduces injury risk by 10 percentage points, while Intervention B costs $40,000 and reduces risk by 5 percentage points, which has the better cost-effectiveness ratio? Now imagine you have a budget of $140,000. What combination would you choose? Consider why a portfolio approach might outperform selecting a single "best" intervention.

Building an intervention portfolio

In practice, organisations rarely implement a single intervention. They assemble portfolios - combinations of interventions targeting different nodes in the causal network. The power of the BN framework is that it allows us to simulate combined interventions: perform graph surgery on multiple nodes simultaneously and propagate the joint effect. The cumulative risk reduction from a portfolio is typically not the simple sum of individual reductions (because interventions may share downstream pathways), but the BN computes the correct combined effect automatically.

This portfolio approach aligns directly with the hierarchy of controls mandated by the Safe Work Australia Code of Practice and the systematic approach recommended by ISO 45003. Both frameworks emphasise that psychosocial risk management requires addressing hazards at multiple levels - not placing all organisational eggs in one interventional basket.

The widget below lets you conduct a full intervention sweep, incorporate costs, identify the Pareto frontier, and build an intervention portfolio for Meridian Hospital.

Intervention Priority Ranker

Select a target outcome, view ranked risk reductions from graph-surgery interventions on each modifiable node, enter cost data to compute cost-effectiveness ratios, and build your intervention portfolio.

Target outcome to minimise:

Your Recommendation Portfolio

When the Best Action Is to Investigate: The Expected Value of Information

There is an intervention the committee has not yet considered - one that does not appear in any hazard node or budget line. It is the intervention of gathering more information. Sometimes the most valuable thing an organisation can do is not act, but learn. The question is: how do we quantify the value of learning?

This is the domain of expected value of information analysis, formalised in the foundational work of Raiffa and Schlaifer (1961). The core idea is breathtakingly practical: before spending money on an intervention, ask whether spending a smaller amount on investigation would change which intervention you select - and if so, how much better your decision would be.

Expected Value of Perfect Information (EVPI)

The Expected Value of Perfect Information (EVPI) for a given node answers the question: if we could learn the true state of this variable with absolute certainty, how much would that improve our expected outcome? More precisely, EVPI is the difference between the expected value of the best decision made after learning the node's true state (averaged over all possible states) and the expected value of the best decision made with current uncertainty.

Consider Role Clarity in the Meridian Hospital network. Currently, based on survey data from six months ago, the committee estimates Role Clarity is Low with probability 0.6 and High with probability 0.4. Their current best intervention plan (the portfolio from the Priority Ranker) yields an expected psychological injury rate of 18%. But:

If Role Clarity were known to be Low (probability 1.0), the optimal portfolio might shift - perhaps adding a role-clarification workshop - and yield an expected injury rate of 15%.
If Role Clarity were known to be High (probability 1.0), the optimal portfolio might drop the workshop (saving money) and still yield 17%.

The EVPI is the expected improvement: (0.6 × improvement if Low) + (0.4 × improvement if High). If this number is large, it means the committee's decision is sensitive to Role Clarity, and investing in a quick assessment of role clarity before acting could substantially improve outcomes. If EVPI is near zero, the current uncertainty about Role Clarity does not matter much - the same portfolio is optimal regardless.

Expected Value of Sample Information (EVSI)

EVPI represents an upper bound - the value of perfect knowledge, which is rarely attainable. In practice, we collect imperfect information: a survey with measurement error, a focus group with selection bias, an audit with limited scope. The Expected Value of Sample Information (EVSI) quantifies the value of such imperfect evidence. As Raiffa and Schlaifer (1961) formalised, EVSI is always less than or equal to EVPI, and the difference depends on the diagnostic accuracy of the data collection method. If a survey reliably distinguishes Low from High role clarity (high sensitivity and specificity), EVSI approaches EVPI. If the survey is noisy, EVSI may be much lower.

The practical decision rule is simple: if EVSI exceeds the cost of the investigation, investigate before acting. If EVSI is less than the investigation cost, act now with current knowledge. This transforms the seemingly subjective question "should we collect more data?" into a quantified comparison.

Think About It

You are managing a psychosocial risk BN with considerable uncertainty about two nodes: Workload and Social Support. Your intuition says you should investigate Workload first because it seems like the "bigger" risk factor. But what if your intervention portfolio is already optimised for high-workload scenarios and would not change regardless of what you learned about Workload? In that case, EVPI for Workload is near zero. Meanwhile, learning about Social Support might flip your choice between two competing interventions. Which node has higher EVPI - the "bigger" risk factor or the one that would actually change your decision?

Connecting investigation to action

The expected value of information framework creates a natural bridge between the diagnostic reasoning of Chapter 5 and the implementation planning of Chapter 8. Chapter 5 taught us how to update beliefs when evidence arrives; this chapter teaches us which evidence to seek based on its decision-relevant value. And the answer is not always the evidence that reduces the most uncertainty in a general sense (Shannon entropy, for instance), but the evidence that most changes what we would do.

This is a subtle but crucial distinction. A risk professional might want to investigate the node with the widest uncertainty bands, reasoning that "we know the least about this." But EVPI analysis often reveals that the most decision-relevant investigation targets a node with moderate uncertainty whose true state would tip the balance between two close-ranked interventions. The value of information is decision-relative, not absolute.

The widget below lets you explore this directly. Based on your portfolio from the Priority Ranker, determine which unobserved node has the highest EVPI - and compare that to your intuitive guess.

Information Value Calculator

Before acting on your intervention portfolio, consider: would gathering more data about any specific hazard improve your decision? Select the node you intuitively think is most valuable to investigate, then see how your intuition compares to the computed EVPI.

Your intuitive pick - which node would you investigate first?

Putting It All Together: The Intervention Calculus Workflow

We can now articulate a complete workflow for converting a calibrated Bayesian network into a defensible intervention plan. This is the intervention calculus in practice:

Establish the baseline. Using the calibrated BN (with all current evidence entered), record the probability of the target outcome. For Meridian Hospital, this is P(Psychological Injury = Yes) = 0.34.
Identify modifiable nodes. Distinguish between nodes that represent hazards amenable to intervention and those that represent fixed features of the environment (e.g., hospital type, patient acuity mix).
Conduct the intervention sweep. For each modifiable node, perform graph surgery, set the node to its best feasible state, propagate, and record the post-interventional outcome probability. Compute absolute risk reductions.
Incorporate costs. Attach estimated intervention costs to each modifiable node. Compute cost-effectiveness ratios. Identify the Pareto frontier.
Build the portfolio. Select a combination of non-dominated interventions that fits within the budget. Simulate the combined effect using simultaneous graph surgery.
Assess information value. For each unobserved node with substantial remaining uncertainty, compute EVPI. If any EVPI exceeds the cost of investigation, investigate before finalising the portfolio.
Document and justify. Present the ranked interventions, their expected effects, and the rationale for each selection - grounded in the causal structure of the network, not merely in statistical associations.

This workflow does something that no purely statistical or purely qualitative approach can achieve: it quantifies the causal impact of proposed actions, accounts for confounding structure, incorporates resource constraints, and identifies when more information is needed. It is, in essence, the decision-support architecture that frameworks like ISO 45003 call for but rarely specify how to implement.

Common pitfalls to avoid

Even with the intervention calculus in hand, several traps await the unwary practitioner:

Confusing association with intervention. The entire first section of this chapter exists because this error is ubiquitous. Always use graph surgery for intervention reasoning; never substitute observational conditioning.
Ignoring interactions between interventions. The risk reduction from intervening on two nodes simultaneously is generally not the sum of individual reductions. Always simulate the combined portfolio, not just the sum of parts.
Assuming all states are equally achievable. Setting Workload to "Low" in the model is easy; achieving it in a hospital emergency department is hard. The feasibility of the target state must be assessed alongside the cost.
Neglecting implementation fidelity. Graph surgery sets a node to its target value with certainty. Real interventions achieve partial, probabilistic shifts. A more realistic simulation would set the intervention node to, say, 80% High and 20% Moderate, reflecting expected implementation quality.
Treating EVPI as a reason to delay indefinitely. There is always a node with non-zero EVPI. The decision to investigate must compare EVPI to investigation cost and the cost of delayed action. Sometimes acting under uncertainty is better than waiting for perfect knowledge.

Think About It

You have computed that an intervention on Supervisor Support would reduce psychological injury probability by 8 percentage points under the assumption of perfect implementation (support set to High with certainty). But realistically, you expect the training program to shift Supervisor Support to High for about 70% of supervisors and Moderate for 30%. How would you modify the graph surgery procedure to account for this? What would you expect to happen to the estimated risk reduction?

The Ethical Imperative of Causal Reasoning

There is a deeper reason why the see/do distinction matters beyond computational accuracy: it carries ethical weight. When an organisation chooses an intervention based on observational associations alone - "supervisor support is correlated with lower distress, so let's mandate supervisor check-ins" - they may be investing resources in an action whose apparent effect is largely confounded by management commitment. The intervention fails, workers continue to suffer, and the organisation concludes that "psychosocial interventions don't work." This is not merely a waste of money; it is a failure of the duty of care owed to workers.

Pearl and Mackenzie (2018) argued that the inability to distinguish observational from interventional reasoning has been "the main obstacle to a science of cause and effect" (p. 28). In workplace health and safety, the stakes of this obstacle are measured in human suffering. The intervention calculus is not an academic exercise - it is a tool for ensuring that limited resources are directed toward actions that will actually change outcomes, as determined by causal structure rather than spurious correlation.

The Safe Work Australia Code of Practice requires that psychosocial hazard controls be "effective" and "suitable." A BN-based intervention calculus provides a principled, transparent, and auditable method for demonstrating that proposed controls are expected to be effective - and for identifying when they are not.

Key Takeaways

Observing that a variable has a certain value (entering evidence) and intervening to set it to that value (performing an action) are fundamentally different operations that yield different probability distributions.
Graph surgery - removing incoming arrows to the intervention target and fixing its value - is the computational procedure for simulating interventions in a Bayesian network.
An intervention sweep systematically applies graph surgery to every modifiable node, producing a ranked list of interventions ordered by their causal impact on the target outcome.
Combining risk reduction estimates with cost data yields cost-effectiveness ratios and a Pareto frontier that identifies dominated interventions and efficient choices.
The Expected Value of Perfect Information (EVPI) quantifies whether gathering more data about a specific node would change the optimal intervention decision, helping distinguish when to act from when to investigate.
The intervention calculus workflow - baseline, sweep, cost-effectiveness, portfolio, information value - provides a defensible, structured approach to psychosocial risk intervention that meets the demands of ISO 45003 and national codes of practice.
The see/do distinction is not merely technical but ethical: organisations have a duty to base interventions on causal reasoning, not spurious association.

Looking Ahead

Chapter 8 moves from the calculus of what to intervene on to the practical realities of how to implement. We will examine implementation planning - specifying intervention timelines, responsibility assignments, monitoring indicators, and the feedback loops that connect post-intervention data back into the BN for continuous model updating. The models we have built will become living decision-support systems, not one-time analyses.

References

García-Herrero, S., Mariscal, M. A., García-Rodríguez, J., & Ritzel, D. O. (2013). Using Bayesian networks to analyze occupational stress caused by work demands: Preventing stress through social support. Accident Analysis & Prevention, 57, 114–123. https://doi.org/10.1016/j.aap.2013.05.009

Gavious, I., Mizrahi, S., Shani, Y., & Minchuk, Y. (2018). Economic evaluation of occupational safety and health interventions from the employer perspective: A systematic review. Journal of Occupational and Environmental Medicine, 60(3), 226–233. https://doi.org/10.1097/JOM.0000000000001213

Gonzalez-Jimenez, H., Leiras, A., & Hamacher, S. (2022). Exploiting the capabilities of Bayesian networks for engineering risk assessment: Causal reasoning through interventions. Risk Analysis, 42(12), 2842–2867. https://doi.org/10.1111/risa.13918

Mohammadfam, I., Kalatpour, O., Golmohamadi, R., & Khotanlou, H. (2017). Constructing a Bayesian network model for improving safety behavior of employees at workplaces. Safety Science, 95, 77–85. https://doi.org/10.1016/j.ssci.2016.05.008

Pearl, J. (2009a). Causality: Models, reasoning, and inference (2nd ed.). Cambridge University Press. https://doi.org/10.1017/CBO9780511803161

Pearl, J. (2009b). Causal inference in statistics: An overview. Statistics Surveys, 3, 96–146. https://doi.org/10.1214/09-SS057

Pearl, J., & Mackenzie, D. (2018). The book of why: The new science of cause and effect. Basic Books.

Raiffa, H., & Schlaifer, R. (1961). Applied statistical decision theory. Harvard University Press.