Directly Causing Inconsistency

The following was written for, and ended up winning, the Strawson Prize (2025).

Singular direct causation features prominently in our ethical (Byrne forthcoming), legal (Byrne 2023), and metalinguistic reasoning (Martin 2018, a.o.). I argue that it is to be understood as a metaphysical analogue of the metalinguistic relation of witnessing from mathematical logic: whereas the latter relates to an existentially quantified sentence, the former relates to a proposition. I prove that direct causation is inconsistent with the view that necessary equivalence suffices for propositional identity, and direct causation by propositions or events is inconsistent outright. Such results are fundamental limitations on any Strawsonian (1951) project of descriptive metaphysics. I then take up a revisionary metaphysical project, showing how Martin’s (2018) account of direct causation can be improved to yield sufficiency in context (compare Kodsi and Kaiserman, manuscript). This is an attractive replacement for direct causation, as well as for causation more generally.

Introduction

In his Individuals: An Essay in Descriptive Metaphysics (1951), P. F. Strawson wrote that he was “content to describe the actual structure of our thoughts about the world” (p. 9). After all,

there are categories and concepts which, in their most fundamental character, change not at all. Obviously these are not the specialities of the most refined thinking. They are the commonplaces of the least refined thinking; and are yet the indispensable core of the conceptual equipment of the most sophisticated human beings (p. 10).

A paradigm of such a category is that of causation. Causal knowledge is central to our ability to respond appropriately to our environment. For one thing, it enables us to reason from knowledge of what has happened to knowledge of what will happen. For instance, if a human or other animal knows that berries nourish—or, more explicitly, that eating berries causes one to be nourished—and knows that it has just eaten berries, then it can infer that it will be nourished. This ability to anticipate results can also be deployed “offline” against what has only happened in the imagination, enabling us to evaluate candidate courses of action by reasoning about what consequences each would have if taken. For instance, if one wants to be nourished, knowing that berries nourish but pebbles do not nourish can help one decide to eat berries rather than pebbles. Compare Williamson 2020 on counterfactual knowledge.

Causal knowledge can be formulated in generic terms, as in (1), or in singular terms, as in (2).

(1) Smoking causes cancer.

(2) Ann’s smoking caused her cancer.

Our starting examples about berries were formulated in generic terms, but could have been written in singular terms (‘these berries’, ‘those pebbles’). The generic formulations might be simpler cognitively; however, they are almost certainly more complex semantically. In particular, we conjecture that the semantics of generic causal claims will be delivered simply by applying an adequate general account of generics to an independent account of the semantics of singular causal claims. For this reason, we focus on the semantics of singular causal claims—a formidable enough task.

Among singular causal claims, there is another independent distinction between lexical causal claims, where the causal relation is encoded in more specific words—as in (3)—and explicit causal claims, where an explicit causal operator is used—as in (4). Note that both (3) and (4) are singular causal claims, but it is easy to think of their generic counterparts.

(3) The sunlight melted the snow.

(4) The sunlight made the snow melt.

What is the relationship between lexical and explicit causal claims? It is orthodoxy in semantics that lexical causal claims express only direct causation, while explicit causal claims may also express indirect causation. See for instance Martin 2018, arguing against this assumption. For example, suppose that the sunlight charges a solar panel powering a heated driveway, which melts the snow: the sunlight causes the snow to melt, but seems to do so only indirectly. As a result, some might judge (5) to be true (although we disagree; more on this later).

(5) ? The sunlight made the snow melt, but didn’t melt the snow.

Characterising Direct Causation

One attempt at regimenting the idea that direct causation is what’s expressed by lexical causal claims is given by Byrne (2021). Let direct causation be the relation expressed by ‘makes’ so as to validate the following schema, where ‘ϕ’ is an operator that has both a transitive usage ϕt (which allows but need not require a second argument) and an intransitive usage ϕi (which does not allow a second argument). We use ‘⇔’ to indicate synonymy (understood weakly as necessary truth-functional equivalence, without requiring cognitive equivalence).

(6) a makes b ϕia ϕt b

One example of such an operator ϕ is ‘bake’. Suppose that Jake is baking a cake. Then, Jake is baking, in the sense that there is something such that Jake is baking it. At the same time, the cake is baking, but in a very different sense: there is something such that it is baking the cake. It would be very odd if Jake were baking in the latter sense, or the cake were baking in the former sense. The sense in which Jake bakes is transitive, even though it occurs without a second argument; the sense in which the cake bakes is intransitive. So, ‘Jake’, ‘bake’, and ‘the cake’ can be substituted for ‘a’, ‘ϕ’, and ‘b’ respectively.

(7) Jake makes the cake bake ⇔ Jake bakes the cake.

Or, to return to our earlier example:

(8) The sunlight makes the snow melt ⇔ The sunlight melts the snow

Such examples may be multiplied fairly easily. Above, we gave a syntactic characterisation of (in)transitivity. But this is slightly confusing, because the transitive usage can also occur without a second argument. As the explanation for ‘bake’ suggested, we can instead give a general semantic characterisation of the distinction. The transitive usage is an active one, whereas the intransitive usage is a passive one. We can regiment this thought as follows.

(9) c ϕt ⇔ ∃x (c ϕt x)

(10) c ϕi ⇔ ∃x (c ϕt x)

One issue for Byrne’s schema is that some verbs which seem pre-theoretically to encode causation—like ‘destroy’ or ‘decimate’—nevertheless lack the required intransitive usage. Byrne’s solution is to note that we could of course coin such usages, so as to validate the schema. However, we need not resort to such drastic measures. Given the semantic characterisation (10) of the intransitive usage as the passive one, we can understand Byrne’s schema (8) as merely a corollary case of the more general paraphrase schema (11). Byrne does not make this connection, but it is common practice to characterise the causative alternation that Byrne’s schema requires explicitly as the availability of a passive intransitive reading.

(11) a makes (it such that) ∃x (xϕb) ⇔ aϕb

Thus, the restriction on ϕ in Byrne’s schema is artificial: it is in some sense a mere artefact of (8). The paraphrase schema (11) is indeed much more general; it places no restrictions whatsoever on the transitive operator ϕ. For instance, it can handle ‘destroy’.

(12) The wave makes (it such that) something destroyed the sandcastle ⇔ The wave destroyed the sandcastle

Of course, the left hand side is not entirely natural English; let us instead write (12) as follows.

(12’) The wave directly causes the sandcastle to get destroyed ⇔ The wave destroyed the sandcastle

In English, some transitive operators may be separable. We thus assimilate resultatives, and indeed all transitive operators, rather than just those traditionally understood as causatives, although we treat the result state as having been wiped dry instead of simply being dry. We can capture the latter if having wiped something dry is equivalent to one’s wiping having dried it, and similarly for other resultatives. See for instance Levin 2018.

(13) Ben directly causes the table to be wiped dry ⇔ Ben wipes the table dry

Further, wh-clauses and that-clauses may be substituted for ‘a’ and ‘b’. (Formally, the relata of direct causation can also be variables and propositions.)

(15) Cora directly causes where it is to be known ⇔ Cora knows where it is

(16) That he’s dead will directly cause her to be shocked ⇔ That he’s dead will shock her

Finally, notice that the paraphrase schema (11) may be iterated, which is a fairly distinctive feature. We’ll use this later to help motivate contextualism about lexical causal claims, but it will not be needed for the inconsistency results.

(17) The poison directly caused him to be directly caused to be killed ⇔ The poison directly caused him to be killed (⇔ The poison killed him)

These nice features of the paraphrase schema (11) are not naturally available to Byrne’s schema (8); we suggest that the only reason to buy Byrne’s schema (8) is as a corollary or special case of the paraphrase schema.

The paraphrase schema (11) is reminiscent of the witness schema (18) from mathematical logic. For comparison, (11) is reproduced, this time with ‘▷’ in place of ‘makes (it such that)’, and ‘b’ assimilated into the operator—formally, we have φ = λ_y_.(yϕb).

(11’) a ▷ ∃x.φ(x) ⇔ φ(a)

a’ witnesses ‘∃x.φ(x)’ ⇔ ‘φ(a)’ is true

Read (11’) as ‘a directly causes there to be a φ just in case a is a φ’; or, abusing jargon, ‘a witnesses that something has φ just in case a has φ’.

Officially, although the quotation marks are typically elided, the witness schema (18) is metalinguistic: it relates strings of symbols. By contrast, the direct causation schema (11’) is metaphysical: it relates what’s expressed by those strings of symbols. In particular, witnessing is a relation to an existentially quantified sentence, whereas direct causation is a relation to a proposition.

It is worth remarking that this formulation looks incredibly promising so far. In particular, it positions direct causation as standing to lexical causal claims as knowledge stands to factive mental state claims (Williamson 2000); as metaphysical possibility stands to objective possibility claims (Williamson 2016); and as absolute universal quantification stands to restricted universal claims (Williamson 2003). Generally, having the technology of the broadest species of some family of properties allows one to bootstrap from a pre-theoretical grasp of those properties individually to a theoretical characterisation of the entire family, as particular versions of the most general species. For instance, to be nomologically necessary is to be metaphysically necessary given the actual laws of nature. One should feel hope for a more principled and general understanding of lexical causal claims. Unfortunately, as we will now see, those hopes cannot come to fruition.

Intensionalism Directly Causes Inconsistency

As a warm-up, let us first show that (11’) is inconsistent with the background metaphysical principles that necessarily equivalent propositions are identical (‘intensionalism’), that anything which exists necessarily exists (‘necessitism’), and that there necessarily are at least two things (which also follows from necessitism and the fact that there are at least two things, given the necessity of distinctness, a principle that allows us to move to the last line—but we’re already done before then, too).

Pick any a. By necessitism, it is a necessary truth that there is something which is a. As there are necessarily at least two things, it is also a necessary truth that there is something which is not a. By intensionalism, since there is only one necessary truth, these truths are identical. That is, we can substitute ∃x(x = a) and ∃x(xa) for one another salva veritate.

But then, given (11’) as well, it is easy to see that the following are equivalent.

That is, every impossibility is a necessary truth. Clearly, this trivialises the logic. Now, friends of intensionalism and necessitism—like Bacon and Dorr (2024) and Williamson (2014)—may be content to stop here: after all, we have just learned that direct causation is inconsistent given what we know about propositional identity and existence.

As we saw in (16), the direct causation schema (11’) need not be restricted such that the direct cause must be an individual (of type e); indeed, direct causes may be propositions (of type t), like the proposition that some man is dead. Perhaps this is the wrong way to think about that-clauses in English; but, given the resources of higher order logic, and the orthodoxy of the thought that propositions or events are causes, it is independently worth exploring. Of course, that inconsistency results at any type is also a negative sign. Let’s focus on this special case. Of course, this means that the logic with which we characterised the direct causation schema (11’) had better be able to quantify into sentence position (intuitively, to quantify over propositions). Once we’re in a higher-order language, we can dispense with the schematic presentation as well, and put the direct causation schema as a single sentence which quantifies explicitly into sentence position and operator position (specifically, monadic operators of type ⟨t, t⟩, like ‘¬’). Finally, we may as well rewrite ‘⇔’ explicitly as necessary equivalence. Note that ‘▷’, as it appears below, is of type ⟨t,⟨t, t⟩⟩, like the operator ‘∧’. Making these changes, we have the following propositional direct causation formula (PDC). For an introduction to higher order languages of this sort as applied to metaphysics, see the exposition in, e.g., Williamson (2014) or Bacon and Dorr (2024).

(PDC) ∀φp□((p ▷ ∃q φq) ↔ φp)

In English, read: ‘p directly causes there to be a φ just in case p is a φ’.

Note also that this formula follows from the generalisation across all types (which, to be a sentence rather than a schema, requires quantifying over types; see generally Wilhelm 2025).

(DC) σ⟨σ, t⟩φσx □((xx, tσq φq) ↔ φp)

This is the natural way to (implicitly) define direct causation by anything of any type. But, given intensionalism and necessitism, (DC) is false. Indeed, since our above argument generalised to any type for which there are—loosely speaking—at least two things of that type, (DC) is false for every such type. For instance, at type t, we have ⊤ (the necessary truth) and ⊥ (the impossibility). We can formulate our discovery as follows.

(NDC) σ (∃<sub_σ</sub>xσy (xy) → ¬∃σ,⟨t, t⟩⟩∆ ∀σ, tφσx □((x∆σq φq) ↔ φp))

This is a theorem of any logic expressive enough to quantify over types and strong enough to prove intensionalism for propositions and necessitism at all types. The schematic version of (NDC) is provable, of course, without quantifying over types. Either (NDC) or a schematic version thereof applies to individuals (including events) and to propositions, and in particular shows that there cannot be direct causation by individuals (including events) or propositions. Indeed, there cannot be direct causation by any type of which there is more than one instance.