Abstract

Proof theorists have long been struggling to provide simple accounts of various modal logics. Drawing on the recent literature on metainferences, I develop in the present paper a novel approach to this challenge: regular sequent systems augmented with natural deduction-like rules for assuming and discharging sequents. Based on this approach, I introduce an elegant calculus for the modal logic |$S4$|⁠. Various discharging conditions on assumptions are shown to yield the weaker logics |$K,D$|⁠, and |$T$|⁠. The rules in these calculi have attractive proof-theoretic properties: they are explicit, symmetrical, and non-circular, and they enjoy the subformula property.

1 Introduction

Modal logics pose a long-standing challenge to proof theorists.1 While their model theoretic accounts—in terms of various constraints on accessibility relations in Kripke models—are neat and transparent, there do not appear to be correspondingly elegant proof-theoretic accounts. The problem is how to formulate proof systems where, as for the non-modal connectives, the rules governing modal operators can be said to define their meanings. To be sure, axiomatic systems for modal logics are easy to formulate, but as is known, such systems are inadequate for meaning-theoretical purposes. Many natural deduction systems and sequent calculi of modal logics face analogous problems. Still other systems—notably, hypersequent and display calculi—are highly abstract, and so it is questionable whether they are fit for meaning-theoretical purposes, as they abstract away from our actual inferential practice.

Drawing on the recent literature on metainferences, I develop in the present paper a novel approach to this challenge, in the form of sequent calculi augmented with natural deduction-like rules for assuming and discharging sequents. Based on this approach, I introduce an elegant calculus for the modal logic |$S4$|⁠. Various discharging conditions on assumptions are shown to yield the weaker logics |$K,D$|⁠, and |$T$|⁠. The question of whether we can provide similar accounts of other systems such as |$B$| and |$S5$| will be left open, as a task for future research.

I proceed as follows. Section 2 sets the stage by introducing the model-theoretic accounts of |$K,D,T,$| and |$S4$|⁠, as well as their axiomatic proof systems. In Section 3, I introduce the basic sequent system that is going to be analyzed here, and prove that it is sound and complete with respect to |$S_{4}$|-Kripke models. In Section 4, I introduce various discharging conditions on assumptions that weaken the proof system of |$S4$|⁠, thereby yielding proof systems for |$K,D,$| and |$T$|⁠; I also establish soundness and completeness results for the latter systems, each with respect to its relevant class of models. I conclude with directions for future research.

2 The existing approaches

I work here with a propositional language |$\mathscr{L}$| that includes, besides the connectives |$\wedge ,\vee ,\neg $|⁠, and |$\supset $|⁠, the modal operator |$\square $|⁠. Let us recall that a Kripke Frame |$\mathscr{F}=<W_{\mathscr{F}},R_{\mathscr{F}}>$| is an ordered pair of a non-empty set of possible worlds (points of evaluation) |$W_{\mathscr{F}}$|⁠, and a binary accessibility relation |$R_{\mathscr{F}}\subseteq W_{\mathscr{F}}\times W_{\mathscr{F}}$|⁠. We denote by |$\mathbb{K}$| the class of all frames, by |$\mathbb{D}$| the class of all such frames in which the accessibility relation is directed, by |$\mathbb{T}$| the the class of all such frames in which the accessibility relation is reflexive, and by |$\mathbb{S}4$| the the class of all such frames in which the accessibility relation is both reflexive and transitive. A Kripke model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| is an ordered pair of a Kripke frame |$\mathscr{F}$| and |$\models _{M_{\mathscr{F}}}\subseteq W_{\mathscr{F}}\times \mathscr{L}$| (or simply |$\models $|⁠, where the relevant model is known) is a satisfaction relation. The semantic clauses for the connectives are regular, and for |$\square $| we have:

(1) |$w\models \square A$| iff for all |$w^{\prime}$| such that |$wRw^{\prime}$|⁠: |$w^{\prime}\models A$|⁠.

We say that a sentence |$A$| is valid with respect to some frame |$\mathbb{F}$| if there is no model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| (where |$\mathscr{F\in \mathbb{F}}$|⁠) such that, for some |$w\in W_{\mathscr{F}}$|  |$w\nvDash _{M_{\mathscr{F}}}A$|⁠; we say that a sequent |$\varGamma \Rightarrow \varDelta $| is valid with respect to some frame |$\mathbb{F}$| if there is no model |$M_{\mathscr{F}}$| (where |$\mathscr{F\in \mathbb{F}}$|⁠) such that, for some |$w\in W_{\mathscr{F}}$|⁠, |$w\models _{M_{\mathscr{F}}}\gamma $| for all |$\gamma \in \varGamma $| but |$w\nvDash _{M_{\mathscr{F}}}\delta $| for all |$\delta \in \varDelta $|⁠.

Let |$CL$| be some Hilbert-style proof system for classical logic.2 We add to |$CL$| the rule of necessitation (⁠|$Nec$|⁠): |${\frac{A}{\square A}}$| (where the derivation of |$A$| does not involve any non-axiom premise). To generate axiomatic (Hilbert-style) proof systems for |$K,D,T,$| and |$S4$|⁠, we need the following axioms:

|$K$|⁠: |$\square (A\supset B)\supset (\square A\supset \square B)$|

|$D$|⁠: |$\square A\supset \neg \square \neg A$|

|$T$|⁠: |$\square A\supset A$|

|$S4$|⁠: |$\square A\supset \square \square A$|

To generate the system |$K$|⁠, we add the axiom |$K$| to |$CL+Nec$|⁠; to generate the system |$D$|⁠, we add the axiom |$D$| to the system |$K$|⁠; to generate the system |$T$|⁠, we add the axiom |$T$| to the system |$K$|⁠; and to generate the system |$S4$|⁠, we add the axiom |$S4$| to the system |$T$|⁠. It is known that |$S4$| is stronger than |$T$|⁠, which is stronger than |$D$|⁠, which is stronger than |$K$|⁠.

Now, the apparent problem with these axioms—and with the systems generated by these axioms—is that they are inadequate for meaning-theoretical purposes. Namely, the axioms do not reveal much about the meaning of the modal vocabulary—that is, of “necessarily,” or |$\square $|—that is at stake here. This problem is particularly important to those interested in proof-theoretic semantics. Thus, attempts were made to develop more transparent proof systems for these logics. Unfortunately, regular natural deduction systems and sequent calculi seem to face the same predicament (see, e.g. [2, 7, 8, 14, 18–20, 28, 30, 32]). For one thing, consider the list of “regular” sequent rules in Table 1, discussed in the literature. A sequent calculus for classical logic along with the rule |$K$| yields a sequent system for the modal logic |$K$| [30]; such a calculus along with |$K$| and |$D$| yields a sequent system for |$D$| [32]; such a calculus along with |$K$| and |$T$| yields a sequent system for |$T$| [20]; and such a calculus along with |$K$|⁠, |$T$|⁠, and |$S_{4}$| yields a sequent system for |$S4$| (ibid).

Table 1.

Sequent rules for modal logics

graphic
graphic
Table 1.

Sequent rules for modal logics

graphic
graphic

However, these rules—and thereby the resultant systems—lack what many authors take to be sine qua non properties in proof-theoretic semantics. Let me be more explicit here. The following is a (partial) list of such properties, along with explanations as to why the above rules violate them:3

1. Explicitness: A rule for an expression |$e$| is explicit if there is only one essential occurrence of |$e$| in the conclusion. This property guarantees that the rule defines only one status of a sentence at a time. Regarding the above rules, only |$T$| is explicit, and so none of the above systems enjoys explicitness as they all contain the |$K$| rule.

2. Symmetry: A set of rules for an expression |$e$| is symmetrical if each rule introduces |$e$| either on the left-hand side or on the right-hand side, and there is at least one rule for each side. This property guarantees that the rules specify |$e$|’s premisory and conclusory roles separately. None of the above systems enjoys that property, as they all contain the |$K$| rule that introduces |$\square $| on both sides (and the same goes for the |$S4$| rule.)

3. Non-Circularity: A set of rules for an expression |$e$| is non-circular if there is no essential occurrence of |$e$| in the premises of each rule. This property guarantees that the rules define |$e$| in a non-circular way. In this sense, |$S4$| is clearly circular.

All in all, it seems unclear whether we can provide simple and intuitive sequent systems (or natural-deduction systems, for that matter) for these logics that enjoy properties 1-3.

It is worth mentioning other kinds of systems that are discussed in the literature. First, there are approaches that enrich the object language with model-theoretic notions like relations between worlds (see, e.g. [18]); the problem with such systems is that there is something ad hoc about them, as they sneak in (to the syntax) model-theoretic notions. Second, there are abstract systems such as hypersequent and display calculi (see, e.g. [1, 15, 21, 23, 24, 26, 27, 35]); however, the notion of inference—with which sequents are naturally associated—is replaced in such systems by more abstract notions such as hypersequents. It is thus questionable whether such systems are fit for meaning-theoretical purposes, as they abstract away from our actual inferential practice.4

The present paper develops a novel technical approach to the challenge of accounting for modal logics proof-theoretically: regular sequent systems augmented with natural deduction-like rules for assuming and discharging sequents. As opposed to the sequent rules described above, the rules in my systems are all explicit, symmetrical, and non-circular; moreover, they enjoy the subformula property. I now turn to discuss the first (and main) system, which is a calculus for |$S4$|⁠.

3 A sequent system for |$S4$|

3.1 Proof theory

The underlying proof system for classical logic that will be in use throughout this paper is the sequent calculus in Table 2, known as |$G3cp$| [17, pp. 49–60].

Table 2.

The |$G3cp$| sequent rules

graphic
graphic
Table 2.

The |$G3cp$| sequent rules

graphic
graphic

The contexts are read here as sets rather than multisets, and so Contraction and Permutation are built-in properties. Here are some properties of this system: (i) even though the axioms are restricted to atomic formulas, one can prove by structural induction that for every formula |$A$| and sets of formulas |$\varGamma ,\varDelta $|⁠, the sequent |$\varGamma ,A\Rightarrow A,\varDelta $| is derivable; (ii) the operational rules of |$G3cp$| are all invertible; and (iii) weakening and cut are both admissible in |$G3cp$| (ibid).

Now, my goal in this paper is to add to |$G3cp$| rules for the modal expression |$\square $|⁠. Ideally, there should be left and right rules for |$\square $|⁠, allowing us to derive sequents like |$\varGamma ,\square A\Rightarrow \varDelta $| and |$\varGamma \Rightarrow \square A,\varDelta $|⁠, respectively, where (i) the contexts |$\varGamma ,\varDelta $| are arbitrary, (ii) |$A$| occurs unembedded (at least once) in the premises of each rule, and (iii) |$\square A$| does not occur (unembedded or otherwise) in any such premise.

As for the right rule, a natural candidate suggests itself: the |$Nec$| rule from the axiomatic systems. Articulated in a sequent calculus setting, this rule takes the form: |$ {\frac{\Rightarrow \square A}{\Rightarrow A}{\square\textrm{R}}}$|⁠.5 Moreover, |$\square R$| gives us a proof-theoretic indication of the meaning of the expression |$\square A$|⁠. For there is a long-standing tradition in proof-theoretic semantics according to which the introduction/right rules of a given expression specify the meaning of that expression (see, e.g. [6, 25]). According to this tradition, then, the meaning of |$\square A$| amounts to the claim that |$A$| is a theorem, i.e. derivable from no premises.6

How should we understand the sequent |$\varGamma ,\square A\Rightarrow \varDelta $|⁠, given the above account of the meaning of |$\square A$|? Simply put, this sequent amounts to the claim that |$\varDelta $| follows from the set of assumptions |$\varGamma $| together with the assumption that |$A$| is a theorem. Notice that by this account, |$A$| is merely assumed to be a theorem—this is just one of our assumptions—but it need not be a theorem as a matter of fact. That is to say, |$\varGamma ,\square A\Rightarrow \varDelta $| may be derivable even in cases where |$A$| is not a theorem.7

How do we express such an assumption? I would like to rely here on recent literature that suggests augmenting sequent calculi with natural deduction-like rules for assuming and discharging sequents [10–12]. In essence, these papers suggest integrating into our calculus “assumed sequents” that need not be derivable. Thus, we may assume |$\Rightarrow A$|—and thereby assume that |$A$| is a theorem—regardless of whether |$A$| is actually a theorem. Whenever such an assumption gives us what we need—i.e. a proof of |$\varDelta $| from |$\varGamma $|—we may discharge it and derive the sequent |$\varGamma ,\square A\Rightarrow \varDelta $|⁠. In this way, we arrive at the following rule:

where the assumption |$\Rightarrow A$| is presented within square brackets and the superscript |$1$| is not part of the syntax, but rather is meant to keep track of our assumptions that need to be discharged. Assumption |$1$| is discharged in the last step of the derivation, and the derivation marked by the four dots—namely, the derivation of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$|—may involve more than one leaf (and so more than one assumption).8 At this point, we do not impose any special constraints on the derivation of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$|⁠; we will do so later, in order to generate various modal logics. Here is a formal definition of derivability in |$G3cp+\square R+\square L$|⁠:

 

Definition 3.1

We say that a sequent |$\varGamma \Rightarrow \varDelta $| is derivable in the system |$G3cp+\square R+\square L$| if it can be derived with a proof tree whose leaves include both axioms and assumptions, such that (i) every step is an application of one of the system’s rules, and (ii) every assumption is discharged at some point.

Notice that |$\square L$| and |$\square R$| are explicit, symmetric, and non-circular: there is only one essential occurrence of |$\square $| in the conclusion of each rule, there is no essential occurrence of |$\square $| in the premise of each rule, and |$\square L,\square R$| are in the form of left/right rules, respectively. Moreover, these rules enjoy the subformula property. For |$\square R$|⁠, this is obvious. For |$\square L$|⁠, we first need to extend the property to cases where sequents are assumed and discharged. The following is a natural such extension: |$\Rightarrow A$| and |$\varGamma \Rightarrow \varDelta $| contain only formulas that are subformulas of formulas in |$\varGamma ,\square A\Rightarrow \varDelta $|⁠.9

Now, the following lemma will prove useful:

 

Lemma 3.2

For every formula |$A$| and sets of formulas |$\varGamma ,\varDelta $|⁠, the sequent |$\varGamma ,A\Rightarrow A,\varDelta $| is derivable in |$G3cp+\square R+\square L$|⁠.

 

Proof.

As usual, the proof proceeds by induction on the structure of |$A$|⁠. The only special case that we need to consider here is where |$A$| is of the form |$\square B$|⁠. In that case, we reason as follows:

Next, I would like to point out that |$\square R$| and |$\square L$| fit together productively: axioms |$K,D,T$|⁠, and |$S_{4}$| can all be derived in |$G3cp+\square R+\square L$|⁠.10 The derivation of |$K$| goes as follows:

The derivation of |$D$| goes as follows:

The derivation of |$T$| is much simpler, as it involves only one assumption, which is discharged at once:

and the derivation of |$S4$| is somewhat similar:

Let me stress what has been achieved so far: not only do |$\square R$| and |$\square L$| have attractive proof-theoretic properties—as opposed to the rules discussed in Section 2—but we can also derive from |$\square R$| and |$\square L$| no less than four different axioms that are characteristic of central modal systems! Thus, I claim, |$\square R$| and |$\square L$| together form an elegant set of rules, and the method of assuming and discharging sequents has proven itself quite valuable.

We may add possibility rules that parallel the necessity ones to the above system, by flipping the sides. Thus, the left possibility rule will be of the form |$ {\frac{\lozenge A\Rightarrow }{A\Rightarrow } {}^{_{\lozenge L}}}$|⁠, and the right possibility rule will be of the form:

Having added these rules, one can show that the resultant system allows for the (cut-free) derivation of the interdefinability of possibility and necessity. Namely, the following sequents are cut-free derivable: |$\neg \square \neg A\Rightarrow \lozenge A$|⁠, |$\lozenge A\Rightarrow \neg \square \neg A$|⁠, |$\neg \lozenge \neg A\Rightarrow \square A$|⁠, and |$\square A\Rightarrow \neg \lozenge \neg A$|⁠. For the sake of simplicity, though, the possibility rules will not be discussed here any further.

I conclude the proof-theoretic presentation of |$G3cp+\square R+\square L$| with some comments on the eliminability of Cut in it. To begin with, notice that Cut is not entirely eliminable in this system, as can be seen, e.g. in the above derivation of axiom |$D$|⁠. The problem is that, even though the applications of Cut can all be “pushed” up a given proof tree, there is no way to push these applications beyond assumptions or applications of |$\square L$|⁠; and, as opposed to a cut on axioms, a cut on assumptions and sequents of the form |$\varGamma ,\square A\Rightarrow \varDelta $| that are derived by |$\square L$| cannot always be dispensed with.

In response, I would like to make two points. First, for many logicians, cut-elimination is a virtue because it guarantees conservativity (see, e.g. [6, pp. 217–220, 250]). But |$\square R$| and |$\square L$| form a conservative extension of |$G3cp$| anyway, as guaranteed by the |$S4$| soundness theorem result below (Corollary 3.8). That is to say, if a sequent |$\varGamma \Rightarrow \varDelta $| whose expressions are taken from the fragment of the language that does not contain the |$\square $| operator is derivable in |$G3cp+\square R+\square L$|⁠, then it is already derivable solely with the |$G3cp$| rules. Otherwise, there would be a Boolean valuation function |$v$| that poses a counterexample to |$\varGamma \Rightarrow \varDelta $|⁠; it is then a simple exercise to construct an |$S4$|-Kripke model that poses a counterexample to |$\varGamma \Rightarrow \varDelta $|⁠, contradicting Corollary 3.8.11

Secondly, it is worth mentioning that we can push all such applications of Cut as far up the proof tree as possible. One way of putting this accurately is to say that Cut is semi-eliminable in this system: if |$\varGamma \Rightarrow \varDelta $| is derivable in |$G3cp+\square R+\square L$|⁠, then it is derivable by a proof tree in which each application of Cut is immediately preceded either by an application of |$\square L$|⁠, or by an assumption.

 

Theorem 3.3

Cut is semi-eliminable in |$G3cp+\square R+\square L$|⁠.

 

Proof.

The proof is similar to the one given by Negri and von Plato in [17, pp. 54–57], and proceeds by induction on proof height, as well as on the complexity of the cut formula. Notice, however, that “proof height” is regarded here (and passim) as the height of a tree whose leaves may include undischarged assumptions; thus, we need not restrict the inductive hypothesis to derivable sequents.

Anyway, there are two additional scenarios that we have to consider:

(i) Suppose that we cut on |$\varGamma ,B\Rightarrow \varDelta $| and |$\varSigma \Rightarrow B,\varPi $| where the cut-formula is not principal in the derivation of one of these sequents, say, |$\varGamma ,B\Rightarrow \varDelta $| (the proof is analogous for the case where the cut-formula is not principal in the derivation of |$\varSigma \Rightarrow B,\varPi $|⁠). We have to consider the additional cases where |$\varGamma ,B\Rightarrow \varDelta $| is derived by |$\square R$| or |$\square L$|⁠. Yet, |$\varGamma ,B\Rightarrow \varDelta $| cannot be derived by |$\square R$|⁠, since |$B$| isn’t principal but |$\square R$| only allows us to derive sequents with one formula on the right (and none on the left). Thus, we only have to consider the case where |$\varGamma ,B\Rightarrow \varDelta $| is derived by |$\square L$|⁠. But in that case there is nothing to prove for semi-eliminability.

(ii) Suppose that we cut on sequents of the form |$\varGamma ,\square A\Rightarrow \varDelta $| and |$\varSigma \Rightarrow \square A,\varPi $|⁠. There are four general cases to consider here:

1. The cut-formula is principal in the derivations of both sequents. Therefore, |$\varSigma =\varPi =\emptyset $|⁠, and so |$\varSigma \Rightarrow \square A,\varPi $|⁠, which is actually |$\Rightarrow \square A$|⁠, must be derived by |$\square R$| from |$\Rightarrow A$|⁠. Moreover, there is a derivation of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$|⁠, since |$\varGamma ,\square A\Rightarrow \varDelta $| is derived by |$\square L$|⁠. A concatenation of the derivation of |$\Rightarrow A$| with that of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$| forms a derivation of |$\varGamma \Rightarrow \varDelta $|⁠, where the cut application is dispensed with, as required.

2. The cut-formula is principal in |$\varSigma \Rightarrow \square A,\varPi $| but not in |$\varGamma ,\square A\Rightarrow \varDelta $|⁠. Therefore, |$\varSigma =\varPi =\emptyset $|⁠, and |$\varSigma \Rightarrow \square A,\varPi $|⁠, which is |$\Rightarrow \square A$|⁠, is derived by |$\square R$| from |$\Rightarrow A$|⁠. There are two general subcases to consider:

2.1. |$\varGamma ,\square A\Rightarrow \varDelta $| is derived by an operational rule of |$G3cp$|⁠, with one or two premise-sequent(s) of the form |$\varGamma ^{\prime}_{i},\square A\Rightarrow \varDelta ^{\prime}_{i}$| (⁠|$i=1,2$|⁠). The general strategy here is to “push” the cut application upward, and apply the operational rule afterwards. For example, suppose that |$\varGamma ,\square A\Rightarrow \varDelta $| is of the form |$\varGamma ,\square A\Rightarrow B\wedge C,\varDelta ^{\prime}$| (where |$\varDelta ^{\prime}=\varDelta \backslash \{B\wedge C\}$|⁠) and it is derived by |$\wedge R$| from |$\varGamma ,\square A\Rightarrow B,\varDelta ^{\prime}$| and |$\varGamma ,\square A\Rightarrow C,\varDelta ^{\prime}$|⁠. We then reason as follows:

The proofs of the other operational rules are analogous.

2.2. |$\varGamma ,\square A\Rightarrow \varDelta $| is derived by Weakening. If we weaken with |$\square A$| (and perhaps other formulas in |$\varGamma \cup \varDelta $|⁠), then |$\varGamma ,\square A\Rightarrow \varDelta $| is derived from |$\varGamma ^{\prime}\Rightarrow \varDelta ^{\prime}$|⁠, where |$\varGamma ^{\prime}\subseteq \varGamma ,\varDelta ^{\prime}\subseteq \varDelta $|⁠. Hence, we may immediately derive |$\varGamma \Rightarrow \varDelta $| by Weakening, thereby eliminating the cut application altogether. Otherwise, |$\varGamma ,\square A\Rightarrow \varDelta $| is derived from |$\varGamma ^{\prime},\square A\Rightarrow \varDelta ^{\prime}$|⁠, where |$\varGamma ^{\prime}\subseteq \varGamma ,\varDelta ^{\prime}\subseteq \varDelta $|⁠. We may then “push” the application of Cut upward in the following way:

3. The cut-formula is principal in the derivation of |$\varGamma ,\square A\Rightarrow \varDelta $| but not in the derivation of |$\varSigma \Rightarrow \square A,\varPi $|⁠. Therefore, |$\varGamma ,\square A\Rightarrow \varDelta $| is derived by |$\square L$|⁠, and so there is a derivation of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$|⁠. Notice that for Cut to be semi-eliminable, we only need to prove that its application can be pushed up the branch of |$\varSigma \Rightarrow \square A,\varPi $|⁠. There are three general subcases:

3.1. |$\varSigma \Rightarrow \square A,\varPi $| is derived by Weakening. If we weaken with |$\square A$|⁠, then |$\varSigma \Rightarrow \square A,\varPi $| is derived from |$\varSigma ^{\prime}\Rightarrow \varPi ^{\prime}$| where |$\varSigma ^{\prime}\subseteq \varSigma ,\varPi ^{\prime}\subseteq \varPi $|⁠. Hence, one application of Weakening gives us:

and so we can completely dispense with the cut application in this case. Otherwise, |$\varSigma \Rightarrow \square A,\varPi $| is derived from |$\varSigma ^{\prime}\Rightarrow \square A,\varPi ^{\prime}$|⁠, where |$\varSigma ^{\prime}\subseteq \varSigma ,\varPi ^{\prime}\subseteq \varPi $|⁠. We may then “push” the application of Cut upward as follows:

3.2. |$\varSigma \Rightarrow \square A,\varPi $| is derived by an operational rule of |$G3cp$|⁠, with one or two premise-sequent(s) of the form |$\varSigma ^{\prime}_{i}\Rightarrow \square A,\varPi ^{\prime}_{i}$| (⁠|$i=1,2$|⁠). The general strategy here is to cut on the premise-sequents before applying the operational rule. For example, suppose that |$\varSigma \Rightarrow \square A,\varPi $| is of the form |$\varSigma ^{\prime},B\vee C\Rightarrow \square A,\varPi $| (where |$\varSigma ^{\prime}=\varSigma \backslash \{B\vee C\}$|⁠), and that it is derived by |$\vee L$| from |$\varSigma ^{\prime},B\Rightarrow \square A,\varPi $| and |$\varSigma ^{\prime},C\Rightarrow \square A,\varPi $|⁠. We then reason as follows:

The proofs of the other operational rules are analogous.

3.3. |$\varSigma \Rightarrow \square A,\varPi $| is derived by |$\square L$|⁠, in which case there is nothing to prove for semi-eliminability.

4. The cut-formula is not principal in the derivation of either sequent. It follows that the last step in the derivation of each sequent involves only the |$G3cp$| rules, and so the proof proceeds as in Negri and von Plato.

3.2 Model theory

We now turn to the task of proving that |$G3cp+\square R+\square L$| is sound and complete with respect to models of |$\mathbb{S}4$| frames, starting with soundness. To begin with, since this system involves a rule that allows us to assume and discharge sequents, we have to deal with the question of what it even means for such a rule to be “sound.” To answer this question, observe that a move like that of inferring |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$|—i.e. an inference between inferences—is sometimes being called a “metainference” in the literature.12 The sequent |$\Rightarrow A$| plays the role of a premise-inference in that metainference, whereas the sequent |$\varGamma \Rightarrow \varDelta $| plays the role of a conclusion-inference in it. Needless to say, such metainferences may also be of the form graphic—i.e. involve more than one premise-inference—in case we apply |$\square L$| several times in the proof tree.

Now, one can find in the literature (e.g. [3, 9, 13]) two criteria for assessing metainferences model-theoretically, known as the global and the local notions of metainferential validity:

1. According to the global notion, metainferential validity is a matter of validity preservation. That is, a metainference is globally valid if, given that its premiss-inferences are all valid, its conclusion-inference is valid too.

2. According to the local notion, metainferential validity is a matter of satisfaction preservation. That is, a metainference is locally valid if there is no model that satisfies its premiss-inferences, but does not satisfy its conclusion-inference.13

The debate over which of these two criteria is better is an ongoing issue in the philosophy of logic (ibid), but we need not engage with it here. For, the global notion is clearly unfit for the purpose of the present paper, i.e. for assessing metainferences with a premise-inference of the form |$\Rightarrow A$|⁠.14 To see this, notice that assessing |$\Rightarrow A$| globally necessarily results in one of the following scenarios:

(i) |$A$| is a theorem, and so the mechanism of assuming and discharging sequents does not make any contribution to |$G3cp+\square R$| : if |$A$| is a theorem then we may derive |$\varGamma \Rightarrow \varDelta $| without any assumption, and |$\varGamma ,\square A\Rightarrow \varDelta $| then follows by one application of Weakening.

(ii) |$A$| is not a theorem, and so the metainference graphic is globally valid vacuously, no matter what |$\varGamma ,\varDelta $| are. In that case, |$\square L$| is unsound. For example, if |$p$| and |$q$| are atoms then the metainference |$ {\frac{\Rightarrow p} {\Rightarrow q}}$| is globally valid vacuously, because there are some models that do not satisfy |$p$|⁠. Thus, in the global reading, the following application of |$\square L$|⁠:

has a valid premise (i.e. the metainference |$ {\frac{\Rightarrow p}{\Rightarrow q}}$|⁠), but an invalid conclusion (i.e. the sequent |$\square p\Rightarrow q$|⁠). In sum, in the global reading the applications of |$\square L$| are trivial in some cases and unsound in others.

By contrast, the local reading requires that we focus exclusively on those Kripke models that satisfy |$\Rightarrow A$|⁠, and then check whether some such models do not satisfy |$\varGamma \Rightarrow \varDelta $|⁠. On the face of it, then, the local reading seems a more promising criterion for assessing whether metainferences of the form graphic are valid. Let us therefore adopt this criterion, and see what follows.

Before we do so, however, we need to say more about what it means for a model to satisfy a sequent. After all, the local criterion draws on satisfaction preservation rather than validity preservation. Let |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| be some Kripke model. What does it mean to say that |$M_{\mathscr{F}}$| satisfies sequents like |$\Rightarrow A$| and |$\varGamma \Rightarrow \varDelta $|? At first glance, one may be tempted to assess such satisfaction in terms of not posing a counterexample. Thus, |$M_{\mathscr{F}}$| will be said to satisfy a given sequent |$\varGamma \Rightarrow \varDelta $| if it does not pose a counterexample to that sequent: either |$M_{\mathscr{F}}\nvDash _{M_{\mathscr{F}}}\gamma $| for some |$\gamma \in \varGamma $|⁠, or |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\delta $| for some |$\delta \in \varDelta $|⁠; indeed, satisfaction is often cashed out in such terms in the literature on metainferences (ibid). Notice that this idea seems to rely on associating each such sequent with a corresponding sentence. Thus, a sequent of the form |$\varGamma \Rightarrow \varDelta $| is naturally associated with the sentence |$\bigwedge \varGamma \supset \bigvee \varDelta $|⁠.

Now, for my purposes here a slightly different notion of satisfaction is required, which can be motivated along the following lines. As opposed to extensional languages, I work in this paper with an intensional language that contains the |$\square $| operator. In this context, there is at least some plausibility in associating a sequent of the form |$\varGamma \Rightarrow \varDelta $| with the sentence |$\square (\bigwedge \varGamma \supset \bigvee \varDelta )$|⁠. After all, such a sequent expresses a claim of a modal nature; it tells us that necessarily one of the following is the case: either some |$\gamma \in \varGamma $| does not hold, or some |$\delta \in \varDelta $| holds.15 Adopting this latter notion, we will say that a Kripke model |$M$| satisfies a sequent |$\varGamma \Rightarrow \varDelta $| iff |$M\models \square (\bigwedge \varGamma \supset \bigvee \varDelta )$|⁠. Hence, we arrive at the following definitions:

 

Definition 3.4

Given a Kripke model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| and some |$w\in W_{\mathscr{F}}$|⁠, we say that |$w\models _{M_{\mathscr{F}}}\varGamma \Rightarrow \varDelta $| if for all |$w^{\prime}\in W_{M}$| such that |$wRw^{\prime}$|⁠: either |$w^{\prime}\nvDash _{M_{\mathscr{F}}}\gamma $| for some |$\gamma \in \varGamma $|⁠, or |$w^{\prime}\models _{M_{\mathscr{F}}}\delta $| for some |$\delta \in \varDelta $|⁠. We say that |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\varGamma \Rightarrow \varDelta $| if |$w\models _{M_{\mathscr{F}}}\varGamma \Rightarrow \varDelta $| for all |$w\in W_{\mathscr{F}}$|⁠.

Based on Definition 4, we finally get our local criterion for assessing metainferences in |$G3cp+\square R+\square L$| model-theoretically:

 

Definition 3.5

We say that a metainference of the form graphic is locally valid with respect to a given class of frames |$\mathbb{F}$| if for all |$\mathscr{F}\in \mathbb{F}$| and |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$|⁠, either there is some |$1\leq i\leq n$| such that |$M_{\mathscr{F}}\nvDash _{M_{\mathscr{F}}}\varGamma _{i}\Rightarrow \varDelta _{i}$|⁠, or |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\varSigma \Rightarrow \varPi $|⁠.

We are now in a position to define what it means for the rules in our system to be sound with respect to a given class of frames |$\mathbb{F}$|⁠. For a rule that does not make use of assuming and discharging sequents, the definition is the familiar one: if no model |$M_{\mathscr{F}}$|(such that |$\mathscr{F}\in \mathbb{F}$|⁠) poses a counterexample to the premise sequent(s), no such model poses a counterexample to the conclusion sequent. For |$\square L$|⁠, the definition can be adjusted straightforwardly: |$\square L$| is sound if, given that the metainference graphic is locally valid with respect to |$\mathbb{F}$|⁠, then no model |$M_{\mathscr{F}}$|(such that |$\mathscr{F}\in \mathbb{F}$|⁠) poses a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠. Needless to say, if the rules of our system are all sound with respect to a class of frames |$\mathbb{F}$|⁠, and given that |$\varGamma \Rightarrow \varDelta $| is derivable in the system, then for all |$\mathscr{F}\in \mathbb{F}$| and |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$|⁠, either |$M_{\mathscr{F}}\nvDash \gamma $| for some |$\gamma \in \varGamma $|⁠, or |$M_{\mathscr{F}}\models \delta $| for some |$\delta \in \varDelta $|⁠.

Now, Lemma 3.6 below is perhaps the most important result in this paper. It can be viewed as a representation theorem, in that it maps patterns of derivations in a sequent calculus—in effect, patterns that will be of use within the scope of assumptions—onto classes of Kripke models characterized in terms of their accessibility relations. In other words, various structures of our subderivations are shown here to reflect various classes of Kripke models as characterized by their accessibility relations.

 

Lemma 3.6

For every frame |$\mathscr{F}=<W_{\mathscr{F}},R_{\mathscr{F}}>$| and Kripke model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| the following conditions hold:

(i) If |$M$| satisfies the premise-sequent(s) of some |$G3cp$| rule, it satisfies its conclusion-sequent.

(ii) For every world |$w\in W_{\mathscr{F}}$|⁠: |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| iff |$w\models _{M_{\mathscr{F}}}\square A$|⁠.

(iii) |$\mathscr{F}\in \mathbb{D}$| iff, there is no world |$w\in W_{\mathscr{F}}$| such that |$w\models _{M_{\mathscr{F}}}A\Rightarrow $| and |$w\models _{M_{\mathscr{F}}}\Rightarrow A$|⁠.

(iv) |$\mathscr{F}\in \mathbb{T}$| iff, for every world |$w\in W_{\mathscr{F}}$|⁠: |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| entails |$w\models _{M_{\mathscr{F}}}A$|⁠.

(v) |$R_{\mathscr{F}}$| is transitive iff, for every world |$w\in W_{\mathscr{F}}$|⁠: |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| entails |$w\models _{M_{\mathscr{F}}}\Rightarrow \square A$|⁠.

 

Proof.

(i) I shall only prove one case: |$\neg R$|⁠; the other cases will be left to the reader as an exercise. Suppose that |$M\models _{M_{\mathscr{F}}}\varGamma ,A\Rightarrow \varDelta $|⁠. Therefore, for all |$w\in W_{\mathscr{F}}$|⁠, for all |$w^{\prime}\in W_{\mathscr{F}}$| such that |$wR_{\mathscr{F}}w^{\prime}$|⁠, either (a) |$w^{\prime}\nvDash _{M_{\mathscr{F}}}\gamma $| for some |$\gamma \in \varGamma $|⁠, or (b) |$w^{\prime}\nvDash _{M_{\mathscr{F}}}A$|⁠, or (c) |$w\models _{M_{\mathscr{F}}}\delta $| for some |$\delta \in \varDelta $|⁠. Either way, it is clear that |$M\models _{M_{\mathscr{F}}}\varGamma \Rightarrow \neg A,\varDelta $|⁠, as required.

(ii) follows directly from Definition 3.4.

(iii) Suppose that there is no |$w\in W_{\mathscr{F}}$| such that |$w\models _{M_{\mathscr{F}}}A\Rightarrow $| and |$w\models _{M_{\mathscr{F}}}\Rightarrow A$|⁠. To the contrary, suppose |$\mathscr{F}\notin \mathbb{D}$|⁠. Therefore, there is some |$w$| to which no world is accessible: there is no |$w^{\prime}$| such that |$wR_{\mathscr{F}}w^{\prime}$|⁠. Hence, it vacuously follows that |$w\models _{M_{\mathscr{F}}}\square A$|⁠, and |$w\models _{M_{\mathscr{F}}}\square \neg A$|⁠. Thus (ii) gives us |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| and |$w\models _{M_{\mathscr{F}}}\Rightarrow \neg A$|⁠, and the latter clearly entails |$w\models _{M_{\mathscr{F}}}A\Rightarrow $|⁠. Hence, we reach a contradiction. The proof in the other direction is analogous.

(iv) Suppose that |$\mathscr{F}\in \mathbb{T}$|⁠, and that |$w\models _{M_{\mathscr{F}}}\Rightarrow A$|⁠. By part (ii), |$w\models _{M_{\mathscr{F}}}\square A$|⁠. Since |$\mathscr{F}\in \mathbb{T}$|⁠, |$R_{\mathscr{F}}$| is reflexive, and so |$w\models \square A\supset A$|⁠, which guarantees (by the semantic clause of |$\supset $|⁠) that |$w\models A$|⁠. Namely, if |$\mathscr{F}\in \mathbb{T}$| then |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| entails |$w\models _{M_{\mathscr{F}}}A$|⁠, as required.

In the other direction, suppose that |$\mathscr{F\notin \mathbb{T}}$|⁠. Therefore, there is some |$w^{*}\in W_{\mathscr{F}}$| such that graphic. Let |$p\in \mathscr{L}$| be some atomic sentence in the language. We now define a new model |$M^{\prime}_{\mathscr{F}}=<\mathscr{F},\models _{M^{\prime}_{\mathscr{F}}}>$| as follows: for all |$w\in W_{\mathscr{F}}$|⁠,

1. If |$q\neq p$| then |$w\models _{M^{\prime}_{\mathscr{F}}}q$| iff |$w\models _{M_{\mathscr{F}}}q$|⁠.

2. |$w\models _{M^{\prime}_{\mathscr{F}}}p$| iff |$w^{*}R_{\mathscr{F}}w$|⁠.

Now, it follows that |$w^{*}\models _{M^{\prime}_{\mathscr{F}}}\square p$|⁠, but |$w^{*}\nvDash _{M^{\prime}_{\mathscr{F}}}p$| since graphic. Employing part (ii) of this lemma, we get that |$w^{*}\models _{M^{\prime}_{\mathscr{F}}}\Rightarrow p$| does not entail |$w^{*}\models _{M^{\prime}_{\mathscr{F}}}p$|⁠, as required.

(v) Suppose that |$R_{\mathscr{F}}$| is transitive, and that |$w\models _{M_{\mathscr{F}}}\Rightarrow A$|⁠. By part (ii) of this lemma we get |$w\models _{M_{\mathscr{F}}}\square A$|⁠. Let |$w^{\prime}\in W_{\mathscr{F}}$| be a world such that |$wR_{\mathscr{F}}w^{\prime}$|⁠, and let |$w^{\prime\prime}\in W_{\mathscr{F}}$| be a world such that |$w^{\prime}R_{\mathscr{F}}w^{\prime\prime}$|⁠. By the transitivity of |$R_{\mathscr{F}}$| we get |$wR_{\mathscr{F}}w^{\prime\prime}$|⁠, and so semantic clause (1) gives us |$w^{\prime\prime}\models _{M_{\mathscr{F}}}A$| (since |$w\models _{M_{\mathscr{F}}}\square A$|⁠.) Therefore, |$w^{\prime}\models _{M_{\mathscr{F}}}\square A$|⁠, and so semantic clause (1) gives us |$w\models _{M_{\mathscr{F}}}\square \square A$|⁠. Hence, |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| entails |$w\models \Rightarrow \square A$|⁠, as required.

In the other direction, suppose that |$R_{\mathscr{F}}$| is not transitive. Therefore, there are three worlds |$w_{1},w_{2}$|⁠, and |$w_{3}$| such that |$w_{1}R_{\mathscr{F}}w_{2}$|⁠, |$w_{2}R_{\mathscr{F}}w_{3}$|⁠, but graphic. Let |$p\in \mathscr{L}$| be some atomic sentence in the language. We now define a new model |$M^{\prime}_{\mathscr{F}}=<\mathscr{F},\models _{M^{\prime}_{\mathscr{F}}}>$| as follows: for all |$w\in W_{\mathscr{F}}$|⁠,

1. If |$q\neq p$| then |$w\models _{M^{\prime}_{\mathscr{F}}}q$| iff |$w\models _{M_{\mathscr{F}}}q$|⁠.

2. |$w\models _{M^{\prime}_{\mathscr{F}}}p$| iff |$w_{1}R_{\mathscr{F}}w$|⁠.

Now, it follows that |$w_{2}\models _{M^{\prime}_{\mathscr{F}}}p$|⁠, but |$w_{3}\nvDash _{M^{\prime}_{\mathscr{F}}}p$|⁠. Therefore, |$w_{1}\models _{M^{\prime}_{\mathscr{F}}}\square p$| but |$w_{2}\nvDash _{M^{\prime}_{\mathscr{F}}}\square p$|⁠, and so |$w_{1}\nvDash _{M^{\prime}_{\mathscr{F}}}\square \square p$|⁠. That is, |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| does not entail |$w\models _{M_{\mathscr{F}}}\Rightarrow \square A$|⁠, as required.

 

Theorem 3.7

If |$\varGamma \Rightarrow \varDelta $| can be derived from the assumption |$\Rightarrow A$| in |$G3cp+\square R+\square L$|⁠, then the metainference graphic is locally valid with respect to |$\mathbb{S}4$|⁠.

 

Proof.

Let |$\mathscr{F}=<W_{\mathscr{F}},R_{\mathscr{F}}>\in \mathbb{S}4$| be some |$S4$|-frame, and |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| be some |$\mathscr{F}$|-model. There are two options: (a) |$M_{\mathscr{F}}\nvDash _{M_{\mathscr{F}}}\Rightarrow A$|⁠, in which case there is nothing to prove, and (b) |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\Rightarrow A$|⁠, in which case we have to prove that |$M\models _{M_{\mathscr{F}}}\varGamma \Rightarrow \varDelta $|⁠. We prove this by induction on proof height.

By part (i) of Lemma 3.6, the applications of all |$G3cp$| rules in that proof preserve satisfaction by |$M_{\mathscr{F}}$|⁠: if |$M_{\mathscr{F}}$| satisfies the premise sequent(s) of each such rule, it satisfies the conclusion sequent as well. Next, suppose that at some point in the proof we have a sequent of the form |$\Rightarrow B$| from which we derive |$\Rightarrow \square B$| by |$\square R$|⁠. By the inductive hypothesis, we have |$M\models _{M_{\mathscr{F}}}\Rightarrow B$|⁠, and by part (v) of Lemma 3.6 (since |$R_{\mathscr{F}}$| is transitive), we have |$M\models _{M_{\mathscr{F}}}\Rightarrow \square B$|⁠, as required.

It remains to deal with the case where the proof of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$| involves an assumption of the form |$\Rightarrow B$|⁠, from which we derive |$\varSigma \Rightarrow \varPi $| and then we discharge the assumption, thereby deriving |$\varSigma ,\square B\Rightarrow \varPi $|⁠. In this case, we need to show that |$M\models _{M_{\mathscr{F}}} \varSigma ,\square B\Rightarrow \varPi $|⁠. Suppose to the contrary that this is not the case, namely, |$M\nvDash _{M_{\mathscr{F}}}\varSigma ,\square B\Rightarrow \varPi $|⁠. Therefore, there is some |$w\in W_{\mathscr{F}}$| such that |$w\nvDash _{M_{\mathscr{F}}}\varSigma ,\square B\Rightarrow \varPi $|⁠. That is, for all |$w^{\prime}$| such that |$wR_{\mathscr{F}}w^{\prime}$|⁠: (1) |$w^{\prime}\models _{M_{\mathscr{F}}}\square B$|⁠, and (2) |$w^{\prime}\models _{M_{\mathscr{F}}}\sigma $| for all |$\sigma \in \varSigma $|⁠, but (3) |$w^{\prime}\nvDash _{M_{\mathscr{F}}}\pi $| for all |$\pi \in \varPi $|⁠. By (2) and part (ii) of Lemma 3.6 we get |$w^{\prime}\models _{M_{\mathscr{F}}}\Rightarrow B$|⁠. The inductive hypothesis then yields |$w^{\prime}\models _{M_{\mathscr{F}}}\varSigma \Rightarrow \varPi $|⁠. But |$wR_{\mathscr{F}}w$| since |$R_{\mathscr{F}}$| is reflexive (as |$\mathscr{F\in }\mathbb{S}4$|⁠), and so either |$w^{\prime}\nvDash _{M_{\mathscr{F}}}\sigma $| for some |$\sigma \in \varSigma $|⁠, or |$w^{\prime}\models _{M_{\mathscr{F}}}\pi $| for all |$\pi \in \varPi $|⁠. In the first case, we get a contradiction to (L) and in the second we get a contradiction to (3). Therefore, |$M\models _{M_{\mathscr{F}}}\varSigma ,\square B\Rightarrow \varPi $|⁠, as required.

 

Corollary 3.8

(Soundness) |$G3cp+\square R+\square L$| is sound with respect to |$\mathscr{\mathbb{S}}4$|⁠.

 

Proof.

It is rather trivial to verify that the |$G3cp$| rules are all sound. Likewise, |$\square R$| is known to be sound. We prove this by structural induction on the complexity of formulas. For, suppose that |$\Rightarrow A$|⁠, namely, |$A$| is a theorem. By the inductive assumption (as is usual in such proofs), for every model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| (where |$\mathscr{F\in \mathbb{S}}4$|⁠) and every |$w\in W_{\mathscr{F}}$| we have |$w\models _{M_{\mathscr{F}}}A$|⁠. Semantic clause (1) then gives us |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\square A$|⁠, as required.

What is left to show is that |$\square L$| is sound. Suppose that we derive |$\varGamma \Rightarrow \varDelta $| from the assumption |$\Rightarrow A$| in |$G3cp+\square R+\square L$|⁠, and then we discharge the assumption and derive |$\varGamma ,\square A\Rightarrow \varDelta $|⁠. We have to show that there is no frame |$\mathscr{F\in \mathbb{S}}4$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| such that |$M_{\mathscr{F}}$| poses a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠. That is, for every |$M_{\mathscr{F}}\in \mathscr{F}$|⁠, either |$M_{\mathscr{F}}\nvDash \gamma $| for some |$\gamma \in \varGamma $|⁠, or |$M_{\mathscr{F}}\nvDash _{M_{\mathscr{F}}}\square A$|⁠, or |$M\models \delta $| for some |$\delta \in \varDelta $|⁠.

Now, if |$M_{\mathscr{F}}\nvDash _{M_{\mathscr{F}}}\Rightarrow A$| then, by part (ii) of Lemma 3.6, |$M\nvDash _{M_{\mathscr{F}}}\square A$|⁠, as required. Otherwise, |$M\models _{M_{\mathscr{F}}}\Rightarrow A$|⁠. By Theorem 3.7, the metainference graphic is locally valid with respect to |$\mathbb{S}4$|⁠, and so |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\varGamma \Rightarrow \varDelta $|⁠. That is, for all |$w\in W_{\mathscr{F}}$|⁠: |$w\models _{M_{\mathscr{F}}}\varGamma \Rightarrow \varDelta $|⁠. The reflexivity of |$R_{\mathscr{F}}$| then yields that either |$w\nvDash _{M_{\mathscr{F}}}\gamma $| for some |$\gamma \in \varGamma $| or |$w\models _{M_{\mathscr{F}}}\delta $| for some |$\delta \in \varDelta $|⁠. Thus, either |$M_{\mathscr{F}}\nvDash _{M_{\mathscr{F}}}\gamma $| for some |$\gamma \in \varGamma $| or |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\delta $| for some |$\delta \in \varDelta $|⁠, as required.

 

Theorem 3.9

(Completeness) Assume that there is no frame |$\mathscr{F}\in \mathbb{S}4$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| such that |$M_{\mathscr{F}}$| poses a counterexample to |$\varGamma \Rightarrow \varDelta $|⁠; then |$\varGamma \Rightarrow \varDelta $| is derivable in |$G3cp+\square R+\square L$|⁠.

 

Proof.

Recall the rules |$K,T,$| and |$S4$| from Table 1. It is proven in [20] that |$G3cp+K+T+S4$| is complete with respect to |$\mathbb{S}4$|⁠. Thus, all we need to prove is that |$G3cp+\square R+\square L$| is at least as strong as |$G3cp+K+T+S4$|⁠. In other words, all we need to show is that the rules |$K,T$|⁠, and |$S4$| can all be derived in |$G3cp+\square R+\square L$|⁠.

1. The derivation of |$T$| is the easiest. Assume that |$\varGamma ,A\Rightarrow \varDelta $| is derivable. We reason as follows:

2. We derive the instances of |$K$| by induction on the cardinality of |$\varGamma $|⁠. If |$\varGamma =\emptyset $| then we assume that |$\Rightarrow A$| is derivable, and so one instance of |$\square R$| does the job. If |$\varGamma =\{\gamma \}$| (a singleton) then we assume that |$\gamma \Rightarrow A$| is derivable. We then reason as follows:

Assume now that |$\varGamma =\{\gamma _{1},...,\gamma _{n+1}\}$|⁠. We then assume that |$\gamma _{1},...,\gamma _{n},\gamma _{n+1}\Rightarrow A$| is derivable. We reason as follows:

3. Likewise, we derive the instances of |$S4$| by induction on the cardinality of |$\varGamma $|⁠. If |$\varGamma =\emptyset $| then we assume that |$\Rightarrow A$| is derivable, and so one instance of |$\square R$| does the job. If |$\varGamma =\{\gamma \}$| (a singleton) then we assume that |$\square \gamma \Rightarrow A$| is derivable. We then reason as follows:

Assume now that |$\varGamma =\{\gamma _{1},...,\gamma _{n+1}\}$|⁠, and that |$\square \gamma _{1},...,\square \gamma _{n},\square \gamma _{n+1}\Rightarrow A$| is derivable. We then reason as follows:

4 |$K,D$|⁠, and |$T$|

|$G3cp+\square R+\square L$| is a proof system for |$S4$|⁠, which is stronger than |$K,D$|⁠, and |$T$|⁠. Therefore, to generate proof systems for |$K,D$|⁠, and |$T$| we have to somehow weaken |$G3cp+\square R+\square L$|⁠. In particular, we have to weaken |$\square L$|⁠, since all the other rules and axioms are derivable in these systems. To be specific, notice that |$\square L$| lets us assume and discharge sequents unrestrictedly. Therefore, to weaken this rule, we shall impose restrictions on reasoning within the scope of assumptions; those restrictions will actually serve as discharging conditions on assumptions.

Now, the restrictions presented below are somewhat complicated. As of yet, I have not found a way to explain them intuitively. The rationale behind those restrictions, however, is explainable: each restriction is meant to block some of the above derivations of the axioms. Bearing Lemma 3.6 in mind, one may understand why they model-theoretically work the way they do. The reader may also notice that the restrictions below are modular, in the sense that each one is independent of the others. Thus, there is no problem getting other subsystems of S4 by combining these restrictions in various ways (see footnote 17 for an example).

At any rate, the systems below are all sub-systems of the above |$S4$| calculus; as such, they can be looked upon as (admittedly cumbersome) ways of recovering |$K,D$|⁠, and |$T$| within the intuitive framework laid out above. I now turn to introduce the appropriate restrictions for each system.

4.1 A sequent system for |$K$|

 

Definition 4.1

Let |$\square _{K}L$| be the rule |$\square L$| subject to the following discharging condition: there has to be one and not more than one application of |$\square R$| in each branch of the tree that leads from an assumption to the point where the assumption is discharged.

Model-theoretically, the intuition behind |$\square _{K}L$| relies on part (ii) of Lemma 3.6, namely, that for every world |$w\in W_{\mathscr{F}}$|⁠: |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| iff |$w\models _{M_{\mathscr{F}}}\square A$|⁠. Consequently, the assumption |$\Rightarrow A$| amounts to exclusively focusing on models in which |$\square A$| is true. Hence, the need to make use of |$\square R$| at least once: a model does not pose a counterexample to whatever is derived from this assumption unless |$\square A$| is false in it. To be more concrete, suppose that we derive |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$|⁠, then part (ii) of Lemma 3.6 guarantees that for every model, either |$\square A$| is false (in which case we focus on the wrong model), or the model does not pose a counterexample to |$\varGamma \Rightarrow \varDelta $|⁠. However, |$\square R$| cannot be used more than once because that would entail that |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| guarantees |$w\models _{M_{\mathscr{F}}}\square \square A$|⁠, which is true (according to Lemma 3.6) only of |$S4$| models.

Now, the following theorem will be useful for establishing soundness:

 

Theorem 4.2

If |$\varGamma \Rightarrow \varDelta $| is derivable in |$G3cp+\square R+\square _{K}L$|⁠, then it is derivable by a proof tree in which each assumption is discharged immediately below the application of |$\square R$| in its branch; if more than one assumption is discharged in a given branch, then these assumptions are all discharged successively, immediately below the application of |$\square R$| in that branch.

 

Proof.

Suppose that in the proof tree of |$\varGamma \Rightarrow \varDelta $| we have an assumption of the form |$\Rightarrow B$| from which we derive |$\varSigma \Rightarrow \varPi $|⁠, and then discharge the assumption so as to derive |$\varSigma ,\square B\Rightarrow \varPi $|⁠. In a manner similar to a Cut-elimination proof, I shall now show by induction on proof height how the application of |$\square _{K}L$| can be “pushed” up the branch of the assumption, above any |$G3cp$| rule that stands between it and the application of |$\square R$|⁠. Carrying out such a procedure in full, we end up with a tree in which the application of |$\square _{K}L$| that discharges |$\Rightarrow B$| comes immediately below the application of |$\square R$|⁠; or, if more than one assumption is discharged in this branch, then carrying out the same procedure for each application of |$\square _{K}L$| separately yields a proof tree where the assumptions are all discharged successively, immediately below the application of |$\square R$|⁠.

Instead of working out each and every detail, I shall prove only one case, namely, |$\vee L$|⁠; the other cases can all be proved in a similar manner, and so they are left to the reader. Suppose that |$\varSigma \Rightarrow \varPi $| is derived by |$\vee L$|⁠. Then |$\varSigma =C\vee D,\varSigma ^{\prime}$|⁠, and |$\varSigma ^{\prime},C\vee D\Rightarrow \varPi $| is derived from |$\varSigma ^{\prime}_{1},C\Rightarrow \varPi _{1}$| and |$\varSigma ^{\prime}_{2},D\Rightarrow \varPi _{2}$|⁠, where |$\varSigma ^{\prime}=\varSigma ^{\prime}_{1}\cup \varSigma ^{\prime}_{2}$| and |$\varPi =\varPi _{1}\cup \varPi{}_{2}$|⁠. Let us assume without loss of generality that |$\varSigma ^{\prime}_{2},D\Rightarrow \varPi _{2}$| is in the branch of the assumption. Namely, the situation is as follows:

We can then “push” the application of |$\square _{K}L$| above the application of |$\vee L$| in the following way:

As I said, the other cases are all proved analogously.

 

Theorem 4.3

(Soundness and completeness) |$\varGamma \Rightarrow \varDelta $| is derivable in |$G3cp+\square R+\square _{K}L$| iff, for every frame |$\mathscr{F}\in \mathbb{K}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$|⁠, either |$M\nvDash _{M_{\mathscr{F}}}\gamma $| for some |$\gamma \in \varGamma $|⁠, or |$M\models _{M_{\mathscr{F}}}\delta $| for some |$\delta \in \varDelta $|⁠.

 

Proof.

For completeness, notice that the derivation of the sequent rule |$K$| in Theorem 3.9 meets the restriction of |$\square _{K}L$|⁠, and so the |$K$| rule is derivable in |$G3cp+\square R+\square _{K}L$|⁠. This is all we need since |$G3cp+K$| is complete with respect to |$K$|-Kripke models [30].

For soundness, we have to show in particular that |$\square _{K}L$| is sound.16 Suppose that the metainference graphic meets the constraint of |$\square _{K}L$|⁠: there is one and only one application of |$\square R$| in each branch of the tree that leads from an assumption to the point where it is discharged. We may thus derive graphic by |$\square _{K}L$|⁠, and our task is to show that for every frame |$\mathscr{F}\in \mathbb{K}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$|⁠, either |$M\nvDash _{M_{\mathscr{F}}}\gamma $| for some |$\gamma \in \varGamma $|⁠, or |$M\models _{M_{\mathscr{F}}}\delta $| for some |$\delta \in \varDelta $|⁠.

By Theorem 4.2, |$\varGamma ,\square A\Rightarrow \varDelta $| is derivable by a proof tree in which each assumption is discharged immediately below the application of |$\square R$| in its branch, and if more than one assumption is discharged in a given branch, then they are all discharged successively, immediately below the application of |$\square R$| in that branch. I shall focus on that particular proof tree, and show that no such model |$M_{\mathscr{F}}$| poses a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠.

For a start, let us assume that the proof tree makes use of only one assumption, namely, |$\Rightarrow A$|⁠. Thus, we assume that the metainference graphic involves only |$G3cp$| rules, followed by one application of |$\square R$| in the last step. Therefore, |$\varGamma \Rightarrow \varDelta $| is actually of the form |$\Rightarrow \square B$| and the derivation within the scope of the assumption is as follows:

where the derivation of |$\Rightarrow B$| from |$\Rightarrow A$| involves only |$G3cp$| rules.

Now, if |$M_{\mathscr{F}}\nvDash _{M_{\mathscr{F}}}\Rightarrow A$|⁠, then part (ii) of Lemma 3.6 gives us |$M\nvDash _{M_{\mathscr{F}}}\square A$|⁠, and so |$M_{\mathscr{F}}$| does not pose a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠, as required. Otherwise, |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\Rightarrow A$|⁠. Since the derivation of |$\Rightarrow B$| from |$\Rightarrow A$| involves only |$G3cp$| rules, and part (i) of Lemma 3.6 guarantees that if |$M_{\mathscr{F}}$| satisfies the premise-sequent(s) of some |$G3cp$| rule it satisfies its conclusion-sequent, it follows by induction on proof height that |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\Rightarrow B$|⁠. By part (ii) of Lemma 3.6 we then get |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\square B$|⁠, and so |$M_{\mathscr{F}}$| does not pose a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠, which is |$\square A\Rightarrow \square B$|⁠, as required.

It remains to deal with the case where the derivation of |$\varGamma ,\square A\Rightarrow \varDelta $| makes use not only of the assumption |$\Rightarrow A$|⁠, but of other assumptions, say, |$\Rightarrow C_{1},...,\Rightarrow C_{n}$|⁠. Once again we assume, based on Theorem 4.2, that |$\varGamma ,\square A\Rightarrow \varDelta $| is derivable with a proof tree in which each assumption is discharged immediately below the application of |$\square R$| in its branch, and work with that tree. Thus, |$\varGamma ,\square A\Rightarrow \varDelta $| is the bottom sequent of that tree, and it is derived by an application |$\square _{K}L$| immediately after the application of |$\square R$| in the branch, possibly with some other applications of |$\square _{K}L$| in-between, but with no application of other rules in-between. It follows in particular that no two-premise rule is applied in this branch below the application of |$\square R$|⁠. That is to say, any other branch in the tree must be combined with the branch of the assumption |$\Rightarrow A$| above the application of |$\square R$| in the latter branch. Which is to say, the application of |$\square R$| in the branch of |$\Rightarrow A$| is common to all the other branches in the tree, and so it is the only application of |$\square R$| in the tree. In this case, the situation must be as follows: |$\varGamma =\square C_{1},...,\square C_{n}$|⁠, and |$\varDelta =\{\square B\}$| and the tree looks like this:

Now, if |$M_{\mathscr{F}}\nvDash _{M_{\mathscr{F}}}\Rightarrow C_{i}$| for some |$1\leq i\leq n$| then, by part (ii) of Lemma 3.6, |$M\nvDash _{M_{\mathscr{F}}}\square C_{i}$|⁠, and so |$M_{\mathscr{F}}$| does not pose a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠, which is |$\square C_{1},...,\square C_{n},\square A\Rightarrow \square B$|⁠, as required. For the same reason, if |$M_{\mathscr{F}}\nvDash _{M_{\mathscr{F}}}\Rightarrow A$| then |$M_{\mathscr{F}}$| does not pose a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠, as required. Otherwise, |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\Rightarrow C_{i}$| for all |$1\leq i\leq n$| and |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\Rightarrow A$|⁠. As before, we then get by induction on proof height that |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\Rightarrow B$|⁠; after all, above the application of |$\square R$| there are only |$G3cp$| rules, and part (i) of Lemma 3.6 guarantees that if |$M_{\mathscr{F}}$| satisfies the premise-sequent(s) of some |$G3cp$| rule, it satisfies its conclusion-sequent. By part (ii) of Lemma 3.6, we then get |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\square B$|⁠, and so |$M_{\mathscr{F}}$| does not pose a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠, which is |$\square C_{1},...,\square C_{n},\square A\Rightarrow \square B$|⁠, as required.17

4.2 A sequent system for |$D$|

 

Definition 4.4

Let |$\square _{D}L$| be the rule |$\square L$| subject to the following discharging condition: for each assumption, either the |$\square _{K}L$| restriction is met, or the empty sequent (⁠|$\Rightarrow $|⁠) is derived at some point in its branch.

Model-theoretically, the intuition behind |$\square _{D}L$| relies on part (iii) of Lemma 3.6: |$D$|-models are characterized by the condition that there they do not satisfy both |$A\Rightarrow $| and |$\Rightarrow A$|⁠. Given the Cut rule, the empty sequent follows from |$A\Rightarrow $| and |$\Rightarrow A$|⁠. Hence, provided that the empty sequent can be derived from our assumptions |$\Rightarrow A_{1},...,\Rightarrow A_{n}$|⁠, we are guaranteed that no |$D$|-model satisfies all of |$\square A_{1},...,\square A_{n}$|⁠.

 

Lemma 4.5

If |$\varGamma \Rightarrow \varDelta $| is derivable in |$G3cp+\square R+\square _{D}L$|⁠, then it is derivable by a proof tree where each assumption is discharged immediately below either the application of |$\square R$| or the derivation of the empty sequent in its branch; if more than one assumption is discharged in a given branch, then these assumptions are all discharged successively, immediately below either the application of |$\square R$| or the derivation of the empty sequent in that branch.

 

Proof.

It is easily observable that the proof of Theorem 4.2 applies to |$G3cp+\square R+\square _{D}L$|⁠, even in cases where the empty sequent is derived.

 

Theorem 4.6

(Soundness and completeness): |$\varGamma \Rightarrow \varDelta $| is derivable in |$G3cp+\square R+\square _{D}L$| iff, for every frame |$\mathscr{F}\in \mathbb{D}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$|⁠, either |$M\nvDash _{M_{\mathscr{F}}}\gamma $| for some |$\gamma \in \varGamma $|⁠, or |$M\models _{M_{\mathscr{F}}}\delta $| for some |$\delta \in \varDelta $|⁠.

 

Proof.

For completeness, notice that the derivation of the sequent rule |$K$| in Theorem 3.9 meets the |$\square _{D}L$| requirement. It thus remains to prove that the |$D$| rule is derivable in |$G3cp+\square R+\square _{D}L$| since |$G3cp+K+D$| is complete with respect to |$D$|-Kripke models [32]. Assume that |$\varGamma =\{\gamma _{1},...,\gamma _{n}\}$|⁠, and that |$\varGamma \Rightarrow $| is derivable. We then reason as follows:

For soundness, we have to show in particular that |$\square _{D}L$| is sound.18 Suppose that we derive |$\varGamma ,\square A\Rightarrow \varDelta $| by |$\square _{D}L$|⁠, and that the derivation of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$| involves |$n$| assumptions |$\Rightarrow A_{1},...,\Rightarrow A_{n}$|⁠, where |$A=A_{1}$|⁠. Since Theorem 4.3 establishes the soundness of |$\square _{K}L$| with respect to all Kripke models (regardless of classes of frames), and since every |$D$|-Kripke model is a |$K$|-Kripke model, we only need to prove the case where each assumption is discharged by the empty sequent in its branch. Thus, we may safely assume that no application of |$\square R$| occurs in the proof tree of |$\varGamma ,\square A\Rightarrow \varDelta $| and so only the |$G3cp$| rules and |$\square _{D}L$| are used in that derivation. We now argue as follows, in five steps:

1. All the rules of |$G3cp+\square R+\square _{D}L$| except for Cut enjoy the subformula property, and so each derivation of the empty sequent in the tree is achieved by an application of Cut. We now carry out the following procedure:

1.1. Following Theorem 3.3, we “push” all the applications of Cut up the proof tree so that each such application is immediately preceded either by an application of |$\square _{D}L$| or by an assumption.

1.2. Following Lemma 4.5, we “push” all the applications of |$\square _{D}L$| up the proof tree so that each such application is immediately preceded by an application of Cut.

We carry out steps 1.1 and 1.2 repeatedly for as long as possible. The result is a proof tree of |$\varGamma ,\square A\Rightarrow \varDelta $| in which each application of Cut is not preceded by any |$G3cp$| rule. Thus, each assumption is being cut immediately after it is introduced.

2. There aren’t many such proof trees. One option is the following proof tree:

(1)

where the sequent |$A_{1},...,A_{n}\Rightarrow $| is derivable. In other options the applications of Cut are made in different orderings, e.g.

(2)

where both |$A_{1},A_{2}\Rightarrow $| and |$A_{3},...,A_{n-1},A_{n}\Rightarrow $| are derivable.

Now, notice that we can easily transform derivation (2) into derivation (1) because if |$A_{1},A_{2}\Rightarrow $| is derivable, then one application of Weakening gives us |$A_{1},...,A_{n}\Rightarrow $|⁠, which is the starting point of derivation (1). For the same reason, one can easily convince oneself that each and every other proof tree—in which each assumption is cut immediately after it is introduced—can be transformed into derivation (1). Thus, we may safely assume that we are working here with the proof tree of derivation (1).

3. Given that there are no applications of |$\square R$| in the tree, we may safely assume by Lemma 4.5 that we are working with a proof tree in which the assumptions are all discharged successively, immediately below the derivation of the empty sequent. That is, we may safely assume that we are working here with the following proof tree:

(3)

4. Now, |$A_{1},...,A_{n}\Rightarrow $| is derivable. By the inductive hypothesis, there is no frame |$\mathscr{F}\in \mathbb{D}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| such that |$M_{\mathscr{F}}$| poses a counterexample to |$A_{1},...,A_{n}\Rightarrow $|⁠. That is to say, there is no frame |$\mathscr{F}\in \mathbb{D}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| such that |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}A_{i}$| for all |$1\leq i\leq n$|⁠. Since the accessibility relation in such models is directed, it follows that there is no frame |$\mathscr{F}\in \mathbb{D}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| such that |$M_{\mathscr{F}}\models _{M_{\mathscr{F}}}\square A_{i}$| for all |$1\leq i\leq n$|⁠. Therefore, there is no frame |$\mathscr{F}\in \mathbb{D}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| such that |$M_{\mathscr{F}}$| poses a counterexample to |$\square A_{1},...,\square A_{n}\Rightarrow $|⁠.

5. Notice that the derivation of |$\varGamma ,\square A\Rightarrow \varDelta $| from |$\square A_{1},...,\square A_{n}\Rightarrow $| (in derivation (3)) does not include any application of |$\square _{D}L$|⁠, since all the assumptions have already been discharged above the derivation of |$\square A_{1},...,\square A_{n}\Rightarrow $| in the proof tree. By the inductive hypothesis of the soundness proof—namely, that all the rules except for |$\square _{D}L$| are sound—we get that there is no frame |$\mathscr{F}\in \mathbb{D}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| such that |$M_{\mathscr{F}}$| poses a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠, which is what we wanted to prove.

4.3 A sequent system for |$T$|

 

Definition 4.7

Let |$\square _{T}L$| be the rule |$\square L$| subject to the following discharging condition: each branch in the proof tree of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$| does not include more than one application of |$\square R$|⁠.

Model-theoretically, the intuition behind the restriction of only one application of |$\square R$| was already discussed in Section 4.1: otherwise, we get |$S4$| models. However, as opposed to the |$\square _{K}L$| rule, the |$\square _{T}$| rule does not require that we make use of |$\square R$|⁠, and the intuition behind this relies on part (iv) of Lemma 3.6: |$T$|-models are characterized by the condition that |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| entails |$w\models _{M_{\mathscr{F}}}A$|⁠. Hence, provided that |$\Rightarrow A$| is satisfied by our model (which is equivalent, by part (i) of this lemma to |$\square A$|⁠), we may well assume that |$A$| is true, as those models are reflexive.

 

Lemma 4.8

If |$\varGamma \Rightarrow \varDelta $| is derivable in |$G3cp+\square R+\square _{T}L$|⁠, then it is derivable by a proof tree where, for each assumption, if there is an application of |$\square R$| in its branch the assumption is discharged immediately below that application with no |$G3cp$| rules in-between (but possibly with applications of |$\square _{T}L$| in-between that discharge other assumptions), and if there is no application of |$\square R$| in the assumption’s branch then the assumption is discharged immediately after it is introduced.

 

Proof.

Once again, it is easily observable that the proof of Theorem 4.2 applies to |$G3cp+\square R+\square _{T}L$|⁠.

 

Theorem 4.9

(Soundness and completeness): |$\varGamma \Rightarrow \varDelta $| is derivable in |$G3cp+\square R+\square _{T}L$| iff, for every frame |$\mathscr{F}\in \mathbb{T}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$|⁠, either |$M\nvDash _{M_{\mathscr{F}}}\gamma $| for some |$\gamma \in \varGamma $|⁠, or |$M\models _{M_{\mathscr{F}}}\delta $| for some |$\delta \in \varDelta $|⁠.

 

Proof.

For completeness, notice that the derivations of the sequent rules |$K$| and |$T$| in Theorem 3.9 above meet the restriction of |$\square _{T}L$|⁠, and so these rules are derivable in |$G3cp+\square R+\square _{K}L$|⁠. This is all we need since |$G3cp+K+T$| is complete with respect to |$T$|-Kripke models [20].

For soundness, we have to prove in particular that |$\square _{T}L$| is sound.19 Suppose that we derive |$\varGamma ,\square A\Rightarrow \varDelta $| by |$\square _{T}L$|⁠. We may assume without loss of generality that no applications of |$\square R$| occur in the proof tree, because |$\square _{K}L$| has already been proven sound with respect to all Kripke models (regardless of classes of frames), and since every |$T$|-Kripke model is a |$K$|-Kripke model.

Suppose then that we derive |$\varGamma ,\square A\Rightarrow \varDelta $| by |$\square _{T}L$|⁠, and that the derivation of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$| involves |$n$| assumptions |$\Rightarrow A_{1},...,\Rightarrow A_{n}$|⁠, where |$A=A_{1}$|⁠. Since there are no applications of |$\square R$| in the tree, we get by Lemma 4.8 a proof tree in which each assumption is discharged immediately after it is introduced. That is to say, the situation is as follows:

Now, it is clear that there is no frame |$\mathscr{F}\in \mathbb{T}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| such that |$M_{\mathscr{F}}$| poses a counterexample to some instance of the (sequent version of the) |$T$| axiom that occurs in this tree. Moreover, below those instances only |$G3cp$| rules are applied. It thus follows by the inductive assumption of the soundness proof—namely, that all the other rules are sound—that there is no frame |$\mathscr{F}\in \mathbb{T}$| and model |$M_{\mathscr{F}}=<\mathscr{F},\models _{M_{\mathscr{F}}}>$| such that |$M_{\mathscr{F}}$| poses a counterexample to |$\varGamma ,\square A\Rightarrow \varDelta $|⁠, as required.

5 Conclusion

Drawing on the recent literature on metainferences, I have presented a novel proof-theoretic approach to modal logics: regular sequent systems augmented with natural deduction-like rules for assuming and discharging sequents. Based on this approach, I have introduced remarkably simple calculi for the modal logics |$K,D,T,$| and |$S4$|⁠. The only differences between these calculi lie in the discharging conditions on assumptions. The rules in the resultant systems have attractive properties: they are explicit, symmetrical, and non-circular, and they enjoy the subformula property.

To conclude, I would like to point out one issue, namely, the question of whether my approach is applicable to other modal logics. It seems particularly interesting to inquire whether the modal logics |$B$| and |$S5$| can be accounted for in similar terms. As of now, I don’t have an answer to this question, and so I leave it as an open question for future research.

Acknowledgements

I would like to thank Robert Brandom, Ulf Hlobil, Ryan Simonelli, and Shuhei Shimamura for fruitful discussions.

Footnotes

1

See [19] for a survey.

2

For example, consider the following system (for the fragment that includes only negation and the conditional), with the following three axioms:|$A\supset (B\supset A)$||$(A\supset (B\supset C))\supset ((A\supset B)\supset (A\supset C))$||$(\neg B\supset \neg A)\supset (A\supset B)$|along with the rule of modus ponens: |$ {\frac{A\ \ \ A\supset B} {B}}$|⁠. Clearly, nothing depends on which axiomatic system for classical logic one chooses.

3

For an extensive discussion of this point, see [6, 21, 24, 35, pp. 1607–1609]. In my discussion below, I mention only the points that are most relevant to the present paper.

4

I take Restall [26, 27] to be an exception in this regard, as he provides (what I take to be) an intuitive motivation for his |$S5$|-hypersequent calculus. For him, a sequent of the form |$\varGamma \Rightarrow \varDelta $| is understood as the statement that the position where one asserts all the sentences in |$\varGamma $| and denies all the sentences in |$\varDelta $| is incoherent. Along the same lines, a hypersequent |$\varGamma _{1}\Rightarrow \varDelta _{1}\mid \varGamma _{2}\Rightarrow \varDelta _{2}\mid ...\mid \varGamma _{n}\Rightarrow \varDelta _{n}$| is understood as a statement concerning not only what one may assert and deny tout court, but also of what one may assert and deny under various suppositions (see [27, pp. 1613–1614] for more details; for Restall’s more recent developments of this view, see [28, 29].) Intuitive as it may be, it is questionable whether Restall’s interpretation applies to hypersequent calculi of other modal logics (Parisi [21] extends Restall’s technical approach, but Restall’s interpretation does not seem to straightforwardly apply to his extension. See in particular his discussion in [21, pp. 1631–1632].) Conceptually, though not technically, my approach in the present paper resembles Restall’s in that the notion of supposition plays an essential role in it.

5

Since Weakening is a |$G3cp$| rule, we may also add contexts immediately after introducing |$\square A$| on the right, thereby getting |$\varGamma \Rightarrow \square A,\varDelta $|⁠.

6

I wish to remain here as philosophically neutral as possible, and so I shall not elaborate on what “theorems” are. Let me nevertheless mention at this point the bilateralist interpretation of sequents that has gained popularity in recent years (see, e.g. [21, 27].) Under this interpretation, the meaning of |$\Rightarrow A$| (and thus of |$\square A$|⁠) amounts to the claim that |$A$| is undeniable according to our discursive norms.

7

There might be some similarity between this approach and Doėn’s view of logical constants as punctuation marks [5], though a thorough discussion of this point goes beyond the scope of the paper. See also [33, 34]. It is also worth mentioning that Doėn developed higher order proof systems for modal logics in which sequents appear to the left and right of the sequent sign. [4] His format is a sequent calculus, while mine is a mixture of sequent calculus and natural deduction. My system permits discharge of initial sequents, as in natural deduction, while in Dosen’s system discharge happens when rules permit the omission of sequents from the left hand side of the sequent sign as one infers one sequent from another (Kürbis [16] has put Doėn’s system into a natural deduction format where sequents are introduced and eliminated, and assumed and discharged). So, it looks as if my system is a special case of Doėn’s in a different framework. In a similar manner, Pfenning and Davies [22] develop a system of natural deduction in sequent calculus style in Martin–Löf’s-type theory, which employs a notion of validity, that is, following from no assumptions. What follows from no assumptions is a theorem. Here, too, there is a clear connection to my approach.

Now, Doėn gets |$S4$| when the consequents of sequents are restricted to at most one formula, and so do Pfenning and Davies. Hence, it seems like a natural proof system for modal logic gives |$S4$|⁠. This observation cries for explanation, but again, a thorough discussion of it goes beyond the scope of the paper.

8

My approach resembles that of Schroeder–Heister (e.g. in [31]), who allows rules of any hight to be assumed and discharged in natural deduction systems. Unfortunately, a proper comparison of the two approaches goes beyond the scope of the present paper. Suffice it to say for Schroeder–Heister, rules are always applied in a derivation and never occur as items that are asserted—only formulas can be asserted. By contrast, in a sequent system like mine formulas can be introduced only in the context of a sequent.

One may also wonder why this possibility—of assuming and discharging sequents—does not change the nature of the system as a sequent-calculus (and not some other kind of proof system). On the face of it, there might be a controversy over definitions here, which bears no conceptual significance. Let me nevertheless point out that only those derivations that involve the |$\square L$| rule look differently than regular sequent derivations. Thus, I am inclined to think that the nature of the system as a sequent calculus does not change when we add the |$\square L$| rule.

9

Moreover, if the derivation of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$| contains no applications of Cut then any formula in that entire derivation is a subformula of some formula in |$\varGamma ,\square A\Rightarrow \varDelta $|⁠. Hence, as the case is with |$G3cp$|⁠, violations of the subformula property (in derivations) are solely due to applications of Cut.

10

The derivations below are of the sequent versions of these axioms; each axiom can be derived (as a theorem) from its sequent version with one application of |$\supset R$|⁠.

11

In addition, there are reasons to develop a more nuanced criterion of harmony for such a system. Needless to say, a full discussion of this point goes beyond the scope of the paper. Let me nevertheless point out that the |$\square $| rule do meet something like a levelling of local peaks criterion for a sequent setting. For, suppose that |$\square A$| is introduced and eliminated at once. In a sequent system, this amounts to a cut on |$\square A$| immediately after it is derived on both sides by the |$\square $| rules, namely,

In this case, |$\Rightarrow A$| is derivable, and there is also a derivation of |$\varGamma \Rightarrow \varDelta $| from |$\Rightarrow A$|⁠, which means that the latter sequent is derivable too. Namely, we can derive |$\varGamma \Rightarrow \varDelta $| directly from |$\Rightarrow A$|⁠, without the detour of the |$\square $| rules.

12

See, e.g. [3, 9] and the references therein.

13

Later, I shall say more about what it means for a model to “satisfy” a sequent. At this point, one may stick to the intuitive understanding that the model does not pose a counterexample to the sequent.

14

My argument below clearly extends to cases where more than one premise-inference is involved, to the extent that the premise-inferences are of form |$\Rightarrow A_{1},...,\Rightarrow A_{n}$|⁠.

15

Indrzejczak’s calculus [15] employs “modal sequents” of the form |$\varGamma \square \Rightarrow \varDelta $| in addition to regular sequents. There is some tension between this approach and my above claim, but I believe that my claim is widely accepted.

16

The proof that |$\square R$| is sound is almost identical to the proof given at the beginning of Section 3.2.

17

An editor of this journal raises the question of whether the modal logic |$K4$| can be provided a proof theory along the same lines. The answer is positive. We get a proof system for |$K4$| if we strengthen |$\square _{K}L$| such that there has to be one—but perhaps more than one—application of |$\square R$| in each branch of the tree that leads from an assumption to the point where the assumption is discharged. The proof is almost identical to that of Theorem 4.3. It is just that the derivation of |$\Rightarrow B$| from |$\Rightarrow A$| (or the derivation of |$\Rightarrow B$| from |$\Rightarrow C_{1},...,\Rightarrow C_{n},\Rightarrow A$|⁠, in the case of multiple assumptions) may involve not only |$G3cp$| rules, but also |$\square R$|⁠. However, in that case, we can rely on part (v) of Lemma 3.6 that guarantees, in the case of transitive models, that for every world |$w\in W_{\mathscr{F}}$|⁠: |$w\models _{M_{\mathscr{F}}}\Rightarrow A$| entails |$w\models _{M_{\mathscr{F}}}\Rightarrow \square A$|⁠.

18

As pointed out in Section 4.1, the proof that |$\square R$| is sound is almost identical to the proof given at the beginning of Section 3.2.

19

As pointed out in Section 4.1, the proof that |$\square R$| is sound is almost identical to the proof given at the beginning of Section 3.2.

References

[1]

Avron
 
A
.
The method of hypersequents in the proof theory of propositional nonclassical logics
. In:
Hodges
 
W
 et al.
(eds),
Logic: From Foundations to Applications
.
Oxford Science Publications
,
1996
,
1
32
.

[2]

Biermann
 
GM
,
de Paiva
 
VCV
.
On an intuitionistic modal logic
.
Stud Log
 
2000
;
65
:
383
416
.

[3]

Dicher
 
B
,
Paoli
 
F
.
ST, LP and tolerant metainferences
. In:
Başkent
 
C
,
Ferguson
 
T
(eds),
Graham Priest on Dialetheism and Paraconsistency
.
Springer
,
2019
,
383
407
.

[4]

Doėn
 
K
.
Sequent-systems for modal logic
.
J Symb Log
 
1985
;
50
:
149
68
. .

[5]

Doėn
 
K
.
Logical constants as punctuation marks
.
Notre Dame J Formal Log
 
1989
;
30
:
362
81
.

[6]

Dummett
 
MAE
.
The Logical Basis of Metaphysics
.
Harvard University Press
,
1991
.

[7]

Fitch
 
FB
.
Tree proofs in modal logic
.
J Symb Log
 
1966
;
31
:
152
.

[8]

Garson
 
JW
.
Modal logic for Philosophers
.
Cambridge University Press
,
2013
.

[9]

Golan
 
R
.
There is no tenable notion of global metainferential validity
.
Analysis
 
2021
;
81
:
411
20
. .

[10]

Golan
 
R
.
A simple sequent system for minimally inconsistent LP
.
Rev Symb Log
 
2023
;
16
:
1296
311
. .

[11]

Golan
 
R
,
Hlobil
 
U
.
Minimally nonstandard K3 and FDE
.
Australasian J Log
 
2022
;
19
:
182
213
.

[12]

Hlobil
 
U
.
Faithfulness for naive validity
.
Synthese
 
2019
;
196
:
4759
74
.

[13]

Humberstone
 
L
.
Valuational semantics of rule derivability
.
J Philos Log
 
1996
;
25
:
451
61
.

[14]

Indrzejczak
 
A
.
Natural Deduction, Hybrid Systems and Modal Logics (Vol. 30)
.
Springer Science & Business Media
,
2010
.

[15]

Indrzejczak
 
A
.
Sequents and trees
. In
Studies in Universal Logic
.
Birkhäuser, Cham
,
2021
.

[16]

Kürbis
 
N
. Sketch of a proof-theoretic semantics for necessity. In:
Olivetti
 
N
,
Verbrugge
 
R
(eds),
Advances in Modal Logic 2020
.
Booklet of Short Papers
,
2020
,
37
43
.

[17]

Negri
 
S
,
von Plato
 
J
and
Ranta
 
A
.
Structural Proof Theory
.
Cambridge University Press
,
2001
.

[18]

Negri
 
S
.
Proof analysis in modal logic
.
J Philos Log
 
2005
;
34
:
507
44
.

[19]

Negri
 
S
.
Proof theory for modal logic
.
Philos Compass
 
2011
;
6
:
523
38
.

[20]

Ohnishi
 
M
,
Matsumoto
 
K
.
Gentzen method in modal calculi
.
Osaka Math J
 
1957
;
9
:
113
30
.

[21]

Parisi
 
A
.
A Hypersequent solution to the Inferentialist problem of modality
.
Erkenntnis
 
2022
;
87
:
1605
33
. .

[22]

Davies
 
R
,
Pfenning
 
F
.
A modal analysis of staged computation
.
J ACM
 
2001
;
48
:
555
604
. .

[23]

Poggiolesi
 
F
.
The method of tree-hypersequents for modal propositional logic
. In
Towards Mathematical Philosophy, Trends in Philosophy Series, Vol. 28
.
Springer
,
2009
,
31
51
.

[24]

Poggiolesi
 
F
.
Gentzen Calculi for Modal Propositional Logic (Vol. 32)
.
Springer Science & Business Media
,
2010
.

[25]

Read
 
S
.
General-elimination harmony and the meaning of the logical constants
.
J Philos Log
 
2010
;
39
:
557
76
. .

[26]

Restall
 
G
.
Proofnets for |${S}\_5$|⁠: Sequents and circuits for modal logic
.
Log Colloq
 
2007
,
2007
;
28
:
151
72
.

[27]

Restall
 
G
.
A cut-free sequent system for two-dimensional modal logic-and why it matters
.
Ann Pure Appl Log
 
2012
;
163
:
1611
23
. .

[28]

Restall
 
G
.
Structural rules in natural deduction with alternatives
.
Bull Sect Log
 
2023
;
52
:
109
43
.

[29]

Restall
,
G.
 
Speech acts and the quest for a natural account of classical proof
. https://consequently.org/papers/speech-acts-for-classical-natural-deduction.pdf

[30]

Sambin
 
G
,
Valentini
 
S
.
The modal logic of provability: the sequential approach
.
J Philos Log
 
1982
;
11
:
311
42
.

[31]

Schroeder-Heister
 
P
.
The calculus of higher-level rules, propositional quantification, and the foundational approach to proof-theoretic harmony
.
Stud Log
 
2014
;
102
:
1185
216
.

[32]

Valentini
 
S
.
The sequent calculus for modal logic |$D$|
.
Bolletino dell’Unione Matematica Italiana
 
1993
;
12
:
455
60
.

[33]

Wansing
 
H
.
Sequent calculi for normal modal propositional logic
.
J Log Comput
 
1994
;
4
:
125
42
. .

[34]

Wansing
 
H
.
On the idea of a proof-theoretic semantics and the meaning of the logical operations
.
Stud Log
 
2000
;
64
:
3
20
. .

[35]

Wansing
 
H
.
Displaying modal logic
. In
Trends in Logic Series, Vol. 3
.
Kluwer Academic Publishers
,
2013
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work properly cited. For commercial re-use, please contact [email protected] for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site-for further information please contact [email protected].