Ordinary superlative descriptions are well-known to provide safe harbor to negative polarity items (NPIs), as in the longest book anyone read. What is less well-known is that relative superlative descriptions also sometimes host NPIs, as in the loudest that anyone sang. We observe that this latter pattern is more general than has been previously described. In fact, relative superlatives can license NPIs outside of their own descriptions. On the one hand, we argue that this provides evidence that the superlative adjectives take sentential rather than nominal scope. But on the other, following insights in Howard 2014, we argue that traditional semantic accounts of scope-taking superlatives do not present the right monotonicity profile to account for the NPIs either. A recent, dynamic take on superlative semantics (Bumford 2017) is shown to do better.
1 Introduction
The absolute/relative ambiguity of definite superlative noun phrases in English has been studied from a variety of theoretical perspectives, including seminal work by Ross (1964), Heim (1985, 1999), and Szabolcsi (1986). In this article, we contribute to this discussion by examining the distribution of negative polarity items (NPIs) in and around superlative descriptions, especially on their relative readings. In particular, we attempt to account for the two facts below. The first, Generalization I, is not new (see, e.g., von Fintel 1999, Gajewski 2010, Howard 2014), but to our knowledge, Generalization II has not been previously documented or analyzed. Under common assumptions about the licensing requirements of NPIs, we will show that both of these observations bear on established debates concerning the semantics and composition of phrases containing superlative adjectives.
Generalization I
NPIs are licensed inside English definite superlative noun phrases, whether the superlatives are interpreted absolutely, as in (1a), or relatively, as in (1b). This is in contrast with plain (nonsuperlative) singular definite descriptions like (2), where NPIs are generally ungrammatical.
Generalization II
NPIs are also licensed outside English definite superlative noun phrases, but only when those superlatives are interpreted relatively, as in (3). Again this contrasts with plain definite descriptions like (4), which never license NPIs elsewhere in an utterance.1
(1) Mary read the longest book that anyone read.
- a.
✓Absolute reading: Mary read the unique book that was longer than any other book that was read.
- b.
✓Relative reading: Mary read a book that was longer than any book read by anyone other than Mary.
(2) *Mary read the long book that anyone read.
(3) Mary ran the club that gave the largest prize to any sophomore.
- a.
✗Absolute reading: Mary ran the club that gave some sophomore the prize larger than any other prize.
- b.
✓Relative reading: Mary ran the club that gave some sophomore a prize larger than any prize given to any sophomore by any other club.
(4) *Mary ran the club that gave the large prize to any sophomore.
In sections 2 and 3, we describe the problems posed by Generalization I for many well-known theories of superlatives, largely following the discussion in Howard 2014. In section 4, we sketch Howard’s solution to these problems and then show that this solution falls short of the data in Generalization II. In section 5, we argue that Generalization II likewise challenges any pragmatic or domain restriction account of relative superlatives. In section 6, we propose an alternative theory of superlative semantics, similar to that in Bumford 2017a, and show that it fares better. In section 7, we summarize our conclusions.
2 Two Classic Superlative Challenges
Superlative descriptions systematically license NPIs. Superlative descriptions are also systematically ambiguous. Both of these things are well-known and well-studied, though they are rarely studied at the same time. And it turns out that there is quite a bit of tension between popular accounts of the two phenomena. So we begin by discussing two common analyses, one developed to capture the distribution of NPIs, the other to derive the ambiguity. We show that they are incompatible.
2.1 Superlatives and NPIs
First, the NPI problem. Consider the contrast in (5). The subjects in both (5a) and (5b) are syntactically definite noun phrases (i.e., DPs whose surface determiner position is occupied by a definite determiner), but (5a) contains the superlative adjective highest, whereas (5b) does not. Correspondingly, the NPI ever is acceptable in the former and unacceptable in the latter. The subject in (5a) is called an absolute superlative: one of John’s grades is presupposed to be higher than all the others, and the description refers to this grade.
(5)
- a.
The highest grade John (ever) got was kind of low.
- b.
The grade John (*ever) got was kind of low.
We assume that (6b) is the meaning of the definite determiner, along the lines of Sharvy 1980, where w is a world of evaluation and g is an assignment function. The first argument, K, is the implicit, context-dependent domain restriction of the determiner. The second argument, P, is a polymorphic property, a function from things of type σ to truth values, where σ is any type. For perspicuity, we will often suppress the K argument, assuming the domain to be restricted to things that are relevant. The denotation in (6b) is designed to apply to singular, plural, and mass noun phrases alike, though in this article we will only be concerned with singular descriptions. In this case, P will be a property of atomic entities none of which are parts of one another, and the presupposition will amount simply to uniqueness: ∃!x ∈ K [P(x)].
Regarding NPI-licensing, we assume the theory developed in Ladusaw 1979, refined in von Fintel 1999, and further refined in Lahiri 1998, Guerzoni and Sharvit 2007, and Gajewski and Hsieh 2014, according to which NPIs are subject to the distributional constraint in (7c). See the just-mentioned works for independent motivation and applications to other kinds of presuppositional licensers. Note that this constraint expresses a necessary condition for NPI grammaticality. There are presumably many additional requirements—for example, that an NPI should be pronounced after its licenser—that play a role in judgments of acceptability, but we do not attempt to spell out a complete theory. The first point we wish to make is merely that many analyses of superlatives do not even clear this monotonicity hurdle.2
(7)
- a.
Polymorphic Strawson entailment
X
Y iff either (i) or (ii) is true.
- i.
X and Y are truth values, and X → N Y.
- ii.
X and Y are functions of type (σ, τ), and X(a)
Y(a) for every a in the domain of both X and Y.- b.
A function f is Strawson downward-entailing (SDE) iff f(Q)
f( P ) whenever f( P ) and f(Q) are both defined and P ⇒ Q. A function f is Strawson upward-entailing (SUE) iff f(P) f(Q) whenever f( P ) and f(Q) are both defined and P ⇒ Q.- c.
An NPI is licensed only if it occurs in the scope of an operator α such that
αw,g is SDE and not SUE, for any w and g. We sometimes refer to α as a licenser of the NPI.
The LF and meaning of (5b) are given in (8), as an example. The unacceptability of the NPI in the restrictor is explained by the fact that
thew,g is both SDE and SUE on its first (explicit) argument. This is guaranteed by the fact in (9), together with the compositionality of ·w,g. If there is exactly one linguist in K and exactly one phonologist in K, then they are the same person, and so any sentences that differ only in how they describe this individual will have the same denotation, and thus be mutually entailing.Whether or not the NPI in (5a) is licensed thus depends on the denotation assigned to the superlative. To begin with, let us compare the predictions of the superlative morpheme estvF in (10), based on von Fintel 1999 and Gajewski 2010, with the predictions of the superlative morpheme estH1 in (11), based on Heim 1999. Both denotations take as argument a relation between degrees and individuals R in addition to an individual x, and both require this individual to stand in the R relation to a degree that no other individual in the domain of R does. In other words, they both require there to be a degree that sets x apart from all other elements in the domain of R.
Where the analyses differ is in whether the nominal complement of the superlative is taken as an independent argument, P, or whether it is simply absorbed as part of the measuring relation R. This difference typically has no effect on the predicted meanings. For instance, (12a) provides the LF of (5a) according to the estvF proposal. The superlative morpheme applies first to its gradable complement high, of type (d, (e, t)), and then to the property-denoting nominal restrictor grade John got. (12b) provides the LF of (5a) according to the estH1 proposal. The superlative morpheme takes scope over the rest of the (d, (e, t))-type expression formed by abstracting over its degree-denoting trace. These two LFs have the same denotation, shown in (14). Note that we assume that gradable adjectives are downward monotone (see, e.g., Cresswell 1976). Accordingly, high denotes the relation that holds between any individual x and degree d if x’s height is at least d, as in (13).
(12)
- a.
[the [high estvF] [grade [2 [John (ever) got t2]]]] was kind of low
- b.
[the [estH1 [1 [high-t1 [grade [2 [John (ever) got t2]]]]]]] was kind of low
But crucially these two proposed superlative denotations have different monotonicity profiles. For any relation R,
estvFw,g(R) is SDE (but not SUE). To see this, let P ⇒ Q, and let x be such that estvFw,g(R)(P)(x) and estvFw,g(R)(Q)(x) are both defined. Then we have the following simple proof.- 1.
Assume
estvFw,g(R)(Q)(x) is true. - 2.
By definition of
estvF, then, ∃d [{x} = {x′ | Q(x′) ˄ R(d)(x′)}]. That is, for some d*:- (i)
{x} ⊆ {x′ | Q(x′) ˄ R(d*)(x′)}, and
- (ii)
{x} ⊇ {x′ | Q(x′) ˄ R(d*)(x′)}
- (i)
- 3.
From the first conjunct, it follows that R(d*)(x). And since P(x) holds by assumption, {x} ⊆ {x′P(x′) ˄ R(d*)(x′)}.
- 4.
From the second conjunct, it follows that {x} ⊇ {x′ |Q(x′) ˄ R(d*)(x′)} ⊇ {x′ | P(x′) ˄ R(d*)(x′)}, given the assumption that P ⇒ Q (i.e., {z | Q(z)} ⊇ {z | P(z)}), and the fact that A ∩ Z ⊇ B ∩ Z whenever A ⊇ B.
- 5.
So ∃d [{x} = {x′ | P(x′) ˄ R(d)(x′)}], namely, d*.
- 6.
By definition of
estvF, then, estvF(R)(P)(x) is true. - 7.
Thus,
estvF(R)(Q) estvF(R)(P), completing the proof.
To see that
estvFw,g(highw,g) is not SUE, simply set P := λy. y ∈ {1, 2} and Q := λy. y ∈ {1, 2, 3}. Clearly P ⇒ Q, but estvFw,g(highw,g)(P)(2) is true while estvFw,g(highw,g)(Q)(2) is false.Since the NPI in (14a) occurs in the scope of highest, the estvF denotation can claim to predict its grammaticality. By contrast, the estH1 denotation does not predict this, because
estH1w,g is not SDE. This is shown by the model below, with three books A, B, and C, and two relations R and S.Clearly S ⇒ R. And
estH1w,g(R)(A) is true, since, for instance, 500 is a length that sets A apart in the R column from every other book. That is, {A} = {x′ | R(500)(x′)}. But estH1w,g(S)(A) is false, since there’s no degree that only A stands in the S relation to; A and B have the same degrees in the S column. So S ⇒ R does not guarantee that estH1w,g(R) estH1w,g(S).2.2 Superlatives and Scope Ambiguity
On the other hand, estH1 leads quite naturally to an account of the perceived ambiguity in sentences like (15).
(15) John got the highest grade.
- a.
✓Absolute: John got a perfect score (the grade higher than any other grade).
- b.
✓Relative: John got a higher grade than anyone else did.
Both readings are predicted if estH1 is able to take scope outside of its containing noun phrase. In the relative LF, movement of estH1 across the is hypothesized to neutralize the usual uniqueness presupposition of the definite determiner; on the relative reading, it need not be the case that one of the received grades was in fact higher than all of the others. This is an admittedly awkward detail, but since it is the subject of much debate (see, e.g., Szabolcsi 1986, Heim 1999, Sharvit and Stateva 2002, Coppock and Beaver 2014, Bumford 2017a), we stipulate it here and press forward. For clarity, we will display these nondefinite uses of the in gray.
In a similar fashion, estH1 accounts for the unambiguously relative readings of adverbial superlatives, as in Mary sang the loudest—meaning Mary sang louder than anyone else—and of amount superlatives, as in Mary read the most books—meaning Mary read more books than anyone else.
It is hard to see how estvF could account for such readings. The only real difference between estvF and estH1 is the initial property-type argument that the former takes. As we showed, this argument was crucial to predicting the grammaticality of NPIs in absolute superlative descriptions, since it provided a genuinely negative environment, quarantined from the nonmonotonic influence of the gradable predicate. But it is precisely this argument that seems to gum up the scope-taking works. Once estvF has combined with its adjectival and nominal complements, it no longer denotes a higher-order function of the sort that is capable of quantifying over its own continuation, so it does not have the right type to raise. And yet estvF cannot take scope before combining with these complements, as in the estH1 derivations above, because there is nowhere for it to land that provides the crucial (SDE) property-type argument P.
In any case, as far as we are aware, all of the literature since Szabolcsi 1986 that treats the absolute/relative distinction as a genuine scope ambiguity has abandoned the simple estvF denotation, so for now, let us concede that it does not generate relative readings. In sections 5 and 6, we return to the possibility that this is mistaken and that estvF, or something similar, can account for relativity after all (see also online appendix B, https://doi.org/10.1162/ling_a_00409).
Now, if relative truth conditions are altogether beyond the reach of estvF, then it seems fair to say, a fortiori, that estvF does not explain how NPIs can be licensed in relative readings, as in (21).
(21)
- a.
In this class, John got the highest grade anyone got.
- b.
Mary sang the loudest anyone sang.
- c.
Mary read the most books anyone read.
Likewise, even though estH1does generate relative truth conditions, it does not predict the felicity of the NPIs in (21) any more than those in absolute descriptions (see Howard 2014 for more on this point). The reason is the same: estH1 only has one (functional) argument, R, and so if these NPIs are in the scope of the superlative, then they must occur as part of the constituent that delivers this R relation. But
estH1w,g is not SDE with respect to this argument.To sum up,
estH1w,g generates both absolute and relative readings, but it is not SDE, and so does not manage to license NPIs in either case. In contrast, estvFw,g(R) is SDE for any R, but it derives only absolute readings, and so it both undergenerates truth conditions and fails to explain the broad distribution of NPIs in the context of superlatives.3 Superlatives and Focus: A Sentence-Adverbial Analysis
In this section, we discuss a third potential denotation for the superlative morpheme, also due to Heim (1999). Like estH1, this third analysis straightforwardly derives relative readings. Additionally, Howard (2014) has shown that it can be leveraged to provide an explanation for the felicity of NPIs in certain relative configurations, including the examples in (21). Unfortunately, as Howard notes, it is very hard to see how this sort of explanation could be extended to cover the basic absolute cases handled so smoothly by estvF. We try our best to find suitable LFs, but do not succeed. Finally, we point out that even if the estH2 morpheme that we introduce could ultimately provide a satisfactory account of absolute superlative descriptions containing NPIs, it would still fail to license NPIs that fall under Generalization II. That is, it would not explain the grammaticality of NPIs that occur outside the superlative noun phrase itself.
3.1 The Adverbial Superlative and Monotonicity
Of the various alternative accounts of relative readings available in the literature, the following focus-based analysis explored in Heim 1999 has probably been the most influential. One of the motivations for this analysis is the fact that relative interpretations seem to vary with prosodic stress, as illustrated in (22) and (23).
(22) JOHN gave Mary the highest grade.
- a.
✓John gave Mary a higher grade than anyone else gave Mary.
- b.
✗John gave Mary a higher grade than he gave anyone else.
(23) John gave MARY the highest grade.
- a.
✗John gave Mary a higher grade than anyone else gave Mary.
- b.
✓John gave Mary a higher grade than he gave anyone else.
The superlative morpheme estH2 is assumed to be a focus-sensitive sentence-level adverb, in the mold of adverbial only. The prejacent of estH2 contains a focused expression, often called the correlate, and denotes a property of degrees P of type (s, (d, t)). The restrictor of the superlative C is the set of focus alternatives to its prejacent. Focus alternatives are computed in the style of Rooth 1992. In particular,
α ~ Cw,g imposes the requirement that C is a subset of the focus semantic value of α.3 For any given degree property P and set of alternative degree properties C, estH2w,g(C)(P) is true iff there is a degree that sets the prejacent P apart from all of the other degree properties in C.As before, movement of estH2 to the sentence-adverbial position is assumed to reduce the to a determiner with merely existential force. For instance, the LFs in (25) and (26) deliver the relevant truth conditions of (22) and (23).
So estH2 generates relative readings. What does it predict about NPIs? The derivations above would seem to suggest that an NPI like that in (21a) is contained in the superlative’s prejacent P. After all, as Heim (1999) would have it (following Rooth 1992), the restrictor C is an implicit comparison class representing the formal question that the prejacent is taken to address, namely, its focus alternatives. Unfortunately, for a fixed comparison class C,
estH2w,g(C) is not SDE (Howard 2014). To see this, consider the world w characterized by the model below. There are five singers and two degree properties per singer, Pz and Qz. These are the ingredients that would be relevant for the sentences in (27).In this world, yesterday’s performance was nobody’s loudest; in every row, there is a higher value in the Q column than in the P column. Now, let C contain Pz and Qz for each singer z. That is, let it contain for each z the property corresponding to songs z performed sometime yesterday, as well as the property corresponding to songs z performed sometime this week:
Let P ≡ Pcal and Q ≡ Qcal. Clearly P and Q are both in C, so that
estH2w,g(C)(P) and estH2w,g(C)(Q) are both well-defined. And clearly P ⇒ Q, since in any given world w′, the decibels that Cal’s singing reached yesterday are a subset of the decibels that Cal’s singing reached at some point this week. Moreover, since there is a degree d* (namely, 60) such that Q(w)(d*) is true while Q′(w)(d*) is false for any other Q′ ∈ C, it follows that estH2w,g(C)(Q) is true. But estH2w,g(C)(P) is false; there is no degree in P(w) that is not in Q′(w) for any other Q′ ∈ C.So estH2w,g(C)(Q) estH2w,g(C)(P), and thus estH2w,g(C) is not SDE.4In other words, Cal might well be responsible for the loudest of all the songs performed this week, but that does not mean that Cal is responsible for the loudest of all the songs performed yesterday, even if Cal (and everybody else) did in fact sing at some point yesterday.
However, Howard (2014) points out that
estH2w,gis itself SDE. That is, it is SDE “on its first argument.” The proof is exactly the same as the proof for estvFw,g, except that the variables have higher types. Let P be an arbitrary degree property, type (s, (d, t)), and let C and D be sets of such degree properties such that C ⊆ D, with P ∈ C and P ∈ D.- 1.
Assume
estH2w,g(D)(P) is true. - 2.
By definition of
estH2w,g, then, ∃d [{P} = {Q ∈ D | Q(w)(d)}]. That is, for some d*:- (i)
{P} ⊆ {Q ∈ D | Q(w)(d*)}, and
- (ii)
{P} ⊇ {Q ∈ D | Q(w)(d*)}
- (i)
- 3.
From the first conjunct, it follows that P(w)(d*).
- 5.
From the second conjunct, it follows that {P} ⊇ {Q ∈ D | Q(w)(d*)} ⊇ {Q ∈ C | Q(w)(d*)}, given the assumption that D ⊇ C.
- 5.
So ∃d [{P} = {Q ∈ C | Q(w)(d)}], namely, d*.
- 6.
By definition of
estH2w,g, then, estH2w,g(C)(P) is true. - 7.
Thus,
estH2w,g(D)(P) estH2]w,g(C)(P), completing the proof.
Recognizing this, Howard hypothesizes that the grammatical NPIs in (21) must somehow be contained in a constituent that denotes this restrictor argument of the superlative. That is, where the traditional Heimian analysis of relative readings treats C as an implicit parameter generally associated with the focus structure of the superlative’s prejacent, Howard suggests that in some cases this comparison class argument is actually expressed in the sentence. And in exactly these cases, NPIs might appear felicitously inside the constituent that expresses this comparison class. We now present a slightly amended version of that proposal, which we hope is faithful in all crucial respects. As stated in the introduction to this section, the goal in section 4.2 will be to then work backward from this analysis to try and use estH2 to account for NPIs in absolute readings. But we will not succeed in this. What’s more, in section 4.3 we will show that NPIs also appear felicitously inside constituents that cannot possibly be part of this comparison class argument.
4 Superlative Antecedent-Contained Deletion: Howard 2014
4.1 Superlative Degree Clauses
Suppose that in addition to the usual association-with-focus mechanism depicted above, there is a syntactic means of forming a restrictor of the appropriate type for estH2. As we have shown, the Roothian route never licenses NPIs, since its only SDE environment is an unpronounced parameter. But the second route ought to license them in the constituent that forms this restrictor. Howard (2014) proposes that this syntactic mechanism involves a kind of relative clause ellipsis familiar from comparative antecedent-contained deletion (ACD), together with an Alternative Semantics approach to indefinites (cf. Kratzer and Shimoyama 2002), or at least to NPIs.
To formalize this, we first assume that sentences like those in (21)—based on Howard’s crucial relative readings with NPIs—contain elliptical material in their relative clauses, as is common for analogous sentences with comparatives. For instance, whereas (28a) is often derived from an LF like that in (28b) (see, e.g., von Stechow 1984, Rullmann 1995), we might assume (29a) is derived from an LF along the lines of (29b).
(28)
- a.
John sang louder than Mary sang.
- b.
[[er [1 [Mary sang
t1 loud]]] [1 [John sang t1-loud]]]
(29)
- a.
Mary sang the loudest that anyone sang.
- b.
[[est [1 [anyone sang
t1-loud]]] [1 [Mary sang t1-loud]]]
The difference between the comparative case of ACD and the superlative case is that the superlative’s relative clause ought to denote a set of alternative degree properties (all those that the prejacent is taken to exceed), whereas the comparative’s relative clause denotes a single competitor to its prejacent. So we follow Howard in borrowing Karttunen’s (1977) singleton-forming operator ? (cf. also Partee’s (1986) type-shifter IDENT) and in assuming that indefinite NPIs generate alternatives (cf. Krifka 1995, Chierchia 2013).
With this, we can derive the truth conditions of (29) as in (31). Because the NPI anyone is contained within the superlative’s restrictor, it is correctly predicted to be grammatical. Analogous derivations involving ever, which we set aside to avoid complications due to tense and aspect, are provided in online appendix A (https://doi.org/10.1162/ling_a_00409).
As Howard notes, ACD per se is not required for any semantic reason, but the presuppositions of
estH2w,g do impose a very strong requirement on the relationship between the meanings of the restrictor and nuclear scope here. Recall that for estH2w,g(C)(P) to be well-defined, P must be an element of C. That is, the prejacent must actually be one of the degree properties up for comparison. When C is derived from the focus structure of the prejacent itself, this is guaranteed. But when C is generated compositionally from additional elements in the sentence, it is possible for this presupposition to fail. For instance, Howard observes that the sentences in (32) are deviant. The first would require that the degree property (λdλw′. Mary readw′d-many books) be among the set of degree properties {λdλw′. z publishedw′d-many books | personw(z)}, which is impossible; the second would require that (λdλw′. Mary sangw′d-loud) be in the set {λdλw′. z sangw′d-loud | baritonew(z)}, which is possible, but requires Mary to be a baritone.(32)
- a.
*Mary read the most books that anyone published.
- b.
#Mary sang the loudest that any baritone sang.
We will come back to this “matching” generalization in section 7. First, we discuss some undergeneration issues of the estH2 + ACD approach.
4.2 Superlative ACD and Absolute Readings
The denotation for estH2 was expressly designed to account for relative readings. But what, then, of absolute readings? It is certainly conceivable that relative superlatives are composed from estH2 while absolute superlatives are composed from estvF (which is SDE on its (e, t)-argument, as we noted in section 2.1). However, in the absence of independent evidence for the existence of two distinct underlying superlative morphemes, we would expect there to be a way to form absolute superlatives with estH2.5 The LF in (33) shows that this is technically possible if estH2 associates with a focused trace bound by the definite determiner (see also Romero 2013).
The truth conditions in (33b) express the intuitive meaning of (33), but any NPI in the nominal description of an LF like this would appear in the (non-SDE) scope of
estH2w,g(C), rather than in the (SDE) restrictor C. Howard attempts to derive the absolute reading from other possible estH2-based LFs that would place an NPI in the restrictor argument, but he ultimately abandons the effort.We offer one additional unsuccessful stab at this, if only to demonstrate how elusive the proper LF would have to be. As we have described, according to Howard, there are two ways to obtain relative readings when the superlative takes scope outside a morphologically definite noun phrase: association with focus, as in (22), and alternative generation via ACD, as in (31). Suppose the same two grammatical mechanisms are available when the superlative takes scope inside a morphologically definite noun phrase. That is, the restriction might be derived by association with focus, as in (33), or it might be derived via ACD in the presence of alternative-generators, as in (34). Note that we assume the elided pronoun pro6, which is free in the prejacent, is bound by a widest-scope existential closure operator. And we appeal to common A and IDENT type-shifters to facilitate composition with relative clause copies, in the spirit of the usual Trace Conversion techniques in, for example, Fox 2003, Heim 2018. As elsewhere in this article, struck-out phrases are elided.
The sentence is predicted to be true iff, for some choice of g6, John climbed the unique z such that (i) z is a mountain seen by g6, (ii) z is higher than any other mountain seen by g6, and (iii) z is higher than any mountain seen by anyone other than g6. If there is such a g6, there is only one; only one person can have seen a mountain higher than any mountain seen by anyone else. And if there is such a g6, then the absolute superlative description picks out the highest mountain that g6 saw (so long as there is, indeed, a single mountain higher than any others among this person’s viewing history). What’s more, the NPI anyone is now in the restrictor of estH2, and so is correctly predicted to be grammatical.
Unfortunately, the truth conditions spelled out above are not quite right. They predict that in order for the description to be well-defined, there must be a single person g6 who saw a mountain higher than any mountain seen by anyone else. But empirically, the highest mountain anyone saw could in fact have been seen by many people, so long as it is higher than any other seen mountain. The error here is telling. Part of the predicted felicity conditions for the absolute description are correct: there cannot be ties among the seen mountains; one of them really should be higher than the others. But we have ended up predicting that in addition, there cannot be ties among the mountain seers; one of them is incorrectly required to have outseen all the others.
Worryingly, the bad prediction here arises precisely because this is exactly the prediction we want to make for relative superlatives. For instance, the person who climbed the highest mountain was g6 really does require that g6 climbed a higher mountain than any mountain climbed by anyone else. In going so far out of our way to make this absolute description look like a relative one—for the sake of trapping the NPIs in an SDE position—we lose track of what it is we’re comparing.
We reluctantly conclude with Howard, then, that absolute superlatives cannot be derived from estH2 in a way that would explain their capacity to license NPIs.
4.3 Relative Readings with Description-External and Nonmatching NPIs: Generalization II
The problem of extending estH2 to absolute superlatives is not a knock-down argument against Heim’s second analysis since, as noted above, it is conceivable that the superlative morpheme is ambiguous, and that absolute superlatives are formed with estvF instead. More troubling is the fact that the analysis cannot account for all attested NPIs in relative readings. The examples in provide a variety of problematic cases—some constructed, some lightly edited from text found online—all of which can (or even must) be interpreted relatively despite containing currently unlicensable NPIs.
In particular, the NPIs in (35a–e) are altogether outside the relative superlative descriptions, whether those descriptions are argumental, as in (35a,b,e,f), or adverbial, as in (35c). And though the NPIs in (35f–i) are inside their superlative descriptions—where we have come to expect them—these descriptions violate Howard’s (2014) “matching” criterion. In (35f,g,i), for instance, the description-internal NPIs occur in arguments of the noun, not relative clauses that might restrict the superlative morpheme, and even in (35h), the NPI-bearing relative clause has nothing to do with the matrix clause.
Note that all of the examples become ungrammatical if the superlatives are replaced by plain definite or plain adverbial expressions, as in (36). This strongly suggests that it really is the superlative that is responsible for the grammaticality here.6 Also note that in the presence of description-external NPIs, even those superlatives that are in general ambiguous between absolute and relative interpretations, like highest in (35a) and largest in (35e), have only relative interpretations in these sentences. If the NPIs are removed or replaced with plain indefinites, absolute readings reemerge.
(35)
- a.
The judge who gave the highest score to any rookie later regretted it.
- b.
Economics is the field in which the fewest women have ever won a Nobel Prize.
- c.
Which car went the fastest during any 1-second point of the race?
- d.
John has donated the most money to any third-party candidate.
- e.
The zoo that the largest animal has ever escaped from is in Kansas City, and it was a rhinoceros.
- f.
Which would you say is the company that has the best excuse for any security breach so far?
- g.
Mary recommended the student who wrote the harshest critique of any book on the syllabus.
- h.
Of all the perturbations examined, PIC treatment at 1 week resulted in the smallest GnRH-1 cells that exhibited any peaks.
- i.
Our goal with this satellite is to capture the best possible image of any asteroid in the Kuiper Belt.
(36)
- a.
*The judge who gave the high score to any rookie later regretted it.
- b.
*Economics is the field in which the few women (on stage) have ever won a Nobel Prize.
- c.
*Which car went really fast during any 1-second period of the race?
To be clear about the problem that these sorts of examples pose, look at the subject of (35a), repeated in (37). The right meaning, paraphrased in (37a), could be generated by the LF in (37b), which employs the traditional association-with-focus mechanism introduced in section 3.1. But as we have shown, the NPI in this LF would not be licensed, since it occurs in the nuclear scope of
estH2w,g(C), not the restrictor C.And there is no obvious well-formed LF in which the NPI occurs in a constituent that would deliver a set of degree properties to the superlative, even with extremely liberal notions about ellipsis. Certainly the NPI does not occur in a relative clause that the superlative c-commands, as in Howard’s (2014) examples. And even if the entire overt relative clause containing both the superlative and the NPI could play this role, say, as in (38), we doubt very much that the correct meaning could be derived. For instance, (38) denotes, for some choice of rookie g6, the unique judge z who gave g6 a higher score than theyz gave to any other rookie. This is not what (37a) describes. The problem is that in Howard’s cases, the NPI really does constitute the intuitive comparison class, from which the properties the superlative quantifies over are generated. But in (37a), the NPI is incidental to the comparison, which is between judges, not rookies. What’s more, how could this ever help with examples like (35d), where the superlative clearly does associate with the (focused) subject, just as it does in paradigm cases like JOHN gave Mary the highest grade?
Summing up, then, we have shown that the two most common scope-based approaches to relative readings, estH1 and estH2, are both predicted not to license NPIs in places where they do in fact occur. The former has only one explicit argument for the constituent it scopes over, but this argument is not SDE. The latter has been argued to at least sometimes pick up both of its arguments compositionally, one of which is indeed SDE (and not SUE). So in these circumstances, NPIs that occur in that restrictor argument are accounted for. But on the one hand, we found no way to extend this sort of analysis to absolute superlatives, and on the other hand, NPIs in the scope argument remain to be explained.
In the next section, we therefore approach the problem from the other direction. If popular analyses of relative superlatives cannot be massaged into an adequate account of NPI-licensing, maybe our original analysis of absolute superlatives—which was already SDE exactly where we wanted it to be—can be massaged to derive relative readings.
5 Explaining Away the Absolute/Relative Ambiguity
In fact, the literature on superlatives (e.g., Heim 1999, Farkas and É. Kiss 2000, Sharvit and Stateva 2002) has considered various ways to treat relative readings as special cases of absolute ones. We explore a specific implementation of this idea that keeps estvF as the only superlative morpheme.
Suppose John climbed the highest mountain presupposes that there is a mountain that is highest among some contextually relevant set of mountains C. This might be predicted by the LF in (39a), where the domain restriction of the definite article (which has so far been suppressed) is a variable that picks out C.7
(39) John climbed the highest mountain.
- a.
John climbed [the-C [[high estvF] mountain]]
Now, there are many ways to be relevant if you’re a mountain. For starters, you could just be yourself, since we’re talking about mountains. Alternatively, you could belong to some subset of climbed mountains, since we’re talking about mountain climbing. Suppose that the readings that we call “absolute” and “relative” are merely reflections of these different choices for the domain of quantification. The freedom to choose different values for C perhaps even avoids the problem that on its relative reading, John climbed the highest mountain can be felicitous (and true) when John climbs more than one mountain of the same height; when evaluating the sentence, the hearer may exclude from C some mountains that John climbed, as long as one of John’s highest remains.
Yet there are still obvious problems with this approach when it comes to ties. If John and Mary climbed the same mountain, and it is higher than any mountain climbed by anyone else, then (39) is false, but it is predicted to be true if defined. If John and Mary climbed distinct mountains of equal height, then the negation of (39)—John didn’t climb the highest mountain (he and Mary were tied)—is true, but it is predicted to be undefined, unless John’s highest mountain is (implausibly) excluded from C. This makes the theory quite weak. Still, the predictions of estvF regarding NPI-licensing, in both absolute and now “relative” readings, are worth exploring. If the approach turns out to be advantageous from the perspective of polarity, it might make sense to seek a pragmatic solution to these definiteness problems.8
Let us, then, temporarily ignore the complications with existence and uniqueness, and assume that estvF is the only superlative morpheme in English. Being SDE,
estvFw,g(R) is predicted to license description-internal NPIs in both absolute and relative scenarios for any R. However, accounting for description-external NPIs, and explaining why these obligatorily disambiguate toward relative readings, is not a straightforward task. Consider the case in (37) again.(40) The judge who gave the highest score to any rookie was Mary.
- a.
✓Mary gave a higher score to a rookie than any other judge did.
- b.
the [judge [5 [t5 [gave [the-C [[high estvF] score]] [to any rookie]]]]]
The NPI any rookie plainly does not occur in any argument of estvF, regardless of what domain C is understood to denote. What could be done about this? Since the superlative description in (40b) is necessarily interpreted under a relative-like domain restriction—where C is the set of scores given by judges to rookies—perhaps we should look to the similar relativizing effect that focus is known to produce. The LFs in (41) and (42) exploit the same focus-association mechanism that powers estH2 (cf. Heim 1999): [∪ C] is the restrictor of the, and the-[∪ C] estvFscore is raised to a presubject position, forcing C to receive, via association with focus, a value that we identify as “relative.”9
(41) MARY gave John the highest score.
- a.
[the-[∪ C] [[high estvF] score]] [[2 [MaryF gave John t2 ]] ~ C]
- b.(41a) is defined only when
C ⊆ {λx. gavew(j)(x)(z) | z ∈ De }, and thus
∪ Cw,g ⊆{x | ∃z [gavew(j)(x)(z)]} (i.e., things given to John).If defined,
(41a) is true iff Mary gave John the highest of the scores that were given to John.
(42) Mary gave JOHN the highest score.
- a.
[the-[∪ C] [[high estvF] score]] [[2 [Mary gave JohnFt2 ]] ~ C]
- b.(42a) is defined only when
C ⊆ {λx. gavew(y )(x)(m) | y E De }, and thus
U Cw,g ⊆ {x | ∃y [gavew( y )(x)(m)]} (i.e., things Mary gave).If defined,
(42a) is true iff Mary gave John the highest of the scores that she gave.
Following this strategy, the relative reading of (40) would include the LF in (43) as part of the derivation of its relative clause. But this hardly helps in explaining the felicity of the description-external NPI, since any rookie still does not occur in an argument of the superlative.
(43) t5 gave the highest score to any rookie
- a.
[the-[U C] [[high estvF] score]] [[2 [[t5 ]F gave t2 to any rookie]] ~ C]
- b.
[(43a)] is defined only when
C ⊆ {λx. ∃y [rookiew(y) ˄ gavew( y )(x)(z)] | z ∈ De }
∪ Cw,g ⊆ {x | ∃z∃y [rookiew( y ) ˄ gavew(y)(x)(z)]}.If defined,
(43a) is true iff John gave a rookie the highest of the scores that were given to a rookie.
But notice that now, the meaning of the superlative description is in an odd way entangled with a constituent containing the NPI, such that any variation in the content of the NPI would influence the meaning of the superlative description, by way of its association with focus. One might hope then that we could establish some sort of SDE pattern in the position of the NPI at some constituent containing both the NPI and the entangled superlative. For instance, we might expect (44)—the function represented by abstracting over the grammatical NPI’s position in (43)—to be SDE. This would be in line with theories that predict NPI acceptability by testing whether one syntactic constituent is SDE with respect to a larger constituent containing it, rather than by testing the monotonicity of a particular operator that scopes over it (Heim 1984, Gajewski 2005, Homer 2008).
But this is a fraught question. Different values for t9—
a rookie, a player, a person, and so on—will lead to different (usually incompatible) constraints on C, thanks to the presuppositions imposed by focus. The focus antecedent C does not vary with the value of t9. In any given context, it will be fixed, even as different arguments are passed into (44). So when (44) is applied to a rookie, every property in C will have to pick out, for some z ∈ De, the complete set of scores that z gave to rookies. But when (44) is applied to a player, every property in C will have to pick out, for some z ∈ De, the complete set of scores that z gave to players. So in any context where judges all gave at least one nonrookie a score they didn’t give to any rookie, C will have to be empty in order for both functions to be well-defined. But then of course the superlative description itself will be undefined, since there won’t be anything in the definite’s domain restriction.10For a pattern like this to be the explanation for the acceptability of (40), then, the licensing criterion on NPIs would have to be relaxed even further. Perhaps it would be enough simply for (45a) to Strawson-entail (45b), but not vice versa (given that
rookiew,g ⇒ playerw,g). Note that the superlative in (45a) associates with focus in the usual way, but the superlative in (45b) receives its domain restriction [∪ C] merely from the context of the monotonicity test itself. The C is identified with whatever set of properties is taken to establish the (relative) comparison class in (45a).(45)
- a.
[the-[∪ C] [[high estvF] score]] [[2 [MaryF gave t2 to any player]] ~ C]
- b.
[the-[∪ C] [[high estvF] score]] [2 [MaryF gave t2 to any rookie]]
But even this test fails. The LF in (45a) is true when (i)
∪ Cw,g is a set of scores given to players, (ii) there is a single score in ∪ Cw,g higher than all the others, and (iii) Mary gave this score to some player. The definite/superlative presuppositions of (45b) add nothing to this, since they are already factored into the truth conditions of (45a). The focus marking itself in (45b) perhaps contributes a presupposition that someone gave this score to a rookie. But still, all of this together does not entail (45b); Mary might have given this highest score to some nonrookie player while someone other than Mary gave it to a rookie. And note that switching which of the two LFs the comparison class is associated with does not help. If [U C]w,g is the set of scores assigned to rookies rather than players, then (45a) is true when Mary gave some player the highest score given to any rookie. This again does not entail that Mary gave some rookie that score (though somebody must have).Not that any of it matters, because for absolutely any domain restriction on the definite/ superlative, (45b) is going to (classically!) entail (45a). The superlative description in both LFs, if it is well-defined, will refer to the same object—say, s*. If the latter LF is true, then Mary gave s* to a rookie, and therefore to a player, so the former LF is true. So the position of the NPI couldn’t possibly be SDE without also being SUE. Presumably this point extends to any analysis of relative readings that does not give the superlative description-external scope. If the superlative is really contained in its own definite description, then no matter what object that description refers to, the superlative will be powerless to influence the monotonicity of positions elsewhere in the sentence.
This brings us to the conclusion that classic unifications of absolute and relative readings under the umbrella of domain restriction are in no position to account for the general pattern of NPI-licensing in relative readings. We are led then to seek an alternative analysis of relativity, one that gives the superlative genuine scope over all of the syntactic material that influences its interpretation, but nevertheless treats that material as if it were part of its restriction or comparison class, rather than part of its measure. The next section presents one such analysis.
6 Superlatives as Quantifiers on Sets of Assignments: Bumford 2017a
The polarity data paint the following picture:
Ignoring the definite article for a moment, we have shown that the classic, in-situ, absolute superlative estvF takes two arguments, a measure function R and a comparison class of things to measure P. In (46a), for instance, R is the denotation of the adjective,
high, and P is the denotation of the rest of the noun phrase, the set of scores given to rookies. The comparison class argument is SDE (but not SUE). The wide distribution of acceptable NPIs around relative superlatives suggests that their LFs ought to look similar. In (46b), for instance, the first argument of the superlative should again be a measure function determined by the gradable adjective, high, and everything else should be contained in a second argument that behaves like P, an SDE (but not SUE) comparison class.Unfortunately, in (46b) the superlative’s second argument cannot simply denote exactly what the DP of (46a) denotes—the set of scores given to rookies—because the result of evaluating (46b) should not be a score; it should be a proposition that is true iff John was the most generous rookie-scorer. We would do a little better to have the CP denote a relation between judges and the scores they assigned to rookies. Then the superlative might be reanalyzed to use focus marking to ensure that John is the argmax of this relation. But this would strain any sense of uniformity between absolute and relative readings, and certainly the sense of a unified account of NPI-licensing, since in absolute derivations the NPIs would be in a genuine comparison class but in relative derivations the NPIs would be in some sort of “association relation” (Farkas and É. Kiss 2000). In any case, this relation argument would not be SDE (see online appendix B).
We are aware, though, of one analysis of superlatives that has the desired shape. Bumford (2017a,b) offers a dynamic account of superlative operators, where adjectives like highest denote filters on sets of satisfying assignment functions, and can be applied uniformly to sets of individuals that might witness the referent of a description, as in (46a), and sets of propositions that might witness the truth of a sentence, as in (46b). We first present a simplified version of this analysis and then prove that it has the desired monotonicity properties.
6.1 A Simple Dynamic Semantics
As a bridge between the sorts of static analyses of superlatives in Heim 1999, von Fintel 1999, and Howard 2014 presented so far and the dynamic analysis on offer in Bumford 2017a, we introduce two object-language operators in (47). The first simply stores the current assignment together with the value of its complement in a singleton set (cf. Karttunen’s (1977),
? and Partee’s (1986) IDENT, both used in section 4). The second is effectively a “nondeterministic” analogue of Heim and Kratzer’s (1998) abstraction index: given a set of possible values G, it shifts the assignment at which its complement β is evaluated once for each element of G. For comparison, the traditional abstractor is shown in (47c). Both of these binders will be assumed to adjoin to the sisters of moved DPs, as is standard, with the choice between them determined by the semantic type of the DP; set-denoting DPs will require ★n, others the plain n.This modular formulation of dynamic techniques loosely follows Charlow 2019. There are “static” derivations in which binding information is only part of the input to the evaluation function (
·g), alongside “dynamic” derivations in which binding information is also part of the output of the evaluation function (·g = {a, g′ | . . . }). This is illustrated in (48).11 In annotating these trees with denotations, there is the usual awkwardness of having to choose a concrete assignment to display for nodes in the scope of a binding index, even though this binder guarantees that the node will be evaluated at many different assignments. We choose to describe the denotation of every constituent as if it were evaluated at the formal parameter g.To this we add the nondeterministic indefinite defined in (49).12 Note that
a NPg is essentially the characteristic set of the restrictor NP, except that each potential witness x is paired with an assignment that witnesses the truth of evaluating NPg at x. Together with the ★ abstraction operator, this allows all of the possible referents for the indefinite to modify the current assignment “in parallel.” This is shown in (50b); (50a) repeats (48b) for comparison.To simplify derivations of complex DPs, we follow Heim 1997 in allowing a determiner to bind the “subject” position of its nominal as well as the gap in that nominal’s relative clause. Schematically: [DP Det n [NP [tn Noun] [CP who . . . tn . . . ]]]. This is in no way essential, and derivations can be mechanically converted to the format in Heim and Kratzer 1998. The LFs in (51) provide a template. In (51a), a single abstraction index binds the argument position of student as well as the object of saw in the relative clause. The MOD mode of combination is simply (dynamic) conjunction:
[α β]g := {p ˄ q, k | p, j ∈ αg, q, k ∈ βj}. In (51b), a vacuous subject PRO binds the argument position of student, and an implicit relative pronoun binds the object of saw (cf. Heim and Kratzer 1998:sec. 8.5). The MOD mode of combination is (dynamic) predicate conjunction: [α β]g := λx. {p ˄ q, k | p, j ∈ αg(x), q, k ∈ βj(x)}.6.2 Superlatives as Dynamic Operators
Building on a compositional semantics along these lines, Bumford (2017a) hypothesizes that superlative adjectives filter sets of possible evaluation outputs—value-assignment pairs—by measuring them along the coordinate that their DP is associated with. As shown in (52a), the denotation of the superlative morpheme itself takes a comparative relation as argument,13 and returns a function from sets of outputs to sets of outputs, sometimes called a postsupposition (Brasoveanu 2013). For instance,
highestng surveys a set of results and keeps only those whose assignments are undominated in the height of the value they assign to n.To avoid some of the complications mentioned earlier regarding the definite determiner in superlative descriptions, we make two simplifying assumptions in this section:
- 1.
Use of a superlative adjective is only felicitous if its comparison class argument G initially contains several distinct values (usually at least three), but only one of these remains after nondominant outputs have been removed. In other words, there should be at least a few candidates, but only one winner. Formally, for any input g and set G such that |{a |
a, h ∈ G} | > 1, it must be true that |{a | a, h ∈ highestng (G)}| = 1. - 2.
As in the classic scope-theoretic accounts (Szabolcsi 1986, Heim 1999), the definite determiner of a superlative description is interpreted as if it were in fact indefinite. In this dynamic setting, that means that it introduces a set of potential witnesses for its restrictor and pairs each one with an independently modifiable assignment, exactly like
ag.
These assumptions are obviously connected, and a more robust analysis would presumably seek to derive the uniqueness of superlative descriptions from the definite article that they cooccur with. But since issues of definiteness in relative readings are well-known and much-discussed (Szabolcsi 1986, Heim 1999, Sharvit and Stateva 2002, Coppock and Beaver 2014), and since our chief concern here is monotonicity, we opt for the blunt approach in Heim 1999. For an extended attempt to parcel out the respective compositional contributions of the and -est, see Bumford 2017a.
Finally, we assume that superlative adjectives may take scope over any constituent where it would be well-typed and felicitous for them to do so.14 Perhaps unexpectedly, the adjectives in the LFs below ((54), (55), and (61)) do not leave traces, and so do not abstract over their initial positions. This again is motivated by the types. The situation tends to arise when semantic modifiers are analyzed as propositional operators (see, e.g., von Fintel 1997 on only and Crnič 2014 on even). This is because the initial position of a modifier is by definition not an argument position. There is no need to bind a variable there. One interesting aspect of this particular formulation of dynamic semantics is that DPs and CPs have essentially the same type; they both denote sets of pairs of values with assignments. So even when the adjective modifies its local DP, it acts on that DP the same way a propositional operator would act on a sentence, as desired in (46).
But here, too, this is not essential. Along the lines of Barker’s (2007) treatment of same, the superlative could abstract over a property—type (e, t)—or a property modifier—type ((e, t), (e, t))—when it raises. It would then have to fill this hole with a trivial universal property, or an identity function, as illustrated in (53a). Alternatively, one could scope the superlative together with its definite article as a kind of “complex quantifier,” abstracting over the type of a determiner, as in (53b). This, in fact, would be quite similar to Bumford’s (2017b) original analysis (see also Szabolcsi 1986 on the -est, Zimmermann 2003 on the occasional, Kennedy and Stanley 2009 on the average, and Morzycki 2016 for a survey of related constructions and analytical prospects).
In any case, we do not mean to commit to this scoping of the adjective as a particular form of movement, per se. Of course, such movement would be quite unorthodox syntactically (though not more unorthodox than the standard Heimian analysis, which raises just the superlative suffix out of various left branches). The LFs are intended to show where the various semantic operators take scope, to be worked out using whatever mechanisms for scope-taking are preferred. Bumford’s (2017b) version, for instance, relies exclusively on continuation-passing and type-shifting to regulate the order of operations. But again for concreteness, we take it that a superlative adjective may raise at LF, tracelessly, to adjoin to any constituent that dominates it, as long as its type fits and its felicity conditions are met.
Thus, we derive absolute superlative descriptions as in (54). The superlative adjective highest takes scope over its containing DP. That description serves up a set of potential referents for the direct object, each paired with a modified assignment mapping the index 2 to an apple. The superlative eliminates all but the highest such potential referents. Given the assumptions above, there should be only one value x left in play after this operation. The whole superlative description then takes scope over the sentence, returning the (singleton) proposition that is true iff Mary picked the unique apple that is not below any others.
Two things are worth pointing out about this LF and those that follow. First, we are again following Heim (1997) in abstracting over the same index in the description’s restrictor and its scope, in this case 2. Of course, it would not change the truth conditions to pick different indices for the two abstracts, though it would have the side effect of storing a description’s referent at two different locations in the output assignment. We cannot see any harm in this, but we avoid it for perspicuity. Second, the superlative adjective is indexed to the referent introduced by its own DP. Intuitively, nothing else would make sense. In (54), highest should compare assignments along the coordinate that stores apples. But formally, this must be stipulated: we assume Det Adj-est NP is necessarily parsed as [Det n [. . . [estn Adj] . . . NP]]. This is in some sense just the obverse of the normal requirement that a raised element’s abstractor should match the index of its local position; here, the raised element’s index should match the abstractor above its local position. In these compositional dynamic trees, as in real ones, information flows both up and down the trunk.
Relative readings result when a superlative adjective takes scope over a constituent that further restricts the set of entities that might witness its description. As mentioned earlier, we assume that superlative scope is free within the confines of well-typedness. However, since it must modify a set of entity-valued outputs, the opportunities for mobilization are fairly limited. Bumford (2017b) describes two configurations in which this can happen.15 In the first, a superlative DP is itself contained within another definite description. In the second, a superlative associates with focus. In both cases, the superlative DP’s referent becomes dynamically codependent on that of another argument in the sentence, either the definite description that hosts it or the focused expression it associates with. Take (55), for example.16
At the edge of the composition of the outer description, all of the potential assignments g under which the sentence could be true have the following property: the student g5 received the grade g2. Thus, students who did not receive a grade and grades that no students received cannot possibly serve as the witnesses for these respective descriptions, since no assignment g could have values for both g5 and g2 that meet this constraint. As a result, such initially available candidate referents fall out of the computation before the superlative adjective is evaluated. Of the remaining potential assignments, the superlative removes all except those that send 2 to a grade higher than any other remaining candidate grade. This guarantees that only those students whose grade was at least as high as that of any others are available to serve as the referent of the entire host description. And again given the assumption above—that the superlative is felicitous only if its result contains a single value across potential outcomes—there should be just one such “winning” student.
This presentation of Bumford’s (2017a) analysis suffices to establish the requisite monotonicity properties and to verify the predictions against the data introduced above. We return to association with focus in section 6.4. Recall that the troubling NPIs for most analyses are of the sort in (56), extracted from sentences in (35).
(56)
- a.
the judge who gave the highest score to any rookie
- b.
the field in which the fewest women have ever won a Nobel Prize
- c.
the company that has the best excuse for any security breach so far
- d.
the zoo that the largest animal has ever escaped from
All of these descriptions involve unambiguously relative readings of superlatives; (56a) describes the judge who gave a higher score to a rookie than any other judge gave to a rookie, (56b) describes the field that has awarded fewer women the Nobel Prize than any other field has, and so on. In each example, an NPI occurs in the semantic scope of the relative superlative (which is the entire host description), sometimes in the superlative’s nominal complement (56c) and sometimes not (56a,b,d). Since the containing DPs are all singular definite descriptions, they cannot be the source of whatever negativity these NPIs are swimming in. That leaves only the superlative adjective itself. We now establish that any such adjective defined as in (52) is indeed SDE (and not SUE).
6.3 The Dynamic Superlative Is SDE but Not SUE
Consider an arbitrary superlative function Sn, as in (57c). Different underlying comparative relations R will determine different measure functions µ and orderings >, but since these do not affect the proof, we gloss over them.
The only difference between a denotation like (52c) and one like (57c) is that whereas earlier, highest was taken to denote a (total) function from sets of assignments to subsets of those assignments, here we define S in terms of the sets’ characteristic functions. So S is a function from properties of inputs to properties of outputs. Moreover, S is partialized so that it is only defined for arguments that have the property picked out by its comparison class G. This is exactly analogous to von Fintel’s (1999) insight that tallest volleyball player is a partial function defined only for entities that have the property picked out by volleyball player. The only difference is that the properties S applies to are dynamic, in the sense that they are tagged with assignments.
Let G and H be potential arguments of Sn such that G ⇒H, and let
α, γ be in the domain of both Sn(G) and Sn(H). To show that Sn is SDE, we need to show that Sn(H)Sn(G); that is, if Sn(H)(α, γ) is true, then so is Sn(G)(α, γ).- 1.
Assume Sn(H)(
α, γ) is true. - 2.
By definition of Sn, then, ¬∃
b, hʹ. H(b, hʹ) ˄ µ(hʹn) > µ(γn). - 3.
Assume for contradiction that there is a pair
b*, h* such that G(b*, h*) ˄ µ(h) > µ(γn). - 4.
Then since G ⇒ H, it should be true that H(
b*, h*) and therefore that H(b*, h*) ˄ µ(h) > µ(γn). - 5.
But this contradicts Step 2, so in fact Step 3 must be false: ¬∃
b, h. G(b, h) ˄ µ(hn) > µ(γn). - 6.
By definition of Sn, then, Sn(G)(
α, γ) is true. - 7.
Thus, Sn(H)
Sn(G), completing the proof.
Finally, Sn is not SUE. To see this, let µ be the function that maps letters to their positions in the alphabet, so that Sn is something like “alphabetically earliest.” Then let G be the characteristic function of {
x, [n ↦ x] | x ∈ {‘a’,‘b’}}, and let H be the characteristic function of {x, [n ↦ x] | x ∈ {‘a’,‘b’,‘c’}}. Clearly G ⇒ H, but Sn(H) = {‘c’, [n ↦ ‘c’]} ⊈ {‘b’, [2 ↦ ‘b’]} = Sn(G). So Sn(H) / Sn(G), and thus Sn is not SUE.6.4 Focus as Witness-Set Restriction and Identification
Returning briefly to the derivation in (55), we can show that the outer description—the one specifying a student—plays a crucial role. In dynamic semantics, the assignments paired with values keep track (incrementally) of the potential witnesses for various descriptions. An assignment cannot fail to assign a value to a discourse referent (here, an index). And the values assigned to the various indices have to satisfy the descriptive constraints of the various noun phrases that introduce them. That means that every assignment still in play at the top of the outer description makes a choice among the students and a choice among the grades such that the one received the other. So students who were not graded, and grades that no student received, are not among the candidate witnesses. This is what leads to the relative, rather than absolute, comparison of heights.
If focus is to have the same relativizing effect, then it too must introduce a discourse referent (again, the way we have set things up, this is just an abstraction index) that in turn imposes relational constraints on the superlative description’s own potential witnesses. But a focused expression should do more than just introduce a referent. After all, JOHN left does not mean the same thing as A person left. So in addition to the variation it introduces, it should require that its own value be among those witnesses that make the sentence true. Here is a way of formalizing this (see Bumford 2017b for a more robust variant, and independent arguments for this approach to focus).
This denotation acts locally like an indefinite—k(
ag)—but then ultimately only returns true—1t, h—if the value of the focused expression is among the potential witnesses for the description—x, h ∈ k(ag). In other words, JohnF left computes whether or not John is among the leavers, as demonstrated in (59). If so, then any constraints on other referents picked up along the way are preserved in h. The denotation at the top of (59) is empty if John did not leave, which in dynamic semantics amounts to falsity.17 If John did leave, then there is just one output, representing that the sentence is true, paired with a sole assignment representing John as a potential witness for 5.Superlatives associate with focus when they scope in between the temporary indefinite that the focus introduces and the test that is run to ensure that the focused expression survives the update. This is shown in (61). The local position of the focus, JohnF, acts indefinitely. It introduces the set of entities y that would render the rest of the LF true, and pairs each of these entities with a grade x that would witness that truth—that is, a grade that y actually received. Setting g8 ≡
ag, as it is eventually resolved to once the focus binds its trace, would render the denotation of the superlative’s complement as in (60).At this point, the only entities left among the outputs as potential values for g2 are the grades that were received by some alternative to John. The superlative highest2 then filters out all but those outputs that are maximal in their choice of 2. This leaves only those assignments
such that x received y and no other grade that was received (by anyone) was higher than y. Each such assignment is paired with the entity y that received it. Given the assumption we have made that the superlative is only felicitous if all remaining outputs after its execution have the same value component—only if there is exactly one “winner”—we predict that this highest received grade must have been received by exactly one person.Finally, the focus semantics completes the derivation by checking to see whether John is among those that received this highest grade. If so, the sentence returns true, together with an updated assignment listing John and John’s highest grade as the witnessing values for the focus and the superlative description, respectively.
Before we wrap up, a reviewer points out that the story above is likely to encounter difficulty with Heim’s (1999) split-scope de dicto cases, as in (62).
(62) John needs to climb the highest mountain.
- a.
✓John has a mountain-climbing height requirement that exceeds everyone else’s mountain-climbing height requirement.
The reading of (62) paraphrased in (62a) is generally challenging for any theory of superlatives that achieves relative readings without scoping the superlative morpheme out of its description. The chief issue is that the predicative components of the description can be interpreted in the scope of the attitude, while the superlative comparison is nevertheless between what different people’s needs are. Bumford’s (2017a) analysis, even though it relies on superlative scope-taking, appears to be similarly at a loss here, since the superlative scopes together with the lexical adjective. It is not obvious how the height of a mountain, which can vary from world to world, should be determined if the adjective is evaluated above the attitude. In fact, there seem to be deeper questions about how the binding information in the embedded clause could possibly leak out into the matrix, given that attitudes are canonically envisioned as externally static. We take the reviewer’s point here, and admit that Heim’s (1999) analysis remains in the best position to account for the split-scope truth conditions in (62) even while it is unable to account for the distribution of NPIs we have explored.
7 Conclusion
In this article, we have pointed out a basic tension between theories of superlative ambiguity on the one hand and theories of superlative NPI-licensing on the other, largely following the lead of Howard (2014). Accounts of the latter have concentrated on absolute readings, and have tended to analyze almost all of the superlative’s noun phrase as contributing to its semantic restrictor, which typically serves as the comparison class of objects that the superlative measures. Accounts of the former capture both readings, but have tended to analyze almost all of the superlative’s noun phrase, or even the entire clause containing the superlative’s noun phrase, as contributing to its semantic scope, which typically serves as the degree-entity relation that the superlative uses to measure its comparison class. The restrictor argument is approximately downward-entailing, but the scope argument is not. This leaves many successful theories of the scope ambiguity unable to explain the grammaticality of NPIs in superlative descriptions, and many successful theories of NPI-licensing unable to derive relative readings.
Howard (2014) has found a way to make progress on the NPI front without giving up on a scopally mobile superlative operator. His insight is that in some sentences, a superlative’s comparison class may be spelled out by an entire alternative-generating clause. For instance, in (63a) the relative clause inside the superlative noun phrase is taken to denote the set of degree properties, one per person here, mapping each entity to the lengths of books they read. The sentence is true if the degree property identifying the lengths of books Mary read is the longest of these. This predicts that NPI-licensing in relative superlative descriptions depends on the NPI-containing constituent serving up a set of degree properties that includes the degree property formed by abstracting over the superlative in the matrix. This explains why (63b) sounds like nonsense: if the comparison class C only contains degree properties corresponding to people other than Mary, then Mary’s own degree property can’t possibly be the largest in C, because it isn’t in there at all!
(63)
- a.
Out of all of us, MARY has read the longest paper that anyone here has read.
- b.
*Out of all of us, MARY has read the longest paper that anyone else has read.
However, we argued that this analysis comes at the cost of having to posit different superlative morphemes for absolute and relative readings. And it does not account for legitimate NPIs that appear outside the superlative’s own noun phrase—that is, those that fall under our Generalization II, including (64a–b).
(64)
- a.
Out of all of us, MARY has written the longest paper while advising anyone’s dissertation.
- b.
Out of all of us, MARY has written the longest paper containing any claim from her dissertation.
After failing to find a charitable extension of Howard’s (2014) technique, we considered a dynamic account of superlatives due to Bumford 2017a. This analysis has two desirable features from the perspective of the puzzles here: (i) as in Howard 2014, following Heim 1999, superlatives take genuine scope over the constituents that influence their meaning; and (ii) as in von Fintel 1999, the superlative combines with its gradable measure first, independently of anything else in the sentence. The first feature ensures that superlatives will have a chance to license description-external NPIs. The second ensures that everything that the superlative adjective scopes over is in a properly SDE argument.
But what of Howard’s “matching” judgments? We certainly cannot maintain the generalization that an NPI in a relative superlative noun phrase is necessarily ungrammatical when its own argument position differs from that of the superlative’s focus. After all, (64b) and other examples in section 4.3 plainly contradict this.
Yet we agree that Howard’s nonmatching examples can be understood in ways that make them sound deviant. That is, there is a way of hearing (63b) such that it presupposes—nonsensically—that Mary is among the set of people who are not Mary, and of these people who are not her, it says that Mary has read the longest paper. (We speculate that it is most natural with stress on anyone.) But strictly speaking, we think that the example is at least ambiguous. It can also be understood—sensically—as saying that, of just those papers that have been read by one of us z and also another person y ≠ z, Mary has read the longest. (We speculate that this reading is most natural with stress on else.)18
In any case, if it is true that an NPI clause can be interpreted as delivering an explicit comparison class for the superlative, then perhaps we should seek to refashion Howard’s derivations so that the relevant clauses denote sets of alternative updates, rather than alternative degree properties. We see no inherent obstacle to implementing this idea with Bumford’s (2017a) target types in mind, but we leave the exercise for future work.
Notes:
1 Note that the descriptions in (3) and (4) have irrelevant free choice readings on which they describe a club that, as a rule, gave all sophomores the large(-est) prize. We will ignore such readings throughout the article; all diacritics are intended as judgments when the relevant words are interpreted as NPIs.
2 We should also say that everything in this article is intended to comment on the felicity of weak NPIs like any and ever in superlative descriptions. Quite mysteriously, superlatives are also known to license at least some strong NPIs like in years.
- (i)
- a.
Erin is the tallest girl John has seen in years.
- b.
The tallest girl John had seen until Friday walked in the room.
- a.
We are not aware of any formal explanation for this, including in work expressly aimed at accounting for strong NPIs in and around presuppositional operators (e.g., Gajewski 2011), and we will unfortunately not have any progress to report.
5 We are aware of languages that have more than one superlative morpheme, but we are unaware of languages where the two morphemes are used to distinguish absolute and relative interpretations (see Fitzgibbons, Sharvit, and Gajewski 2008 for some discussion).
6 Or really, we should say, the items that were ambiguous between NPI and free choice interpretations—for example, the any in (35a)—have only free choice interpretations when the superlative is absent.
7 We would also have to assume that the superlative quantification itself is restricted to elements in this domain. Otherwise, we would predict that [the-C [[high estvF] mountain]] describes Mt. Everest, if it is in C, and fails to refer otherwise, rather than predicting that it refers to the highest of the mountains in C, which is the goal.
8 There are additional concerns regarding VP-modifying superlatives, as in Mary sang the loudest, and amount superlatives, as in Mary read the most books. But these could perhaps be analyzed as event-referring (as in Sharvit and Stateva 2002) and degree-referring (as in, e.g., Loccioni 2018 and Wilson 2018), respectively.
10 See also von Fintel 1999:134–135, where a similar technical hiccup is pointed out in the examination of adverbial only’s monotonicity.
11 The fragment is extensional from here on out, so we suppress world parameters in what follows.
12 We write 1t for the model-theoretic object representing truth.
13 Here we are following Stateva (2002) and Bobaljik (2012) in assuming that highest is underlyingly [[high -er] -est], but nothing hinges on this; (52a) may be trivially adjusted to accept a gradable adjective of type (e, d) or (e, (d,t)) directly.
14 In this article, we do not wade into the issue of scope-island constraints on superlative movement, which we see as orthogonal to questions about the monotonicity of the denotation. Szabolcsi (1986), Heim (1999), and many others assume that definite determiners hosting superlative descriptions do not constitute scope islands because such determiners behave as if they are truth-conditionally indefinite in such contexts. See the references in the text above for more on this issue. Szabolcsi (1986) claims that tensed-clause boundaries delimit superlative scope, but only documents attitude reports; Heim (1985:26, (48)–(51)) provides examples of various degree operators, including superlatives, appearing to scope out of tensed relative clauses. We think the matter is quite unclear empirically and so make no stipulations here.
15 There are other sources of relative readings documented in the literature, including matrix wh-questions and some control constructions. The former are confounded for our purposes because wh-questions often license NPIs on their own, and those analyses that deal with the latter typically just resort to formally focus-marking the controlled PRO (e.g., Romero 2013). We leave these for future work.
16 NB: The tree depicts (55)’s relative reading, on which it describes the student who outscored all the others. If highest instead scoped only over its own DP—the grade—then the reading would be absolute, describing the student who received the maximum possible score.
17 Formally, φ is “true” in a context g iff there is at least one assignment paired with 1t among the outputs of φ when evaluated at g—that is, iff there is at least one way of choosing referents for all the descriptions that satisfy all the predicates that relate them.
18 Presumably the sentence actually has two relative readings, depending on whether else is interpreted strictly or sloppily (see Bumford 2018). The latter is given by the paraphrase in the main text. The former would say that of the papers that have been read by one of us and also somebody who is not Mary, Mary has read the longest.
Thanks especially to Cécil Meier, Ede Zimmermann, Daniel Gutzmann, Chris Barker, and audiences at UCLA, NYU, the University of Arizona, and Goethe Universität.