In linguistics, binding is the distribution of anaphoric elements (pronouns and other pro-forms). A pronoun (a "bindee") usually has an antecedent (a "binder") in context. The goal of binding theory is to identify the syntactic relationship that can or must hold between a given pronoun or noun and its antecedent (or postcedent), e.g. John_i said he_i would help vs. *He_i said John_i would help (the second sentence is not possible if he is intended to mean John).
The idea that there should be a specialized, coherent theory dealing with phenomena of this sort originated in work in transformational grammar in the 1970s. This work culminated in government and binding theory in the 1980s. The binding theory that became established at that time is still considered a reference point, though its validity is no longer accepted.
Many theories of syntax now have a subtheory that addresses binding phenomena. These phenomena exist in all languages, although the behavior of binding can vary in interesting and nuanced ways across languages, even across languages that are closely related.
The following sentences illustrate some basic facts of binding. The words that bear the index i should be construed as referring to the same person or thing.
These sentences illustrate some aspects of the distribution of reflexive and personal pronouns. In the first pair of sentences, the reflexive pronoun must appear for the indicated reading to be possible. In the second pair, the personal pronoun must appear for the indicated reading to be possible. The third pair shows that at times a personal pronoun must follow its antecedent, and the fourth pair further illustrates the same point, although the acceptability judgement is not as robust. Based on such data, one sees that reflexive and personal pronouns differ in their distribution and that linear order (of a pronoun in relation to its antecedent or postcedent) is a factor influencing where at least some pronouns can appear. A theory of binding should be in a position to predict and explain the differences in distribution seen in sentences like these. It should be in a position to answer questions like: What explains where a reflexive pronoun must appear as opposed to a personal pronoun? When does linear order play a role in determining where pronouns can appear? What other factor (or factors) beyond linear order help predict where pronouns can appear?
The following three subsections consider the binding domains that are relevant for the distribution of pronouns and nouns in English. The discussion follows the outline provided by the traditional binding theory (see below), which divides nominals into three basic categories: reflexive and reciprocal pronouns, personal pronouns, and nouns (common and proper).
When one examines the distribution of reflexive pronouns and reciprocal pronouns (which are often subsumed under the general category of "anaphor"), one sees that there are certain domains that are relevant, a "domain" being a syntactic unit that is clause-like. Reflexive and reciprocal pronouns often seek their antecedent close by, in a binding domain that is local, e.g.
These examples illustrate that there is a domain within which a reflexive or reciprocal pronoun should find its antecedent. The a-sentences are fine because the reflexive or reciprocal pronoun has its antecedent within the clause. The b-sentences, in contrast, do not allow the indicated reading, a fact illustrating that personal pronouns have a distribution that is different from that of reflexive and reciprocal pronouns. A related observation is that a reflexive or reciprocal pronoun often cannot seek its antecedent in a superordinate clause, e.g.
When the reflexive or reciprocal pronoun attempts to find an antecedent outside of the immediate clause containing it, it fails. In other words, it can hardly seek its antecedent in the superordinate clause. The binding domain that is relevant is the immediate clause containing it.
Personal pronouns have a distribution that is different from reflexive and reciprocal pronouns, a point that is evident with the first two b-sentences in the previous section. The local binding domain that is decisive for the distribution of reflexive and reciprocal pronouns is also decisive for personal pronouns, but in a different way. Personal pronouns seek their antecedent outside of the local binding domain containing them, e.g.
In these cases, the pronoun has to look outside of the embedded clause containing it to the matrix clause to find its antecedent. Hence based on such data, the relevant binding domain appears to be the clause. Further data illustrate, however, that the clause is actually not the relevant domain:
Since the pronouns appear within the same minimal clause containing their antecedents in these cases, one cannot argue that the relevant binding domain is the clause. The most one can say based on such data is that the domain is "clause-like".
The distribution of common and proper nouns is unlike that of reflexive, reciprocal, and personal pronouns. The relevant observation in this regard is that a noun often resists coreference with another nominal that is within its binding domain or in a superordinate binding domain, e.g.
The readings indicated in the a-sentences are natural, whereas the b-sentences are very unusual. Indeed, sentences like these b-sentences were judged to be impossible in the traditional binding theory according to Condition C (see below). Given a contrastive context, however, the b-sentences can work, e.g. Susan does not admire Jane, but rather Susan_i admires Susan_i. One can therefore conclude that nouns are not sensitive to binding domains in the same way that reflexive, reciprocal, and personal pronouns are.
The following subsections illustrate the extent to which pure linear order impacts the distribution of pronouns. While linear order is clearly important, it is not the only factor influencing where pronouns can appear.
A simple hypothesis concerning the distribution of many anaphoric elements, of personal pronouns in particular, is that linear order plays a role. In most cases, a pronoun follows its antecedent, and in many cases, the coreferential reading is impossible if the pronoun precedes its antecedent. The following sentences suggest that pure linear order can indeed be important for the distribution of pronouns:
While the coreferential readings indicated in these b-sentences are possible, they are unlikely. The order presented in the a-sentences is strongly preferred. The following, more extensive data sets further illustrate that linear order is important:
While the acceptability judgements here are nuanced, one can make a strong case that pure linear order is at least in part predictive of when the indicated reading is available. The a- and c-sentences allow the coreferential reading more easily than their b- and d-counterparts.
While linear order is an important factor influencing the distribution of pronouns, it is not the only factor. The following sentences are similar to the c- and d-sentences in the previous section insofar as an embedded clause is present.
While there may be a mild preference for the order in the a-sentences here, the indicated reading in the b-sentences is also available. Hence linear order is hardly playing a role in such cases. The relevant difference between these sentences and the c- and d-sentences in the previous section is that the embedded clauses here are adjunct clauses, whereas they are argument clauses above. The following examples involve adjunct phrases:
The fact that the c-sentences marginally allow the indicated reading whereas the b-sentences do not at all allow this reading further demonstrates that linear order is important. But in this regard, the d-sentences are telling, since if linear order were the entire story, one would expect the d-sentences to be less acceptable than they are. The conclusion that one can draw from such data is that there are one or more other factors beyond linear order that are impacting the distribution of pronouns.
Given that linear order is not the only factor influencing the distribution of pronouns, the question is what other factor or factors might also be playing a role. The traditional binding theory (see below) took c-command to be the all-important factor, but the importance of c-command for syntactic theorizing has been extensively criticized in recent years. The primary alternative to c-command is functional rank. These two competing concepts (c-command vs. rank) have been debated extensively, and the debate continues. C-command is a configurational notion; it is defined over concrete syntactic configurations. Rank, in contrast, is a functional notion that resides in the lexicon; it is defined over the ranking of the arguments of predicates. Subjects are ranked higher than objects, first objects are ranked higher than second objects, and prepositional objects are ranked lowest. The following two subsections briefly consider these competing notions.
C-command is a configurational notion that acknowledges the syntactic configuration as primitive. Basic subject-object asymmetries, which are numerous in many languages, are explained by the fact that the subject appears outside of the finite verb phrase (VP) constituent, whereas the object appears inside it. Subjects therefore c-command objects, but not vice versa. C-command is defined as follows:
Given the binary division of the clause (S -> NP + VP) associated with most phrase structure grammars, this definition sees a typical subject c-commanding everything inside the verb phrase (VP), whereas everything inside the VP is incapable of c-commanding anything outside of the VP. Some basic binding facts are explained in this manner, e.g.
Sentence a is fine because the subject Larry c-commands the object himself, whereas sentence b does not work because the object Larry does not c-command the subject himself. The assumption has been that within its binding domain, a reflexive pronoun must be c-commanded by its antecedent. While this approach based on c-command makes a correct prediction much of the time, there are other cases where it fails to make the correct prediction, e.g.
The reading indicated is acceptable in this case, but if c-command were the key notion helping to explain where the reflexive can and must appear, then the reading should be impossible since himself is not c-commanded by Larry.
As reflexive and personal pronouns occur in complementary distribution, the notion of c-command can also be used to explain where personal pronouns can appear. The assumption is that personal pronouns cannot c-command their antecedent, e.g.
In both examples, the personal pronoun she does not c-command its antecedent Alice, resulting in the grammaticality of both sentences despite reversed linear order.
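The configurational reasoning above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the definition used is one common formulation of c-command (neither node dominates the other, and the first branching node dominating A also dominates B), which may differ in detail from the definition intended in this section. The `Node` class, the function names, and the two toy trees (including the sentences they stand for) are hypothetical and are not taken from the original examples.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)
class Node:
    """A node in a toy constituency tree (hypothetical helper class)."""
    label: str
    children: List["Node"] = field(default_factory=list)
    parent: Optional["Node"] = field(default=None, repr=False)

    def __post_init__(self) -> None:
        # Link each child back to this node so we can walk upward later.
        for child in self.children:
            child.parent = self


def dominates(a: Node, b: Node) -> bool:
    """Return True if a is a proper ancestor of b."""
    node = b.parent
    while node is not None:
        if node is a:
            return True
        node = node.parent
    return False


def c_commands(a: Node, b: Node) -> bool:
    """Assumed definition: neither node dominates the other, and the
    first branching node that dominates a also dominates b."""
    if a is b or dominates(a, b) or dominates(b, a):
        return False
    node = a.parent
    while node is not None and len(node.children) < 2:
        node = node.parent
    return node is not None and dominates(node, b)


# Toy tree 1 (assumed sentence): [S [NP Larry] [VP [V praised] [NP himself]]]
larry = Node("NP:Larry")
himself = Node("NP:himself")
s1 = Node("S", [larry, Node("VP", [Node("V:praised"), himself])])

# Toy tree 2 (assumed sentence): the reflexive is embedded inside the subject NP.
embedded = Node("NP:himself")
obj = Node("NP:Larry")
subject = Node("NP", [Node("N:picture"), Node("PP", [Node("P:of"), embedded])])
s2 = Node("S", [subject, Node("VP", [Node("V:upset"), obj])])

print(c_commands(larry, himself))  # subject over object
print(c_commands(himself, larry))  # object over subject
print(c_commands(obj, embedded))   # object over a reflexive inside the subject
```

On these toy trees the subject c-commands the object but not the reverse, and the object does not c-command a reflexive embedded inside the subject, which is the configuration that is problematic for a purely c-command-based account.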
The alternative to a c-command approach posits a ranking of syntactic functions (SUBJECT > FIRST OBJECT > SECOND OBJECT > PREPOSITIONAL OBJECT). Subject-object asymmetries are addressed in terms of this ranking. Since subjects are ranked higher than objects, an object can have the subject as its antecedent, but not vice versa. With basic cases, this approach makes the same prediction as the c-command approach. The first two sentences from the previous section are repeated here:
Since the subject outranks the object, sentence a is predictably acceptable, the subject Larry outranking the object himself. Sentence b, in contrast, is bad because the subject reflexive pronoun himself outranks its postcedent Larry. In other words, this approach in terms of rank is assuming that within its binding domain, a reflexive pronoun may not outrank its antecedent (or postcedent). Consider the third example sentence from the previous section in this regard:
The approach based on rank does not require a particular configurational relationship to hold between a reflexive pronoun and its antecedent. In other words, it makes no prediction in this case, and hence does not make an incorrect prediction. The reflexive pronoun himself is embedded within the subject noun phrase, which means that it is not the subject and hence does not outrank the object Larry.
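The rank-based reasoning can likewise be sketched in Python. The hierarchy and the condition (a reflexive may not outrank its antecedent within its binding domain) come from the text above; the function names, the string labels, and the use of `None` for "no prediction" are assumptions made purely for illustration, and the sketch ignores binding domains and linear order altogether.

```python
from typing import Optional

# Ranking of syntactic functions as given in the text, highest first.
RANK = ["subject", "first object", "second object", "prepositional object"]


def outranks(function_a: str, function_b: str) -> bool:
    """Return True if function_a is ranked strictly higher than function_b."""
    return RANK.index(function_a) < RANK.index(function_b)


def reflexive_allowed(reflexive_function: Optional[str],
                      antecedent_function: str) -> Optional[bool]:
    """Apply the condition that a reflexive may not outrank its antecedent.

    reflexive_function is None when the reflexive is merely embedded inside
    an argument and bears no ranked function itself; in that case the
    rank-based approach makes no prediction, which is modeled here as None.
    """
    if reflexive_function is None:
        return None
    return not outranks(reflexive_function, antecedent_function)


# Reflexive object, subject antecedent: acceptable.
print(reflexive_allowed("first object", "subject"))
# Reflexive subject, object postcedent: ruled out.
print(reflexive_allowed("subject", "first object"))
# Reflexive embedded inside the subject noun phrase: no prediction.
print(reflexive_allowed(None, "first object"))
```

Returning `None` rather than `True` in the third case keeps the distinction the text draws between making no prediction and predicting acceptability.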
A theory of binding that acknowledges both linear order and rank can at least begin to predict many of the marginal readings. When both linear order and rank combine, acceptability judgments are robust, e.g.
This ability to address marginal readings is something that an approach combining linear order and rank can accomplish, whereas an approach that acknowledges only c-command cannot do the same.
The exploration of binding phenomena got started in the 1970s and interest peaked in the 1980s with Government and Binding Theory, a grammar framework in the tradition of generative syntax that is still prominent today. The theory of binding that became widespread at that time now serves merely as a reference point (since it is no longer believed to be correct). This theory distinguishes between three binding conditions: A, B, and C. The theory classifies nominals according to two binary features, [±anaphor] and [±pronominal]. The binding characteristics of a nominal are determined by the values of these features, either plus or minus. Thus, a nominal that is [-anaphor, -pronominal] is an R-expression (referring expression), such as a common noun or a proper name. A nominal that is [-anaphor, +pronominal] is a pronoun, such as he or they, and a nominal that is [+anaphor, -pronominal] is a reflexive pronoun, such as himself or themselves. Note that the term anaphor here is being used in a specialized sense; it essentially means "reflexive". This meaning is specific to the Government and Binding framework and has not spread beyond this framework.
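The feature-based classification just described amounts to a small lookup table, sketched here in Python. The function name and the fallback string are hypothetical; the fourth combination, [+anaphor, +pronominal], is not discussed in this section and is therefore left unclassified rather than guessed at.

```python
def classify_nominal(anaphor: bool, pronominal: bool) -> str:
    """Map the two binary features to the nominal types named in the text."""
    table = {
        (False, False): "R-expression",      # e.g. a common noun or proper name
        (False, True): "pronoun",            # e.g. he, they
        (True, False): "reflexive pronoun",  # e.g. himself, themselves
    }
    # The [+anaphor, +pronominal] combination is not covered in this section.
    return table.get((anaphor, pronominal), "not covered in this section")


print(classify_nominal(anaphor=False, pronominal=False))
print(classify_nominal(anaphor=False, pronominal=True))
print(classify_nominal(anaphor=True, pronominal=False))
```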
Based on the classifications according to these two features, three conditions are formulated:
While the theory of binding that these three conditions represent is no longer held to be valid, as mentioned above, the associations with the three conditions are so firmly anchored in the study of binding that one often refers to, for example, "Condition A effects" or "Condition B effects" when describing binding phenomena.