In phonology, an allophone (; from the Greek, állos, 'other' and ?, ph?n?, 'voice, sound') is a set of multiple possible spoken sounds – or phones – or signs used to pronounce a single phoneme in a particular language. For example, in English, (as in stop[st?p]) and the aspirated form (as in top['tp]) are allophones for the phoneme /t/, while these two are considered to be different phonemes in some languages such as Hindi and Thai. On the other hand, in Spanish, (as in dolor[do'lo?]) and (as in nada['naða]) are allophones for the phoneme /d/, while these two are considered to be different phonemes in English.
The specific allophone selected in a given situation is often predictable from the phonetic context, with such allophones being called positional variants, but some allophones occur in free variation. Replacing a sound by another allophone of the same phoneme usually does not change the meaning of a word, but the result may sound non-native or even unintelligible.
Native speakers of a given language perceive one phoneme in the language as a single distinctive sound and are "both unaware of and even shocked by" the allophone variations that are used to pronounce single phonemes.
History of concept
The term "allophone" was coined by Benjamin Lee Whorf circa 1929. In doing so, he is thought to have placed a cornerstone in consolidating early phoneme theory. The term was popularized by George L. Trager and Bernard Bloch in a 1941 paper on English phonology and went on to become part of standard usage within the American structuralist tradition.
Complementary and free-variant allophones and assimilation
Whenever a user's speech is vocalized for a given phoneme, it is slightly different from other utterances, even for the same speaker. That has led to some debate over how real and how universal phonemes really are (see phoneme for details). Only some of the variation is significant, by being detectable or perceivable, to speakers.
There are two types of allophones, based on whether a phoneme must be pronounced using a specific allophone in a specific situation or whether the speaker has the unconscious freedom to choose the allophone that is used.
If a specific allophone from a set of allophones that correspond to a phoneme must be selected in a given context, and using a different allophone for a phoneme would cause confusion or make the speaker sound non-native, the allophones are said to be complementary. The allophones then complement each other, and one of them is not used in a situation in which the usage of another is standard. For complementary allophones, each allophone is used in a specific phonetic context and may be involved in a phonological process.
In other cases, the speaker can freely select from free-variant allophones on personal habit or preference, but free-variant allophones are still selected in the specific context, not the other way around.
Another example of an allophone is assimilation, in which a phoneme is to sound more like another phoneme. One example of assimilation is consonant voicing and devoicing, in which voiceless consonants are voiced before and after voiced consonants, and voiced consonants are devoiced before and after voiceless consonants.
There are many allophonic processes in English: lack of plosion, nasal plosion, partial devoicing of sonorants, complete devoicing of sonorants, partial devoicing of obstruents, lengthening and shortening vowels, and retraction.
Aspiration: In English, a voiceless plosive /p, t, k/ is aspirated (has a strong explosion of breath) if it is at the beginning of the first or a stressed syllable in a word. For example, [p?] as in pin and [p] as in spin are allophones for the phoneme /p/ because they cannot distinguish words (in fact, they occur in complementary distribution). English-speakers treat them as the same sound, but they are different: the first is aspirated and the second is unaspirated (plain). Many languages treat the two phones differently.
Nasal plosion: In English, a plosive (/p, t, k, b, d, ?/) has nasal plosion if it is followed by a nasal, whether within a word or across a word boundary.
Partial devoicing of sonorants: In English, sonorants (/j, w, l, r, m, n, ?/) are partially devoiced after a voiceless sound in the same syllable.
Complete devoicing of sonorants: In English, a sonorant is completely devoiced after an aspirated plosive (/p, t, k/).
Partial devoicing of obstruents: In English, a voiced obstruent is partially devoiced next to a pause or next to a voiceless sound within a word or across a word boundary.
Retraction: In English, /t, d, n, l/ are retracted before /r/.
Because the choice among allophones is seldom under conscious control, few people realize their existence. English-speakers may be unaware of differences between a number of (dialect-dependent) allophones of the phoneme /t/:
post-aspirated [t?] as in top,
unaspirated [t] as in stop.
glottalized (or rather substituted by the glottal stop) [?] as in button, but many speakers preserve at least an unreleased coronal stop[ t?].
In addition, the following allophones of /t/ are found in (at least) some dialects of American(ised) English;
unreleased [ t?] as in American Englishcat, but other dialects preserve the released [t], or substitute the glottal stop [?].
However, speakers may become aware of the differences if – for example – they contrast the pronunciations of the following words:
Night rate: unreleased ['nt?.e?t?] (without a word space between [ . ] and [?])
Nitrate: aspirated ['na?.te?t?] or retracted ['na?.te?t?]
A flame that is held in front of the lips while those words are spoken flickers more for the aspirated nitrate than for the unaspirated night rate. The difference can also be felt by holding the hand in front of the lips. For a Mandarin-speaker, for whom /t/ and /t?/ are separate phonemes, the English distinction is much more obvious than for an English-speaker, who has learned since childhood to ignore the distinction.
One may notice the (dialect-dependent) allophones of English /l/ such as the (palatal) alveolar "light" [l] of leaf['li:f] as opposed to the velar alveolar "dark" [?] in feel['fi:?] found in the U.S.A. and Southern England. The difference is much more obvious to a Turkish-speaker, for whom /l/ and /?/ are separate phonemes, than to an English speaker, for whom they are allophones of a single phoneme.
These descriptions are more sequentially broken down in the next section.
Rules for English consonant allophones
Peter Ladefoged, a renowned phonetician, clearly explains the consonantallophones of English in a precise list of statements to illustrate the language behavior. Some of these rules apply to all the consonants of English; the first item on the list deals with consonant length, items 2 through 18 apply to only selected groups of consonants, and the last item deals with the quality of a consonant.
These descriptive rules are as follows:
Consonants are longer when at the end of a phrase. This can be easily tested by recording a speaker saying a sound like "bib", then comparing the forward and backward playback of the recording. One will find that the backward playback does not sound like the forward playback because the production of what is expected to be the same sound is not identical.
Voicelessstops/p, t, k/ are aspirated when they come at the beginning of a syllable, such as in words like "pip, test, kick" [pp, tst, kk]. You can compare this with voiceless stops that are not syllable initial like "stop" [st?p]. The /t/ voiceless stop follows the /s/ (fricative) here.
Voicedobstruents, which include stops and fricatives, such as /b, d, ?, v, ð, z, ?/, that come at the end of an utterance like /v/ in "improve" or before a voiceless sound like /d/ in "add two") are only briefly voiced during the articulation.
Voiced stops and affricates/b, d, ?, d?/ in fact occur as voiceless at the beginning of a syllable unless immediately preceded by a voiced sound, in which the voiced sound carries over.
Approximants (in English, these include /w, r, j, l/) are partially voiceless when they occur after syllable-initial /p, t, k/ like in "play, twin, cue" [pl?e?, twn, kj?u].
Voiceless stops /p, t, k/ are not aspirated when following after a syllable initial fricative, such as in the words "spew, stew, skew."
Voiceless stops and affricates /p, t, k, t?/ are longer than their voiced counterparts /b, d, ?, d?/ when situated at the end of a syllable. Try comparing "cap" to "cab" or "back" to "bag".
When a stop comes before another stop, the explosion of air only follows after the second stop, illustrated in words like "apt" [æp?t] and "rubbed" [r?b?d].
Many English accents produce a glottal stop in syllables that end with voiceless stops. Some examples include pronunciations of "tip, pit, kick" [tp, pt, kk].
Some accents of English use a glottal stop in place of a /t/ when it comes before an alveolar nasal in the same word (as opposed to in the next word), such as in the word "beaten" ['bi:?n?].
Nasals become syllabic, or their own syllable, only when immediately following an obstruent (as opposed to just any consonant), such as in the words "leaden, chasm" ['l?dn?, 'kæzm?]. Take in comparison "kiln, film"; in most accents of English, the nasals are not syllabic.
The lateral /l/, however, is syllabic at the end of the word when immediately following any consonant, like in "paddle, whistle" ['pædl?, 'w?sl?].
When considering /r, l/ as liquids, /r/ is included in this rule as well as present in the words "sabre, razor, hammer, tailor" ['se?b, 're?z, 'hæm, 'te?l].
Alveolar stops become voiced taps when they occur between two vowels, as long as the second vowel is unstressed. Take for instance mainly American English pronunciations like "fatty, data, daddy, many" ['fæ?i, 'dæ, 'dæ?i, 'mi].
When an alveolar nasal is followed by a stop, the /t/ is lost and a nasal tap occurs, causing "winter" to sound just like "winner" or "panting" to sound just like "panning". In this case, both alveolar stops and alveolar nasal plus stop sequences become voiced taps after two vowels when the second vowel is unstressed. This can vary among speakers, where the rule does not apply to certain words or when speaking at a slower pace.
All alveolar consonants assimilate to dentals when occurring before a dental. Take the words "eighth, tenth, wealth". This also applies across word boundaries, for example "at this" ['æt? ð?s].
Alveolar stops are reduced or omitted when between two consonants. Some examples include "most people" (can be written either as ['mo?s 'pipl?] or ['mo?st 'pipl?] with the IPA, where the [t] is inaudible, and "sand paper, grand master", where the [d] is inaudible.
A consonant is shortened when it is before an identical consonant, such as in "big game" or "top post".
A homorganic voiceless stop may be inserted after a nasal before a voiceless fricative followed by an unstressed vowel in the same word. For example, a bilabial voiceless plosive /p/ can be detected in the word "something" ['s?mp] even though it is orthographically not indicated. This is known as epenthesis. However, the following vowel must be unstressed.
Velar stops /k, ?/ become more front when the following vowel sound in the same syllable becomes more front. Compare for instance "cap" [kæp] vs. "key" [k?i] and "gap" [?æp] vs. "geese" [i:s].
The lateral /l/ is velarized at the end of a word when it comes after a vowel as well as before a consonant. Compare for example "life" [la?f] vs. "file" [fa] or "feeling" [fi:l] vs. "feel" [fi:?].
There are many examples for allophones in languages other than English. Typically, languages with a small phoneme inventory allow for quite a lot of allophonic variation: examples are Hawaiian and Pirahã. Here are some examples (the links of language names go to the specific article or subsection on the phenomenon):
Since phonemes are abstractions of speech sounds, not the sounds themselves, they have no direct phonetic transcription. When they are realized without much allophonic variation, a simple broad transcription is used. However, when there are complementary allophones of a phoneme, the allophony becomes significant and things then become more complicated. Often, if only one of the allophones is simple to transcribe, in the sense of not requiring diacritics, that representation is chosen for the phoneme.
However, there may be several such allophones, or the linguist may prefer greater precision than that allows. In such cases, a common convention is to use the "elsewhere condition" to decide the allophone that stands for the phoneme. The "elsewhere" allophone is the one that remains once the conditions for the others are described by phonological rules.
For example, English has both oral and nasal allophones of its vowels. The pattern is that vowels are nasal only before a nasal consonant in the same syllable; elsewhere, they are oral. Therefore, by the "elsewhere" convention, the oral allophones are considered basic, and nasal vowels in English are considered to be allophones of oral phonemes.
In other cases, an allophone may be chosen to represent its phoneme because it is more common in the languages of the world than the other allophones, because it reflects the historical origin of the phoneme, or because it gives a more balanced look to a chart of the phonemic inventory.
An alternative, which is commonly used for archiphonemes, is to use a capital letter, such as /N/ for [m], [n], [?].
In rare cases, a linguist may represent phonemes with abstract symbols, such as dingbats, to avoid privileging any particular allophone.
^Lee, Penny (1996). The Whorf Theory Complex -- A Critical Reconstruction. John Benjamins. pp. 46, 88.
^Trager, George L. (1959). "The Systematization of the Whorf Hypothesis". Anthropological Linguistics. Operational Models in Synchronic Linguistics: A Symposium Presented at the 1958 Meetings of the American Anthropological Association. 1 (1): 31-35. JSTOR30022173.
^Hymes, Dell H.; Fought, John G. (1981). American Structuralism. Walter de Gruyter. p. 99.
^Barbara M. Birch (2002), English L2 reading: getting to the bottom, Psychology Press, 2002, ISBN978-0-8058-3899-2, ...When the occurrence of one allophone is predictable when compared to the other, as in this case, we call this complementary distribution. Complementary distribution means that the allophones are 'distributed' as complements....
^Ladefoged, Peter (2001). A Course in Phonetics (4th ed.). Orlando: Harcourt. ISBN0-15-507319-2. p. 56-60.