The pronunciation of the digraph ⟨wh⟩ in English has changed over time, and still varies today between different regions and accents. It is now most commonly pronounced /w/, the same as a plain initial ⟨w⟩, although some dialects, particularly those of Scotland, Ireland, and the Southern United States, retain the traditional pronunciation /hw/, generally realized as , a voiceless "w" sound. The process by which the historical /hw/ has become /w/ in most modern varieties of English is called the wine-whine merger. It is also referred to as glide cluster reduction.
Before rounded vowels, a different reduction process took place in Middle English, as a result of which the ⟨wh⟩ in words like who and whom is now pronounced /h/. (A similar sound change occurred earlier in the word how.)
What is now English ⟨wh⟩ originated as the Proto-Indo-European consonant *k? (whose reflexes came to be written ⟨qu⟩ in Latin and the Romance languages). In the Germanic languages, in accordance with Grimm's Law, Indo-European voiceless stops became voiceless fricatives in most environments. Thus the labialized velar stop *k? initially became presumably a labialized velar fricative *x? in pre-Proto-Germanic, then probably becoming *[?] - a voiceless labio-velar approximant - in Proto-Germanic proper. The sound was used in Gothic and represented by the symbol known as hwair; in Old English it was spelled as ⟨hw⟩. The spelling was changed to ⟨wh⟩ in Middle English, but the pronunciation remained [?].
Because Proto-Indo-European interrogative words typically began with *k?, English interrogative words (such as who, which, what, when, where) typically begin with ⟨wh⟩ (for the word how, see below). As a result, such words are often called wh-words, and questions formed from them are called wh-questions. In reference to this English order, a common cross-lingual grammatical phenomenon affecting interrogative words is called wh-movement.
Before rounded vowels, such as /u:/ or /o:/, there was a tendency, beginning in the Old English period, for the sound /h/ to become labialized, causing it to sound like /hw/. Therefore, words with an established /hw/ in that position came to be perceived (and spelt) as beginning with plain /h/. This occurred with the interrogative word how (Proto-Germanic *hw?, Old English h?).
A similar process of labialization of /h/ before rounded vowels occurred in the Middle English period, around the 15th century, in some dialects. Some words which historically began with /h/ came to be written ⟨wh⟩ (whole, whore). Later in many dialects /hw/ was delabialized to /h/ in the same environment, regardless of whether the historic pronunciation was /h/ or /hw/ (in some other dialects the labialized /h/ was reduced instead to /w/, leading to such pronunciations as the traditional Kentish /wo?m/ for home). This process affected the pronoun who and its inflected forms. These had escaped the earlier reduction to /h/ because they had unrounded vowels in Old English, but by Middle English the vowel had become rounded, and so the /hw/ of these words was now subject to delabialization:
By contrast with how, these words changed after their spelling with ⟨wh⟩ had become established, and thus continue to be written with ⟨wh⟩ like the other interrogative words which, what, etc. (which were not affected by the above changes since they had unrounded vowels - the vowel of what became rounded at a later time).
The wine-whine merger is the phonological merger by which /hw/, historically realized as a voiceless labio-velar approximant [?], comes to be pronounced the same as plain /w/, that is, as a voiced labio-velar approximant [w]. John C. Wells refers to this process as Glide Cluster Reduction. It causes the distinction to be lost between the pronunciation of ⟨wh⟩ and that of ⟨w⟩, so pairs of words like wine/whine, wet/whet, weather/whether, wail/whale, Wales/whales, wear/where, witch/which become homophones. This merger has taken place in the dialects of the great majority of English speakers.
The merger is essentially complete in England, Wales, the West Indies, South Africa, Australia, and in the speech of young speakers in New Zealand. The merger is not found, however, in Scotland, in most of Ireland (although the distinction is usually lost in Belfast and some other urban areas of Northern Ireland), and in the speech of older speakers in New Zealand.
Most speakers in the United States and Canada have the merger. According to Labov, Ash, and Boberg (2006: 49), using data collected in the 1990s, there are regions of the U.S. (particularly in the Southeast) in which speakers keeping the distinction are about as numerous as those having the merger, but there are no regions in which the preservation of the distinction is predominant (see map). Throughout the U.S. and Canada, about 83% of respondents in the survey had the merger completely, while about 17% preserved at least some trace of the distinction.
The merger seems to have been present in the south of England as early as the 13th century. It was unacceptable in educated speech until the late 18th century, but there is no longer generally any stigma attached to either pronunciation. Some RP speakers may use /hw/ for ⟨wh⟩, a usage widely considered "correct, careful and beautiful", but that is usually a conscious choice rather than a natural part of the speaker's accent.
A portrayal of the regional retention of the distinct wh- sound is found in the speech of the character Frank Underwood, a South Carolina politician, in the American television series House of Cards. The show King of the Hill, set in Texas, pokes fun at the issue through character Hank Hill's use of the hypercorrected [h?] pronunciation. A similar gag can be found in several episodes of Family Guy, with Brian becoming annoyed by Stewie's over-emphasis of the /hw/ sound in his pronunciation of "Cool hWhip", "hWheat Thins", and "Will hWheaton".
The distribution of the wh- sound in words does not always exactly match the standard spelling; for example, Scots pronounce whelk with plain /w/, while in many regions weasel has the wh- sound.
Below is a list of word pairs which are liable to be pronounced as homophones by speakers having the wine-whine merger.
|wail||whale||'we?l||With pane-pain merger|
|weigh||whey||'we?||With wait-weight merger|
|were (man)||where||'w?:(r), 'we:r|
|were (to be)||whir||'w?:(r)|
|word||whirred||'w?:(r)d||With nurse merger|
|world||whirled||'w?:(r)ld||With nurse merger|
As mentioned above, the sound of initial ⟨wh⟩, when distinguished from plain ⟨w⟩, is often pronounced as a voiceless labio-velar approximant [?], a voiceless version of the ordinary [w] sound. In some accents, however, the pronunciation is more like [h?], and in some Scottish dialects it may be closer to [x?] or [k?] - the [?] sound preceded by a voiceless velar fricative or stop. (In other places the /kw/ of qu- words is reduced to [?].) In the Black Isle, the /hw/ (like /h/ generally) is traditionally not pronounced at all. Pronunciations of the [x?] or [k?] type are reflected in the former Scots spelling quh- (as in quhen for when, etc.).
In some dialects of Scots, the sequence /hw/ has merged with the voiceless labiodental fricative /f/. Thus whit ("what") is pronounced /f?t/, whan ("when") becomes /fan/, and whine becomes /fain/ (a homophone of fine). This is also found in some Irish English with an Irish Gaelic substrate influence (something which has led to an interesting re-borrowing of whisk(e)y as Irish Gaelic fuisce, the word having originally entered English from Scottish Gaelic).
Phonologically, the distinct sound of ⟨wh⟩ is often analyzed as the consonant cluster /hw/, and it is transcribed so in most dictionaries. When it has the pronunciation [?], however, it may also be analyzed as a single phoneme,