There are 9 vowels and 36 diphthongs, 28 of which are native to Estonian. All nine vowels can appear as the first component of a diphthong, but only /?, e, i, o, u/ occur as the second component. A vowel characteristic of Estonian is the unrounded back vowel /?/, which may be close-mid back, close back, or close-mid central.
Simple vowels can be inherently short or long, written with single and double vowel letters respectively. Diphthongs are always inherently long. Furthermore, long vowels and diphthongs have two suprasegmental lengths. This is described further below.
|Rhotic||r ~ ?|
Like the vowels, most consonants can be inherently short or long. For the plosives, this distinction is reflected as a distinction in tenseness/voicing, with short plosives being voiced and long plosives being voiceless. This distinction only applies fully for single consonants after stressed syllables. In other environments, the length or tenseness/voicing distinctions may be neutralized:
In addition, long consonants and clusters also have two suprasegmental lengths, like the vowels. This is described below.
Non-phonemic palatalization generally occurs before front vowels. In addition, about 0.15% of the vocabulary features fully phonemic palatalization, where palatalization occurs without the front vowel. A front vowel did historically occur there, but was lost, leaving the palatalization as its only trace (a form of cheshirization).[example needed] It mostly occurs word-finally, but in some cases it may also occur word-medially. Thus, palatalization does not necessarily need a front vowel, and palatalized vs. plain continuants can be articulated. Palatalization is not indicated in the standard orthography.
The stress in Estonian is usually on the first syllable, as was the case in Proto-Finnic. There are a few exceptions with the stress on the second syllable: aitäh ('thanks'), sõbranna ('female friend'). In loanwords, the original stress can be borrowed as well: ideaal ('ideal'), professor ('professor'). The stress is weak, and as length levels[clarification needed] already control an aspect of "articulation intensity", most words appear evenly stressed.
Additionally, Estonian has a complex system of secondary stresses, the placement of which is not always predictable. Words of more than three syllables can consist of combinations of monosyllabic, disyllabic and trisyllabic feet.
Syllables can be divided into short and long. Syllables ending in a short vowel are short, while syllables ending in a long vowel, diphthong or consonant are long. The length of vowels, consonants and thus syllables is "inherent" in the sense that it is tied to a particular word and is not subject to morphological alternations.
All stressed long syllables can possess a suprasegmental length feature. When a syllable has this feature, any long vowel or diphthong in the syllable is lengthened further, as is any long consonant or consonant cluster at the end of that syllable. A long syllable without suprasegmental length is termed "long", "half-long", "light" or "length II" and is denoted in IPA as ⟨?⟩ or ⟨:⟩. A long syllable with suprasegmental length is termed "overlong", "long", "heavy" or "length III", denoted in IPA as ⟨:⟩ or ⟨::⟩. For consistency, this article employs the terms "half-long" and "overlong" and uses ⟨:⟩ and ⟨::⟩, respectively, to denote them.
Both the regular short-long distinction and the suprasegmental length are distinctive, so that Estonian effectively has three distinctive vowel and consonant lengths, the distinction between the second and third length levels being at a level larger than the phoneme, such as the syllable or the foot. In addition to realizing greater phonetic duration, overlength in modern Estonian involves a pitch distinction where falling pitch is realized in syllables that are overlong and level pitch is realized in syllables that are short or long[clarification needed].
The suprasegmental length is not indicated in the standard orthography except for the plosives for which a single voiceless letter represents a half-long consonant while a double voiceless letter represents an overlong consonant. There are many minimal pairs and also some minimal triplets which differ only by length:
The extra length distinction has a number of origins: