Catalan orthography encompasses the spelling and punctuation of the Catalan language.
This section needs expansion. You can help by adding to it. (October 2018)
The orthographic norms of Catalan were first defined officially in the First International Congress of the Catalan Language, held in Barcelona October 1906. Subsequently, the Philological Section of the Institut d'Estudis Catalans (IEC, founded in 1911) published the Normes ortogràfiques in 1913 under the direction of Antoni Maria Alcover and Pompeu Fabra. Despite some opposition, the spelling system was adopted immediately and became widespread enough that, in 1932, Valencian writers and intellectuals gathered in Castelló de la Plana to make a formal adoption of the so-called Normes de Castelló, a set of guidelines following Pompeu Fabra's Catalan language norms.
In 1917, Fabra published an Orthographic Dictionary following the orthographic norms of the IEC. In 1931-32 the Diccionari General de la Llengua Catalana (General Dictionary of the Catalan language) appeared. In 1995, a new normative dictionary, the Dictionary of the Catalan Language of the Institute of Catalan Studies (DIEC), supposed a new milestone in the orthographic fixation of the language, in addition to the incorporation of neologisms and modern uses of the language.
The Catalan alphabet consists of the 26 letters of the ISO basic Latin alphabet:
The following letter-diacritic combinations are used, but they do not constitute distinct letters in the alphabet: À, É, È, Í, Ï, Ó, Ò, Ú, Ü and Ç. K and W are used only in loanwords. Outside loanwords, the letters Q and Y appear only in the digraphs qu, qü and ny. However, Y was used until the official orthography was established in 1913, when it was replaced with I, except in the digraph ny and loanwords.
|Name (IEC)||Pronunciation||Name (AVL)||Pronunciation|
|B b||be, be alta||/'be/, /'be 'alt?/||be||/'be/|
|F f||efa||/'ef?/||efe, ef||/'efe/, /'ef/|
|I i||i, i llatina||/'i/, /'i 'tin?/||i, i llatina||/'i/, /'i ?a'tina/|
|L l||ela||/'el?/||ele, el||/'ele/, /'el/|
|M m||ema||/'em?/||eme, em||/'eme/, /'em/|
|N n||ena||/'en?/||ene, en||/'ene/, /'en/|
|R r||erra||/'?r?/||erre, er||/'ere/, /'e?/|
|S s||essa||/'es?/||esse, es||/'ese/, /'es/|
|V v||ve, ve baixa||/'ve/, /'be 'ba/||ve||/'ve/|
|W w||ve doble||/'ve 'dob:l?/, /'be 'dob:l?/||ve doble||/'ve 'doble/|
|X x||ics, xeix||/'iks/, /'?e?/||ics, xeix||/'iks/, /'?ej?/|
|Y y||i grega||/'i 'e/||i grega||/'i 'e?a/|
The names efa, ela, ema, ena, erra, and essa can also be used in dialectal Valencian.[which?]
Catalan is a pluricentric language; the pronunciation of some of the letters is different in Eastern Catalan (IEC) and Valencian (AVL). Apart from those variations, the pronunciation of most consonants is fairly straightforward and is similar to French, Occitan or Portuguese pronunciation.
|bm||/bm/ or /m:/|
|bv||/b:/ or /bv/||/bv/|
|ch*||/k/, /x/, /?/|
|cn||/?n/ or /n:/, /n/|
|cz||/?z/ or /ks/|
|dq(u)||/tk/ or /k:/|
|dv||/db/ or /b:/ or /dv/||/dv/ (or */bv/)|
|gg||/?(:)/ or /d?/,/?(:)/||/d?/,/?(:)/|
|gm||/?m/ or /m:/|
|gn||/?n/ or /n:/, /n/|
|igj||/id?/ or /i?(:)/||/id?/|
|igs / (t)jos||/t?/ or /(d)?us/||/t?/ or /d?os/|
|ld||/ld/, /l/||/ld/, /lt/|
|lt||/lt/, /l/||/lt/, /(l)t/|
|mb||/mb/, /m/, /m(b)/||/mb/, /mp/, /m(b)/|
|nd||/nd/, /n/||/nd/, /nt/|
|ntm||/m:/||/ntm/ or /m:/|
|nv||/mb/ or /nv/||/nv/|
|pm||/bm/ or /m:/|
|pn||/bn/ or /n:/, /n/|
|rs||/?s/, /s/||/?s/, /(?)s/|
|sc, sch*||/sk/, /?k/, /esk/, /s/,/?/|
|sh||/z/, /?/||/z/, /z/ or /s/, /?/|
|st||/st/, /s(t)/, /(s)t/||/st/|
|tb||/db/ or /b:/|
|tl||/dl/ or /l:/|
|tll (tl)||/?:/||/l(:)/, /?/|
|tm||/dm/ or /m:/, /m:/, /m/||/dm/ or /m:/, /m/, /m/|
|tn||/dn/ or /n:/, /n:/||/dn/ or /n:/, /n(:)/|
|th||/t/, /th/ or /t/, /?/|
|xh||/?z/||/?z/ or /ks/|
Catalan also uses the acute and grave accents to mark stress or vowel quality. An acute on ⟨é ó⟩ indicates that the vowel is stressed and close-mid (/e o/), while grave on ⟨è ò⟩ indicates that the vowel is stressed and open-mid (/? ?/). Grave on ⟨à⟩ and acute on ⟨í ú⟩ simply indicates that the vowels are stressed. Thus, the acute is used on close or close-mid vowels, and the grave on open or open-mid vowels. For example:
Standard rules governing the presence of accents are based on word endings and the position of the stressed syllable. In particular, accents are expected for:
Since there is no need to mark the stressed syllable of a monosyllabic word, most of them do not have an accent. Exceptions to this are those with a diacritical accent that differentiates some cases of words that would otherwise be homographic. Example: es [?s] ('it' impersonal) vs és ['es] ('is'), te [t?] ('you' clitic) vs té ['te] ('s/he has'), mes ['m?s] ('month') vs més ['mes] ('more'), dona ['d?n?] ('woman') vs dóna ['don?] ('s/he gives'). In most cases, the word bearing no accent is either unstressed (as in the case of 'es' and 'te'), or the word without the accent is more common, usually a function word.
The different distribution of open e [?] vs closed e [e] between Eastern Catalan and Western Catalan is reflected in some orthographic divergences between standard Catalan and Valencian norms, example: anglès ['s] (Catalan) vs anglés [a?'es] (Valencian) ('English').
The diaeresis has two different uses: to mark hiatus over ⟨ï, ü⟩, and to mark that ⟨u⟩ is not silent in the groups ⟨gü, qü⟩.
This diaeresis is not used over a stressed vowel that already should have an accent. Examples: suís [su'is] ('Swiss' masculine), but suïssa [su'is?] ('Swiss' feminine), suïs ['suis] ('that you sweat' subjunctive) (without the diaeresis, this last example would be pronounced ['sujs], i.e. as only one syllable, like reis ['rejs] 'kings').
Certain verb forms of verbs ending in -uir do not receive a diaeresis, although they are pronounced with separate syllables. This concerns the infinitive, gerund, future and conditional forms (for example traduir, traduint, traduiré and traduiria, all with bisyllabic [u.i]). All other forms of such verbs do receive a diaeresis on the ï according to the normal rules (e.g. traduïm, traduïa).
The verb argüir represents a rare case of the sequence [?u.i], and the rules for [gu] and [ui] clash in this case. The ambiguity is resolved by an additional rule, which states that in cases where diaereses would appear on two consecutive letters, only the second receives one. This thus gives arguïm and arguïa, but argüir, argüint and argüiré as these forms don't receive a diaeresis on the i normally, according to the exception above.
Catalan ce trencada (Ç ç), literally in English 'broken cee', is a modified ⟨c⟩ with a cedilla mark ( ¸ ). It is only used before ⟨a u o⟩ to indicate a "soft c" /s/, much like in Portuguese, Occitan or French (e.g. compare coça ['kos?] 'kick', coca ['kok?] 'cake' and cosa ['k?z?] 'thing'). In Catalan, ce trencada also appears as last letter of a word when preceded by any vowel (e.g. feliç [f?'?is] 'happy'), but then ⟨ç⟩ may be voiced to [z] before vowels and voiced consonants, e.g. feliçment [fiz'men] ('happily') and braç esquerre ['b?az ?s'k?r?] ('left arm').
The so-called punt volat or middot is only used in the group ⟨?l⟩ (called ela or el(e) geminada, 'geminate el') to represent a geminated sound /l:/, as ⟨ll⟩ is used to represent the palatal lateral /?/. This usage of the middot sign is a recent invention from the beginning of twentieth century (in medieval and modern Catalan, before Fabra's standardization, this symbol was sometimes used to note certain elisions, especially in poetry). The only (and improbable) case of ambiguity in the whole language that could arise is the pair ce?la ['s?] ('cell') vs cella ['s] ('eyebrow').
Catalan does not capitalize the days of the week, months, or national adjectives.
The Catalan punctuation rules are similar to English, with some minor differences.
The distribution of the two rhotics /r/ and /?/ closely parallels that of Spanish. Between vowels, the two contrast but they are otherwise in complementary distribution: in the onset, an alveolar trill, [r], appears unless preceded by a consonant; different dialects vary in regards to rhotics in the coda with Western Catalan generally featuring an alveolar tap, [?], and Central Catalan dialects like those of Barcelona or Girona featuring a weakly trilled [r] unless it precedes a vowel-initial word in the same prosodic unit, in which case [?] appears.
In Eastern Catalan and North Western Catalan, most instances of word-final ⟨r⟩ are silent, but there are plenty of unpredictable exceptions (e.g. in Central Eastern Catalan por ['po] 'fear' but mar ['mar] 'sea'). In Central Eastern Catalan monosyllabic words with a pronounced final ⟨r⟩ get a reinforcement final consonant [t] when in absolute final position (e.g. final ⟨r⟩ of cor ('heart') in reina del meu cor ['rejn? ð 'mew 'k?rt] 'queen of my heart' vs el cor es mou [ 'k ?s 'm?w] 'the heart is moving').
In Valencian, most instances of word-final ⟨r⟩ are pronounced.