French orthography encompasses the spelling and punctuation of the French language. It is based on a combination of phonemic and historical principles. The spelling of words is largely based on the pronunciation of Old French c. 1100-1200 CE and has stayed more or less the same since then, despite enormous changes to the pronunciation of the language in the intervening years. This has resulted in a complicated relationship between spelling and sound, especially for vowels; a multitude of silent letters; and many homophones (e.g., saint/sein/sain/seing/ceins/ceint (all pronounced [s]), sang/sans/cent (all pronounced [s])). Later attempts to respell some words in accordance with their Latin etymologies further increased the number of silent letters (e.g., temps vs. older tens - compare English "tense", which reflects the original spelling - and vingt vs. older vint). Nevertheless, there are rules governing French orthography which allow for a reasonable degree of accuracy when pronouncing French words from their written forms. The reverse operation, producing written forms from pronunciation, is much more ambiguous.
|Letter||Name||Name (IPA)||Diacritics and ligatures|
|A||a||/a/||Àà, Ââ, Ææ|
|E||e||/?/||Éé, Èè, Êê, Ëë|
|U||u||/y/||Ùù, Ûû, Üü|
The letters ⟨w⟩ and ⟨k⟩ are rarely used except in loanwords and regional words. The phoneme /w/ sound is usually written ⟨ou⟩; the /k/ sound is usually written ⟨c⟩ anywhere but before ⟨e, i, y⟩, ⟨qu⟩ before ⟨e, i, y⟩, and sometimes ⟨que⟩ at the ends of words. However, ⟨k⟩ is common in the metric prefix kilo- (originally from Greek khilia "a thousand"): kilogramme, kilomètre, kilowatt, kilohertz, etc.
The usual diacritics are the acute (⟨´⟩, accent aigu), the grave (⟨`⟩, accent grave), the circumflex (⟨^⟩, accent circonflexe), the diaeresis (⟨¨⟩, tréma), and the cedilla (⟨¸ ⟩, cédille). Diacritics have no effect on the primary alphabetical order.
The tilde diacritical mark ( ~ ) above n is occasionally used in French for words and names of Spanish origin that have been incorporated into the language (e.g., El Niño). Like the other diacritics, the tilde has no impact on the primary alphabetical order.
Diacritics are often omitted on capital letters, mainly for technical reasons. It is widely believed that they are not required; however both the Académie française and the Office québécois de la langue française reject this usage and confirm that "in French, the accent has full orthographic value", except for acronyms but not for abbreviations (e.g., CEE, ALENA, but É.-U.). Nevertheless, diacritics are often ignored in word games, including crosswords, Scrabble, and Des chiffres et des lettres.
(French: o, e dans l'o or o, e collés/liés) This ligature is a mandatory contraction of ⟨oe⟩ in certain words. Some of these are native French words, with the pronunciation /oe/ or /ø/, e.g., choeur "choir" /koe?/, coeur "heart" /koe?/, moeurs "moods (related to moral)" /moe?, moe?s/, noeud "knot" /nø/, soeur "sister" /soe?/, oeuf "egg" /oef/, oeuvre "work (of art)" /oev?/, voeu "vow" /vø/. It usually appears in the combination oeu; oeil /oej/ "eye" is an exception. Many of these words were originally written with the digraph eu; the o in the ligature represents a sometimes artificial attempt to imitate the Latin spelling: Latin bovem > Old French buef/beuf > Modern French boeuf.
OE is also used in words of Greek origin, as the Latin rendering of the Greek diphthong , e.g., coelacanthe "coelacanth". These words used to be pronounced with the vowel /e/, but in recent years a spelling pronunciation with /ø/ has taken hold, e.g., oesophage /ez?fa?/ or /øz?fa?/, OEdipe /edip/ or /ødip/ etc. The pronunciation with /e/ is often seen to be more correct.
When oe is found after the letter c, the c can be pronounced /k/ in some cases (coeur), or /s/ in others (coelacanthe).
The ligature oe is not used when both letters contribute different sounds. For example, when ⟨o⟩ is part of a prefix (coexister), or when ⟨e⟩ is part of a suffix (minoen), or in the word moelle and its derivatives.
(French: a, e dans l'a or a, e collés/liés) This ligature is rare, appearing only in some words of Latin and Greek origin like tænia, ex æquo, cæcum, æthuse (as named dog's parsley). It generally represents the vowel /e/, like ⟨é⟩.
The sequence ⟨ae⟩ appears in loanwords where both sounds are heard, as in maestro and paella.
This section needs expansion. You can help by adding to it. (August 2008)
French digraphs and trigraphs have both historical and phonological origins. In the first case, it is a vestige of the spelling in the word's original language (usually Latin or Greek) maintained in modern French, for example, the use of ⟨ph⟩ in words like téléphone, ⟨th⟩ in words like théorème, or ⟨ch⟩ in chaotique. In the second case, a digraph is due to an archaic pronunciation, such as ⟨eu⟩, ⟨au⟩, ⟨oi⟩, ⟨ai⟩, and ⟨oeu⟩, or is merely a convenient way to expand the twenty-six-letter alphabet to cover all relevant phonemes, as in ⟨ch⟩, ⟨on⟩, ⟨an⟩, ⟨ou⟩, ⟨un⟩, and ⟨in⟩. Some cases are a mixture of these or are used for purely pragmatic reasons, such as ⟨ge⟩ for /?/ in il mangeait ('he ate'), where the ⟨e⟩ serves to indicate a "soft" ⟨g⟩ inherent in the verb's root.
This section possibly contains original research. (April 2012) (Learn how and when to remove this template message)
|Examples of major value||Minor values
|Examples of minor value||Exceptions|
|-bs, -cs (in the plural of words ending in
silent b or c), -ds, -fs (in oeufs and boeufs,
and words ending with silent -f in the singular), -gs, -ps, -ts
|Ø||, , , , , , ,|
|b, bb||elsewhere||, abbé|
|before a voiceless consonant||, observer, subtile|
|ç||, garçon, reçu|
|c||before e, i, y||, loquace, douce, ciel, ceux|
|initially/medially elsewhere||, crasse, coeur, sacré||coelacanthe||second|
|finally||, donc||Ø||, blanc, caoutchouc||zinc|
|cc||before e, i, y||/ks/|
|ch||, douche||(often in words of Greek origin)||chaotique, chlore, varech||Ø yacht, almanach |
chips, check-list, strech, coach
|-ct||/kt/||, correct||Ø||respect, suspect, instinct, succinct|
|d, dd||elsewhere||, adresse, addition|
|finally||Ø||, accord||David, sud|
|f, ff||, affoler, soif||Ø clef, cerf, nerf|
|g||before e, i, y||, manger||gin, management, adagio|
|initially/medially elsewhere||, glacier|
|finally||Ø||, long, sang||zigzag|
|gg||before e, i, y||/??/|
|gn||, agneau, gnôle||/?n/ cognitif, gnose, gnou|
|h||Ø||, hiver|| Hainan|
ahaner (also Ø)
|j||, jeter||jean, jazz|| fjord|
|k||, kilomètre, bifteck||Ø skunks, knock-out, knickerbockers, knickers|
|l, ll||, allier, il, royal, matériel||Ø (occasionally finally)||cul, fusil, saoul||Ø fils, aulne, aulx|
(see also -il)
|m, mm||, pomme||Ø automne, condamner|
|n, nn||, panne|
|ng (in loanwords)||, camping|
|p, pp||elsewhere||, appel|
|finally||Ø||coup, trop||, cep|
|medially||/pt/||, excepter||baptême, compter|
|finally||/pt/||Ø||prompt (also /pt/)||sept|
|q (see qu)||, cinq, piqûre, Qatar|
|r, rr||, barre||Ø monsieur, gars|
(see also -er)
medially next to a consonant
or after a nasal vowel
|, estime, penser, instituer||Alsace, transat, transiter||esque|
|elsewhere between two vowels||, paysage||antisèche, parasol, vraisemblable|
|finally||Ø||, repas||fils, sens (noun), os (singular), ours|
|sc||before e, i, y||fasciste (also )|
|sch||, haschisch, esche||/sk/||schizoïde, ischion, æschne|
|-st||/st/||est (direction), ouest, podcast||Ø||est (verb),
Jésus-Christ (also /st/)
|t, tt||elsewhere||, attente||nation (see ti + vowel)|
|finally||Ø||, raffut||dot, brut, yaourt|
|th||, thermique, aneth||Ø asthme, bizuth|
|w||, week-end, whisky||wagon, schwa, interviewer||(see also aw, ew, ow)|
next to a voiceless consonant
|/ks/||, expansion, connexe||/?z/||xénophobie, Xavier||xhosa, xérès (also /ks/)|
|medially elsewhere||/?z/||, exulter||
|finally||Ø||, deux||/ks/||index, pharynx||six, dix, coccyx|
|xc||before e, i, y||/ks/|
|Examples of major value||Minor values
|Examples of minor value||Exceptions|
|a, à||patte, arable, là, déjà||araser, base, condamner|| yacht|
|â||château, pâté||dégât (also ), parlâmes, liâtes, menât (simple past and imperfect subjunctive verb forms ending in -âmes, -âtes, and -ât)|
|aa||graal, Baal, maastrichtois||/a.a/||aa|
|ae||reggae||/a/||groenendael, maelstrom, Portaels||/a.?/ maestro|
|aë||/a.?/||Raphaël, Israël||/a/ Staël|
ai, aiguille, baisser, gai, quai
|lançai, mangerai (future and simple past verb forms ending in -ai or -rai)||faisan, faisons,(and all other conjugated forms of faire which are spelt fais- and followed by a pronounced vowel)|
|aï||/a.i/||naïf, haïr||/aj/||aïe, aïeul, haïe, païen|
|ao, aô||elsewhere||/a.?/||aorte, extraordinaire (also )||/a.o/||baobab|| faonne, paonneau|
|phonologically finally||/a.o/||cacao, chaos||curaçao|
|aou, aoû||/a.u/||caoutchouc, aoûtien, yaourt||saoul, août|
|before r||dinosaure, Aurélie, Laurent (also )|
|ay||elsewhere||/?j/||ayons, essayer (also /ej/)||/aj/||mayonnaise, papaye, ayoye||/ei/ pays (also /?i/)|
|finally||Gamay, margay, railway||okay|
|-aye||/?.i/||abbaye||/?j/||paye|| La Haye |
|before two or more consonants
(including double consonants),
x (in all cases), or
a final consonant (silent or pronounced)
|est, estival, voyelle, examiner, exécuter, quel, chalet||essence, effet, henné
recherche, secrète, repli (before ch+vowel or 2 different consonants when the second one is l or r)
| mangez, (and any form of a verb in the second person plural that ends in -ez).|
femme, solennel, fréquemment, (and other adverbs ending in -emment)
Gennevilliers (see also -er, -es)
|in monosyllabic words before a silent consonant||et, les, nez, clef||es|
in a position where
it can be easily elided
|?||caisse, unique, acheter (also ), franchement||(finally in monosyllabic words)||que, de, je|
|é, ée||clé, échapper, idée||(in closed syllables)||événement, céderai, vénerie|
|ê||tête, crêpe, forêt, prêt||bêtise|
|ea (except after g)||dealer, leader, speaker|
|ei||neige, reine, geisha (also /?j/)||/aj/ leitmotiv (also /?/)|
|Europe, heureux, peu, chanteuse||eu, eussions, (and any conjugated form of avoir spelt with eu-), gageure (in new orthography, gageüre)|
|elsewhere||beurre, jeune||feutre, neutre, pleuvoir|
|eû||jeûne||eûmes, eût, (and any conjugated forms of avoir spelt with eû-)|
|ey||before vowel||/?j/||gouleyant, volleyer|
|i||elsewhere||ici, proscrire||Ø business|
|before vowel||fief, ionique, rien||(in compound words)||antioxydant|
|ï (initially or between vowels)||ïambe, aïeul, païen||ouïe|
|pro, mot, chose, déposes||sosie|
|elsewhere||carotte, offre||cyclone, fosse, tome|
|ô||tôt, cône||hôpital (also )|
|oe||/?.e/||coefficient||/wa/ moelle, moellon|
/w?/ moelleux (also /wa/)
|oë||/?.?/||Noël||/?.e/ canoë, goëmon|
/w?/ foëne, Plancoët
|oeu||phonologically finally||noeud, oeufs, boeufs, voeu|
|elsewhere||soeur, coeur, oeuf, boeuf|
|oi, oie||/wa/||roi, oiseau, foie, quoi||/w?/||bois, noix, poids, trois|| oignon|
|oo||/?.?/||coopération, oocyte, zoologie||bazooka, cool, football|| alcool, Boskoop, rooibos|
spéculoos, mooré, zoo
|ou, où||elsewhere||ouvrir, sous, où||/o.y/ pseudouridimycine|
/aw/ out, knock-out
|before vowel or h+vowel||ouest, couiner, oui, souhait (also /u/)|
|oy||/waj/||moyen, royaume||/wa/||Fourcroy||/?j/ oyez (and any conjugated form of ouïr spelt with oy-), goyave, cow-boy, ayoy|
|u||elsewhere||tu, juge|| tofu, pudding|
rhumerie (see also um)
|before vowel||huit, tuer||pollueur||cacahuète (also )|
|ue, üe||elsewhere||/??/||actuel, ruelle||(see below)||orgueil, cueillir|
|uy||/?ij/||bruyant, ennuyé, fuyons, Guyenne||/y.j/||gruyère, thuya||/?i/ puy|
|before vowel||yeux, yole||polyester, Libye|
|ÿ||(used only in proper nouns)||L'Haÿ-les-Roses, Freÿr|
|Examples of major value||Minor values
|Examples of minor value||Exceptions|
|am (before consonant)||ambiance, lampe||damné|
|-am (finally)||/am/||Vietnam, tam-tam, macadam||Adam|
|an (before consonant or finally)||France, bilan||/an/||brahman, chaman, dan, gentleman, tennisman|
|aen, aën (before consonant or finally)||Caen, Saint-Saëns|
|aim, ain (before consonant or finally)||faim, saint, bains|
|aon (before consonant or finally)||paon, faon||/a./||pharaon|
|aw||crawl, squaw, yawl||/?s/ Law|
|-cte (as the feminine adjective ending for words ending in a silent "ct" (see above))||succincte|
|em, en (before consonant or finally elsewhere)||embaucher, vent||examen, ben, pensum, pentagone||/?n/ week-end, lichen|
/?m/ indemne, totem
|em, en (before consonant[clarification needed] or finally after é, i, or y)||européen, bien, doyen||(before t or soft c)||patient, quotient, science, audience|
|eim, ein (before consonant or finally)||plein, sein, Reims|
|-ent (3rd person plural verb ending)||Ø||parlent, finissaient|
|-er||aller, transporter, premier||/??/||hiver, super, éther, fier||/oe?/ leader (also ??), speaker|
|-es||Ø||Nantes, faites||,||les, des, ces, es|
|eun (before consonant or finally)||jeun|
|ew||/ju/||newton, steward (also iw)||chewing-gum|
|ge (before a, o, u)||geai, mangea|
|gu (before e, i, y)||guerre, dingue||/?y/, /??/||argue (and any conjugated form of arguer), aiguille, linguistique, ambiguïté|
|-il (after some vowels)1||ail, conseil|
|-il (not after vowel)||/il/||il, fil||outil, fils, fusil|
|-ill- (after some vowels)1||paille, nouille|
|-ill- (not after vowel)||/ij/||grillage, bille||/il/||mille, million, billion, ville, villa, village, tranquille|
|im, in, în (before consonant or finally)||importer, vin, vînt||/in/ sprint|
|oin, oën (before consonant or finally)||/w/||besoin, point, Samoëns|
|om, on (before consonant or finally)||ombre, bon||/?n/ canyon|
|ow||cow-boy, show|| clown|
|qu||quand, pourquoi, loquace||/k?/
aquarium, loquace, quatuor
|/ky/ piqure, qu|
|ti + vowel (initially or after s or x)||/tj/||bastion, gestionnaire, tiens, aquae-sextien|
|ti + vowel (elsewhere)||/sj/, /si/||fonctionnaire, initiation, Croatie, haïtien||/tj/, /ti/||the suffix -tié, all conjugated forms of
verbs with a radical ending in -t
(augmentions, partiez, etc.) or derived from
tenir, and all nouns and past participles derived
from such verbs and ending in -ie (sortie, divertie, etc.)
|um, un (before consonant or finally)||parfum, brun||/?m/||album, maximum||nuncupation, punch, secundo|
|ym, yn (before consonant or finally)||sympa, syndrome||/im/||gymnase, hymne|
The spelling of French words of Greek origin is complicated by a number of digraphs which originated in the Latin transcriptions. The digraphs ⟨ph⟩, ⟨th⟩, and ⟨ch⟩ normally represent /f/, /t/, and /k/ in Greek loanwords, respectively; and the ligatures ⟨æ⟩ and ⟨oe⟩ in Greek loanwords represent the same vowel as ⟨é⟩ . Further, many words in the international scientific vocabulary were constructed in French from Greek roots and have kept their digraphs (e.g., stratosphère, photographie).
This section needs expansion. You can help by adding to it. (June 2008)
The Oaths of Strasbourg from 842 is the earliest text written in the early form of French called Romance or Gallo-Romance.
The Gaulish language of the inhabitants of Gaul disappeared progressively over the course of Roman rule as the Latin languages began to replace it: written (Classical) Latin and spoken (vulgar) Latin. Classical Latin, taught in schools, remained the language of religious services, of scientific works, of legislative acts and of certain literary works. Vulgar Latin, spoken by the Roman soldiers and merchants, and adopted by the natives, evolved slowly, taking the forms of different spoken Roman vernaculars according to the region of the country.
Eventually the different forms of Vulgar Latin would evolve into three branches in the Gallo-Romance language sub-family, the langues d'oïl north of the Loire, the langues d'oc in the south, and the Franco-Provençal languages in part of the east.
In the 9th century, the Romance vernaculars were already quite far from Latin. For example, to understand the Bible, written in Latin, footnotes were necessary. With consolidation of royal power, beginning in the 13th century, the Francien vernacular, the langue d'oil variety in usage then on the Île-de-France, brought it little by little to the other languages and evolved toward Classic French.
The languages found in the manuscripts dating from the 9th century to the 13th century form what is known as Old French or ancien français. These languages continued to evolve until, in the 14th century to the 16th century, Middle French (moyen français) emerged.
During the Middle French period (c. 1300-1600), modern spelling practices were largely established. This happened especially during the 16th century, under the influence of printers. The overall trend was towards continuity with Old French spelling, although some changes were made under the influence of changed pronunciation habits; for example, the Old French distinction between the diphthongs eu and ue was eliminated in favor of consistent eu,[a] as both diphthongs had come to be pronounced /ø/ or /oe/ (depending on the surrounding sounds). However, many other distinctions that had become equally superfluous were maintained, e.g. between s and soft c or between ai and ei. It is likely that etymology was the guiding factor here: the distinctions s/c and ai/ei reflect corresponding distinctions in the spelling of the underlying Latin words, whereas no such distinction exists in the case of eu/ue.
This period also saw the development of some explicitly etymological spellings, e.g. temps ("time"), vingt ("twenty") and poids ("weight") (note that in many cases, the etymologizing was sloppy or occasionally completely incorrect; vingt reflects Latin viginti, with the g in the wrong place, and poids actually reflects Latin pensum, with no d at all; the spelling poids is due to an incorrect derivation from Latin pondus). The trend towards etymologizing sometimes produced absurd (and generally rejected) spellings such as sçapvoir for normal savoir ("to know"), which attempted to combine Latin sapere ("to be wise", the correct origin of savoir) with scire ("to know").
Modern French spelling was codified in the late 17th century by the Académie française, based largely on previously established spelling conventions. Some reforms have occurred since then, but most have been fairly minor. The most significant changes have been:
In October 1989, Michel Rocard, then-Prime Minister of France, established the High Council of the French Language (Conseil supérieur de la langue française) in Paris. He designated experts -- among them linguists, representatives of the Académie française and lexicographers -- to propose standardizing several points, a few of those points being:
Quickly, the experts set to work. Their conclusions were submitted to Belgian and Québécois linguistic political organizations. They were likewise submitted to the Académie française, which endorsed them unanimously, saying: "Current orthography remains that of usage, and the 'recommendations' of the High Council of the French language only enter into play with words that may be written in a different manner without being considered as incorrect or as faults."
The changes were published in the Journal officiel de la République française in December 1990. At the time the proposed changes were considered to be suggestions. In 2016, schoolbooks in France began to use the newer recommended spellings, with instruction to teachers that both old and new spellings be deemed correct.
In France, the exclamation mark, question mark, semicolon, colon, percentage mark, currency symbols, hash, and guillemet all require a non-breaking space before and after the punctuation mark. Outside of France, this rule is often ignored. Computer software may aid or hinder the application of this rule, depending on the degree of localisation, as it is marked differently from most other Western punctuation.
The hyphen in French has a particular use in geographic names that is not found in English. Traditionally, the "specific" part of placenames, street names, and organization names are hyphenated (usually namesakes). For instance, la place de la Bataille-de-Stalingrad (Square of the Battle of Stalingrad [la bataille de Stalingrad]); and l'université Blaise-Pascal (named after Blaise Pascal). Likewise, Pas-de-Calais is actually a place on land; the real pas ("strait") is le pas de Calais.
However, this rule is not uniformly observed in official names, e.g., either la Côte-d'Ivoire or la Côte d'Ivoire, but normally la Côte d'Azur has no hyphens. The names of Montreal Metro stations are consistently hyphenated when suitable, but those of Paris Métro stations mostly ignore this rule. (For more examples, see Trait d'union)
On le met dans le nom donné à des voies (rue, place, pont...), une agglomération, un département... Exemples : boulevard Victor-Hugo, rue du Général-de-Gaulle, ville de Nogent-le-Rotrou.Summary ranslation: "Hyphenate name in roadways (streets, squares, bridges), towns, départements". See also "orthotypography".
Les parties d'un spécifique qui comporte plus d'un élément sont liées par un trait d'union [...] Exemples : l'école Calixa-Lavallée, l'école John-F.-Kennedy. Summary ranslation: "Multi-word "specifics" are hyphenated.".