The romanization or Latinization of Ukrainian is the representation of the Ukrainian language using Latin letters. Ukrainian is natively written in its own Ukrainian alphabet, which is based on the Cyrillic script. Romanization may be employed to represent Ukrainian text or pronunciation for non-Ukrainian readers, on computer systems that cannot reproduce Cyrillic characters, or for typists who are not familiar with the Ukrainian keyboard layout. Methods of romanization include transliteration, representing written text, and transcription, representing the spoken word.
Transliteration is the letter-for-letter representation of text using another writing system. Rudnyckyj classified transliteration systems into the scholarly system, used in academic and especially linguistic works, and practical systems, used in administration, journalism, in the postal system, in schools, etc. The scholarly or scientific system is used internationally, with very little variation, while the various practical methods of transliteration are adapted to the orthographical conventions of other languages, like English, French, German, etc.
Depending on the purpose of the transliteration it may be necessary to be able to reconstruct the original text, or it may be preferable to have a transliteration which sounds like the original language when read aloud.
Also called scientific transliteration, this system is most often seen in linguistic publications on Slavic languages. It is purely phonemic, meaning each character represents one meaningful unit of sound, and is based on the Croatian Latin alphabet. It was codified in the 1898 Prussian Instructions for libraries, or Preußische Instruktionen (PI). It was later adopted by the International Organization for Standardization, with minor differences, as ISO/R 9.
Representing all of the necessary diacritics on computers requires Unicode, Latin-2, Latin-4, or Latin-7 encoding. Other Slavic based romanizations occasionally seen are those based on the Slovak alphabet or the Polish alphabet, which include symbols for palatalized consonants.
The ALA-LC Romanization Tables, published by the American Library Association (1885) and Library of Congress (1905). Used to represent bibliographic information by US and Canadian libraries, by the British Library since 1975, and in North American publications. The latest 1997 revision is very similar to the 1905 version.
Requires Unicode for connecting diacritics--these are used in bibliographies and catalogues, but typically omitted in running text.
British Standard 2979:1958, from BSI, is used by the Oxford University Press. A variation is used by the British Museum and British Library, but since 1975 their new acquisitions have been catalogued using Library of Congress transliteration.
BGN/PCGN romanization is a series of standards approved by the United States Board on Geographic Names and Permanent Committee on Geographical Names for British Official Use, and also adopted by the United Nations (but superseded there by the Ukrainian National System from 2012). Pronunciation is intuitive for English-speakers. Latest revision is from 1965. A modified version is also mentioned in the Oxford Style Manual.
Requires only ASCII characters if optional separators are not used.
The Soviet Union's GOST, COMECON's SEV, and Ukraine's Derzhstandart are government standards bodies of the former Eurasian communist countries. They published a series of romanization systems for Ukrainian, which were replaced by ISO 9:1995. For details, see GOST 16876-71.
ISO 9 is a standard from the International Organization for Standardization. It supports most national Cyrillic alphabets in a single transliteration table. Each Cyrillic character is represented by exactly one unique Latin character, so the transliteration is reliably reversible. This was originally derived from the Scholarly system in 1954, and is meant to be usable by readers of most European languages.
The 1995 revision considers only graphemes and disregards phonemic differences. So, for example, ? (Ukrainian He or Russian Ge) is always represented by the transliteration g; ? (Ukrainian letter Ge) is represented by g?.
Representing all of the necessary diacritics on computers requires Unicode, and a few characters are rarely present in computer fonts, for example g-grave: g?.
This is the official system of Ukraine, also employed by the United Nations and many countries' foreign services. It is currently widely used to represent Ukrainian geographic names, which were almost exclusively romanized from Russian before Ukraine's independence in 1991, and for personal names in passports. It is based on English orthography, and requires only ASCII characters with no diacritics.
Its first version was codified in Decision No. 9 of the Ukrainian Committee on Issues of Legal Terminology on April 19, 1996, stating that the system is binding for the transliteration of Ukrainian names in English in legislative and official acts.
A new official system was introduced for transliteration of Ukrainian personal names in Ukrainian passports in 2007.
An updated 2010 version became the system is used for transliterating all proper names and was approved as Resolution 55 of the Cabinet of Ministers of Ukraine, January 27, 2010. This modified earlier laws and brought together a unified system for official documents, publication of cartographic works, signs and indicators of inhabited localities, streets, stops, subway stations, etc.
It has been adopted internationally. The 27th session of the UN Group of Experts on Geographical Names (UNGEGN) held in New York 30 July and 10 August 2012  after a report by the State Agency of Land Resources of Ukraine (now known as Derzhheokadastr: Ukraine State Service of Geodesy, Cartography and Cadastre) experts approved the Ukrainian system of romanization.
Romanization intended for readers of other languages than English is usually transcribed phonetically into the familiar orthography. For example, y, kh, ch, sh, shch for anglophones may be transcribed j, ch, tsch, sch, schtsch for German readers (for letters ?, ?, ?, ?, ?). Or it may be rendered in Latin letters according to the normal orthography of another Slavic language, such as Polish or Croatian (as does the established scholarly system, above).
Users of public-access computers or mobile text messaging services sometimes improvise informal romanization due to limitations in keyboard or character set. These may include both sound-alike and look-alike letter substitutions. Example: YKPAIHCbKA ABTOPKA for "? ?". See also Volapuk encoding.
This system uses the available character set.
For telegraph transmission. Each separate Ukrainian letter had a 1:1 equivalence to a Latin letter. Latin Q, W, V, and X are equivalent to Ukrainian ? (or sometimes ?), ?, ?, ?. Other letters are transcribed phonetically. This equivalency is used in building the KOI8-U table.
Transcription is the representation of the spoken word. Phonological, or phonemic, transcription represents the phonemes, or meaningful sounds of a language, and is useful to describe the general pronunciation of a word. Phonetic transcription represents every single sound, or phone, and can be used to compare different dialects of a language. Both methods can use the same sets of symbols, but linguists usually denote phonemic transcriptions by enclosing them in slashes / ... /, while phonetic transcriptions are enclosed in square brackets [ ... ].
The International Phonetic Alphabet precisely represents pronunciation. Requires a special Unicode font.
In many contexts, it is common to use a modified system of transliteration that strives to be read and pronounced naturally by anglophones. Such transcriptions are also used for the surnames of people of Ukrainian ancestry in English-speaking countries (personal names have often been translated to equivalent or similar English names, e.g., "Alexander" for Oleksandr, "Terry" for Taras).
Usually such a usage is based on either the Library of Congress (in North America) or British Standard system. Such a simplified system usually omits diacritics and tie-bars, simplifies -y? and -i? word endings to "-y", ignores the Ukrainian soft sign (?) and apostrophe ('), and may substitute ya, ye, yu, yo for ia, ie, iu, io at the beginnings of words. It may also simplify doubled letters. Unlike in the English language where an apostrophe is punctuation, in the Ukrainian language it is a letter. Therefore sometimes Rus' is translated with an apostrophe, even when the apostrophe is dropped for all other names and words.
Conventional transliterations can reflect the history of a person or place. Many well-known spellings are based on transcriptions into another Latin alphabet, such as the German or Polish. Others are transcribed from equivalent names in other languages, for example Ukrainian Pavlo ("Paul") may be called by the Russian equivalent Pavel, Ukrainian Kyiv by the Russian equivalent Kiev.
The employment of romanization systems can become complex. For example, the English translation of Kubijovy?'s Ukraine: A Concise Encyclopædia uses a modified Library of Congress (ALA-LC) system as outlined above for Ukrainian and Russian names--with the exceptions for endings or doubled consonants applying variously to personal and geographic names. For technical reasons, maps in the Encyclopedia follow different conventions. Names of persons are anglicized in the encyclopedia's text, but also presented in their original form in the index. Various geographic names are presented in their anglicized, Russian, or both Ukrainian and Polish forms, and appear in several forms in the index. Scholarly transliteration is used in linguistics articles. The Encyclopedia's explanation of its transliteration and naming convention occupies 2-1/2 pages.
|? ?||h||h||h||h||g||h, gh¹||h||h|
|? ?||je||i?e||ye||ye||ê||ie, ye²||ie||je|
|? ?||ji (ï)||ï||yi||yi||ï||i, yi²||ï||ji|
|? ?||j||?||?||y||j||i, y²||y||j|
|? ?||s||s||s||s||s||s||s||s, ss|
|? ?||?||?||', '||'||?||-||-||-|
|? ?||ju||i?u||yu||yu||û||iu, yu²||iou||ju|
|? ?||ja||i?a||ya||ya||â||ia, ya²||ia||ja|
|'||- (?)||-||", "||"||'||-||-||-|
|? ?||", "|
|Cyrillic||GOST 1971||GOST 1986||Derzhstandart 1995||National 1996||Passport 2004||Passport 2007||National 2010|
|? ?||v||v||v||v||v, w||v||v|
|? ?||g||g||gh||h, gh++||h, g||g||h, gh++|
|? ?||-||-||g||g||g, h||g||g|
|? ?||je||je||je||ie, ye*||ie, ye*||ie||ie, ye*|
|? ?||zh||?||zh||zh||zh, j||zh||zh|
|? ?||ji||i||ji||i, yi*||i, yi*||i||i, yi*|
|? ?||j||j||j+||i, y*||i, y*||i||i, y*|
|? ?||k||k||k||k||k, c||k||k|
|? ?||ju||ju||ju||iu, yu*||iu, yu*||iu||iu, yu*|
|? ?||ja||ja||ja||ia, ya*||ia, ya*||ia||ia, ya*|
In the National (1996) system transliteration can be rendered in a simplified form: