The term dialect (from Latin dialectus, dialectos, from the Ancient Greek word , diálektos, "discourse", from , diá, "through" and ?, lég?, "I speak") is used in two distinct ways to refer to two different types of linguistic phenomena:
Features that distinguish dialects from each other can be found in lexicon (vocabulary) and grammar, as well as in pronunciation (phonology, including prosody). Where the salient distinctions are only or mostly to be observed in pronunciation, the more specific term accent may be used instead of dialect. Differences that are largely concentrated in lexicon may be creoles in their own right. When lexical differences are mostly concentrated in the specialized vocabulary of a profession or other organization, they are jargons; differences in vocabulary that are deliberately cultivated to exclude outsiders or to serve as shibboleths are known as cryptolects (or "cant") and include slangs and argots. The particular speech patterns used by an individual are referred to as that person's idiolect. Dialects do not always correspond with a standard written system this is the case for most spoken dialects. For example, spoken dialects of the Arabic Language do not have their own writing system that is distinguishable from other dialects. However, these dialects are not always mutually intelligible from one another. For example, speakers of the Levantine Dialect of Arabic may have trouble understanding speakers of the Egyptian Dialect. This leads to some debate among scholars of the status of Arabic dialects as their own regionalects or their own separate languages. To classify subsets of language as dialects, linguists take into account linguistic distance.
A standard dialect (also known as a "standardized dialect" or "standard language") is a dialect that is supported by institutions. Such institutional support may include any or all of the following: government recognition or designation; formal presentation in schooling as the "correct" form of a language; informal monitoring and policing of everyday usage; published grammars, dictionaries, and textbooks that set forth a normative spoken and written form; and/or an extensive formal literature that employs that variety (prose, poetry, non-fiction, etc.). There may be multiple standard dialects associated with a single language. For example, Standard American English, Standard British English, Standard Canadian English, Standard Indian English, Standard Australian English, and Standard Philippine English may all be said to be standard dialects of the English language.
A nonstandard dialect, like a standard dialect, has a complete grammar and vocabulary, but is usually not the beneficiary of institutional support. Examples of a nonstandard English dialect are Southern American English, Western Australian English, New York English, New England English, Mid-Atlantic American or Philadelphia / Baltimore English, Scouse, Brummie, Cockney, and Tyke. The Dialect Test was designed by Joseph Wright to compare different English dialects with each other.
There is no universally accepted criterion for distinguishing two different languages from two dialects (i.e. varieties) of the same language. A number of rough measures exist, sometimes leading to contradictory results. The distinction (dichotomy) between dialect and language is therefore subjective (arbitrary) and depends upon the user's preferred frame of reference. For example, there has been discussion about whether or not the Limón Creole English should be considered "a kind" of English or a different language. This creole is spoken in the Caribbean coast of Costa Rica (Central America) by descendants of Jamaican people. The position that Costa Rican linguists support depends upon which university they represent. Another example is Scanian, which even, for a time, had its own ISO code.
An important criterion for categorizing varieties of language is linguistic distance, for a variety to be considered a dialect, the linguistic distance between the two varieties must be low. Linguistic distance between spoken or written forms of language increases as the differences between the forms are characterized For example, two languages with completely different syntactical structures would have a high linguistic distance, while a language with very few differences from another may be considered a dialect or a sibling of that language. Linguistic distance may be used to determine language families and language siblings. For example, languages with little linguistic distance, like Dutch and German, are considered siblings. Dutch and German are siblings in the West-Germanic language group. Some language siblings are closer to each other in terms of linguistic distance than to other linguistic siblings. French and Spanish, siblings in the Romance Branch of the Indo-European group, are closer to each other than they are to any of the languages of the West-Germanic group. When languages are close in terms of linguistic distance, they resemble one another, hence why dialects are not considered linguistically distant to their parent language.
One criterion, which is often considered to be purely linguistic, is that of mutual intelligibility: two varieties are said to be dialects of the same language if being a speaker of one variety confers sufficient knowledge to understand and be understood by a speaker of the other; otherwise, they are said to be different languages. However, this definition cannot consistently delimit languages in the case of a dialect continuum (or dialect chain), containing a sequence of varieties, each mutually intelligible with the next, but where widely separated varieties may not be mutually intelligible. Further problems with this criterion are that mutual intelligibility occurs in varying degrees, and that it is difficult to distinguish from prior familiarity with the other variety. Reported mutual intelligibility may also be affected by speakers' attitudes to the other speech community.
Another occasionally used criterion for discriminating dialects from languages is the sociolinguistic notion of linguistic authority. According to this definition, two varieties are considered dialects of the same language if (under at least some circumstances) they would defer to the same authority regarding some questions about their language. For instance, to learn the name of a new invention, or an obscure foreign species of plant, speakers of Westphalian and East Franconian German might each consult a German dictionary or ask a German-speaking expert in the subject. Thus these varieties are said to be dependent on, or heteronomous with respect to, Standard German, which is said to be autonomous. In contrast, speakers in the Netherlands of Low Saxon varieties similar to Westphalian would instead consult a dictionary of Standard Dutch. Similarly, although Yiddish is classified by linguists as a language in the Middle High German group of languages and has some degree of mutual intelligibility with German, a Yiddish speaker would consult a Yiddish dictionary rather than a German dictionary in such a case.
Within this framework, W. A. Stewart defined a language as an autonomous variety together with all the varieties that are heteronomous with respect to it, noting that an essentially equivalent definition had been stated by Charles A. Ferguson and John J. Gumperz in 1960. Similarly, a heteronomous variety may be considered a dialect of a language defined in this way. In these terms, Danish and Norwegian, though mutually intelligible to a large degree, are considered separate languages. In the framework of Heinz Kloss, these are described as languages by ausbau (development) rather than by abstand (separation).
In other situations, a closely related group of varieties possess considerable (though incomplete) mutual intelligibility, but none dominates the others. To describe this situation, the editors of the Handbook of African Languages introduced the term dialect cluster as a classificatory unit at the same level as a language. A similar situation, but with a greater degree of mutual unintelligibility, has been termed a language cluster.
In many societies, however, a particular dialect, often the sociolect of the elite class, comes to be identified as the "standard" or "proper" version of a language by those seeking to make a social distinction and is contrasted with other varieties. As a result of this, in some contexts, the term "dialect" refers specifically to varieties with low social status. In this secondary sense of "dialect", language varieties are often called dialects rather than languages:
The status of "language" is not solely determined by linguistic criteria, but it is also the result of a historical and political development. Romansh came to be a written language, and therefore it is recognized as a language, even though it is very close to the Lombardic alpine dialects. An opposite example is the case of Chinese, whose variations such as Mandarin and Cantonese are often called dialects and not languages in China, despite their mutual unintelligibility.
Modern nationalism, as developed especially since the French Revolution, has made the distinction between "language" and "dialect" an issue of great political importance. A group speaking a separate "language" is often seen as having a greater claim to being a separate "people", and thus to be more deserving of its own independent state, while a group speaking a "dialect" tends to be seen not as "a people" in its own right, but as a sub-group, part of a bigger people, which must content itself with regional autonomy. The distinction between language and dialect is thus inevitably made at least as much on a political basis as on a linguistic one, and can lead to great political controversy or even armed conflict.
The Yiddish linguist Max Weinreich published the expression, A shprakh iz a dialekt mit an armey un flot (" ? ": "A language is a dialect with an army and navy") in YIVO Bleter 25.1, 1945, p. 13. The significance of the political factors in any attempt at answering the question "what is a language?" is great enough to cast doubt on whether any strictly linguistic definition, without a socio-cultural approach, is possible. This is illustrated by the frequency with which the army-navy aphorism is cited.
By the definition most commonly used by linguists, any linguistic variety can be considered a "dialect" of some language--"everybody speaks a dialect". According to that interpretation, the criteria above merely serve to distinguish whether two varieties are dialects of the same language or dialects of different languages.
The terms "language" and "dialect" are not necessarily mutually exclusive, although it is often perceived to be. Thus there is nothing contradictory in the statement "the language of the Pennsylvania Dutch is a dialect of German".
There are various terms that linguists may use to avoid taking a position on whether the speech of a community is an independent language in its own right or a dialect of another language. Perhaps the most common is "variety"; "lect" is another. A more general term is "languoid", which does not distinguish between dialects, languages, and groups of languages, whether genealogically related or not.
John Lyons writes that "Many linguists [...] subsume differences of accent under differences of dialect." In general, accent refers to variations in pronunciation, while dialect also encompasses specific variations in grammar and vocabulary.
There are around three geographical zones in which Arabic is spoken (Jastrow 2002).
Zone I is categorized as the area in which Arabic was spoken before the rise of Islam, it is the Arabian Peninsula, excluding the areas where southern Arabian was spoken. Zone II is categorized as the areas to which Arabic speaking peoples moved as a result of the conquests of Islam. Included in zone II are the Levant, Egypt, North Africa, Iraq, and some parts of Iran. Zone III are the areas in which Arabic is spoken that are located outside the continuous Arabic Language area.
There is a large amount of documentation of the Arabic dialects of Zone II. Among these dialects are the Levant or Levantine Dialect. This includes Syrian dialect. Egyptian and Sudanese dialects are also widely spoken and studied.
When talking about the German language, the term German dialects is only used for the traditional regional varieties. That allows them to be distinguished from the regional varieties of modern standard German.
The German dialects show a wide spectrum of variation. Some of them are not mutually intelligible. German dialectology traditionally names the major dialect groups after Germanic tribes from which they were assumed to have descended.
The extent to which the dialects are spoken varies according to a number of factors: In Northern Germany, dialects are less common than in the South. In cities, dialects are less common than in the countryside. In a public environment, dialects are less common than in a familiar environment.
The situation in Switzerland and Liechtenstein is different from the rest of the German-speaking countries. The Swiss German dialects are the default everyday language in virtually every situation, whereas standard German is only spoken in education, partially in media, and with foreigners not possessing knowledge of Swiss German. Most Swiss German speakers perceive standard German to be a foreign language.
The Low German and Low Franconian varieties spoken in Germany are often counted among the German dialects. This reflects the modern situation where they are roofed by standard German. This is different from the situation in the Middle Ages when Low German had strong tendencies towards an ausbau language.
The Frisian languages spoken in Germany are excluded from the German dialects.
Italy is an often quoted example of a country where the second definition of the word "dialect" (dialetto) is most prevalent. Italy is in fact home to a vast array of separate languages, most of which lack mutual intelligibility with one another and have their own local varieties; twelve of them (Albanian, Catalan, German, Greek, Slovene, Croatian, French, Franco-Provençal, Friulian, Ladin, Occitan and Sardinian) underwent Italianization to a varying degree (ranging from the currently endangered state displayed by Sardinian and Southern Italian Greek to the vigorous promotion of Germanic Tyrolean), but have been officially recognized as minority languages (minoranze linguistiche storiche), in light of their distinctive historical development. Yet, most of the regional languages spoken across the peninsula are often colloquially referred to in non-linguistic circles as Italian dialetti, since most of them, including the prestigious Neapolitan, Sicilian and Venetian, have adopted vulgar Tuscan as their reference language since the Middle Ages. However, all these languages evolved from Vulgar Latin in parallel with Italian, long prior to the popular diffusion of the latter throughout what is now Italy.
During the Risorgimento, Italian still existed mainly as a literary language, and only 2.5% of Italy's population could speak Italian. Proponents of Italian nationalism, like the Lombard Alessandro Manzoni, stressed the importance of establishing a uniform national language in order to better create an Italian national identity. With the unification of Italy in the 1860s, Italian became the official national language of the new Italian state, while the other ones came to be institutionally regarded as "dialects" subordinate to Italian, and negatively associated with a lack of education.
In the early 20th century, the vast conscription of Italian men from all throughout Italy during World War I is credited with having facilitated the diffusion of Italian among the less educated conscripted soldiers, as these men, who had been speaking various regional languages up until then, found themselves forced to communicate with each other in a common tongue while serving in the Italian military. With the popular spread of Italian out of the intellectual circles, because of the mass-media and the establishment of public education, Italians from all regions were increasingly exposed to Italian. While dialect levelling has increased the number of Italian speakers and decreased the number of speakers of other languages native to Italy, Italians in different regions have developed variations of standard Italian specific to their region. These variations of standard Italian, known as "regional Italian", would thus more appropriately be called dialects in accordance with the first linguistic definition of the term, as they are in fact derived from Italian, with some degree of influence from the local or regional native languages and accents.
The most widely spoken languages of Italy, which are not to be confused with regional Italian, fall within a family of which even Italian is part, the Italo-Dalmatian group. This wide category includes:
Modern Italian is heavily based on the Florentine dialect of Tuscan. The Tuscan-based language that would eventually become modern Italian had been used in poetry and literature since at least the 12th century, and it first spread outside the Tuscan linguistic borders through the works of the so-called tre corone ("three crowns"): Dante Alighieri, Petrarch, and Giovanni Boccaccio. Florentine thus gradually rose to prominence as the volgare of the literate and upper class in Italy, and it spread throughout the peninsula and Sicily as the lingua franca among the Italian educated class as well as Italian travelling merchants. The economic prowess and cultural and artistic importance of Tuscany in the Late Middle Ages and the Renaissance further encouraged the diffusion of the Florentine-Tuscan Italian throughout Italy and among the educated and powerful, though local and regional languages remained the main languages of the common people.
Aside from the Italo-Dalmatian languages, the second most widespread family in Italy is the Gallo-Italic group, spanning throughout much of Northern Italy's languages and dialects (such as Piedmontese, Emilian-Romagnol, Ligurian, Lombard, Venetian, Sicily's and Basilicata's Gallo-Italic in southern Italy, etc.).
Finally, other languages from a number of different families follow the last two major groups: the Gallo-Romance languages (French, Occitan and its Vivaro-Alpine dialect, Franco-Provençal); the Rhaeto-Romance languages (Friulian and Ladin); the Ibero-Romance languages (Sardinia's Algherese); the Germanic Cimbrian, Southern Bavarian, Walser German and the Mòcheno language; the Albanian Arbëresh language; the Hellenic Griko language and Calabrian Greek; the Serbo-Croatian Slavomolisano dialect; and the various Slovene languages, including the Gail Valley dialect and Istrian dialect. The language indigenous to Sardinia, while being Romance in nature, is considered to be a specific linguistic family of its own, separate from the other Neo-Latin groups; it is often subdivided into the Centro-Southern and Centro-Northern dialects.
Though mostly mutually unintelligible, the exact degree to which all the Italian languages are mutually unintelligible varies, often correlating with geographical distance or geographical barriers between the languages; some regional Italian languages that are closer in geographical proximity to each other or closer to each other on the dialect continuum are more or less mutually intelligible. For instance, a speaker of purely Eastern Lombard, a language in Northern Italy's Lombardy region that includes the Bergamasque dialect, would have severely limited mutual intelligibility with a purely Italian speaker and would be nearly completely unintelligible to a Sicilian-speaking individual. Due to Eastern Lombard's status as a Gallo-Italic language, an Eastern Lombard speaker may, in fact, have more mutual intelligibility with an Occitan, Catalan, or French speaker than with an Italian or Sicilian speaker. Meanwhile, a Sicilian-speaking person would have a greater degree of mutual intelligibility with a speaker of the more closely related Neapolitan language, but far less mutual intelligibility with a person speaking Sicilian Gallo-Italic, a language that developed in isolated Lombard emigrant communities on the same island as the Sicilian language.
Today, the majority of Italian nationals are able to speak Italian, though many Italians still speak their regional language regularly or as their primary day-to-day language, especially at home with family or when communicating with Italians from the same town or region.
The classification of speech varieties as dialects or languages and their relationship to other varieties of speech can be controversial and the verdicts inconsistent. Serbo-Croatian illustrates this point. Serbo-Croatian has two major formal variants (Serbian and Croatian). Both are based on the Shtokavian dialect and therefore mutually intelligible with differences found mostly in their respective local vocabularies and minor grammatical differences. Certain dialects of Serbia (Torlakian) and Croatia (Kajkavian and Chakavian), however, are not mutually intelligible even though they are usually subsumed under Serbo-Croatian. How these dialects should be classified in relation to Shtokavian remains a matter of dispute.
Macedonian, although largely mutually intelligible with Bulgarian and certain dialects of Serbo-Croatian (Torlakian), is considered by Bulgarian linguists to be a Bulgarian dialect, in contrast with the contemporary international view and the view in North Macedonia, which regards it as a language in its own right. Nevertheless, before the establishment of a literary standard of Macedonian in 1944, in most sources in and out of Bulgaria before the Second World War, the southern Slavonic dialect continuum covering the area of today's North Macedonia were referred to as Bulgarian dialects (see Bulgarian language#Relationship to Macedonian). Sociolinguists agree that the question whether Macedonian is a dialect of Bulgarian or a language is a political one and cannot be resolved on a purely linguistic basis, because dialect continua do not allow for either/or judgments.
In Lebanon, a part of the Christian population considers "Lebanese" to be in some sense a distinct language from Arabic and not merely a dialect thereof. During the civil war, Christians often used Lebanese Arabic officially, and sporadically used the Latin script to write Lebanese, thus further distinguishing it from Arabic. All Lebanese laws are written in the standard literary form of Arabic, though parliamentary debate may be conducted in Lebanese Arabic.
In Tunisia, Algeria, and Morocco, the Darijas (spoken North African languages) are sometimes considered more different from other Arabic dialects. Officially, North African countries prefer to give preference to the Literary Arabic and conduct much of their political and religious life in it (adherence to Islam), and refrain from declaring each country's specific variety to be a separate language, because Literary Arabic is the liturgical language of Islam and the language of the Islamic sacred book, the Qur'an. Although, especially since the 1960s, the Darijas are occupying an increasing use and influence in the cultural life of these countries. Examples of cultural elements where Darijas' use became dominant include: theatre, film, music, television, advertisement, social media, folk-tale books and companies' names.
The Modern Ukrainian language has been in common use since the late 17th century, associated with the establishment of the Cossack Hetmanate. In the 19th century, the Tsarist Government of the Russian Empire claimed that Ukrainian (or Little Russian, per official name) was merely a dialect of Russian (or Polonized dialect) and not a language on its own (same concept as for Belarusian language). That concepted was enrooted soon after the partitions of Poland. According to these claims, the differences were few and caused by the conquest of western Ukraine by the Polish-Lithuanian Commonwealth. However, in reality the dialects in Ukraine were developing independently from the dialects in the modern Russia for several centuries, and as a result they differed substantially.
Following the Spring of Nations in Europe and efforts of the Brotherhood of Saints Cyril and Methodius, across the so called "Southwestern Krai" of Russian Empire started to spread cultural societies of Hromada and their Sunday schools. Themselves "hromadas" acted in same manner as Orthodox fraternities of Polish-Lithuanian Commonwealth back in 15th century. Around that time in Ukraine becoming popular political movements Narodnichestvo (Narodniks) and Khlopomanstvo.
There have been cases of a variety of speech being deliberately reclassified to serve political purposes. One example is Moldovan. In 1996, the Moldovan parliament, citing fears of "Romanian expansionism", rejected a proposal from President Mircea Snegur to change the name of the language to Romanian, and in 2003 a Moldovan-Romanian dictionary was published, purporting to show that the two countries speak different languages. Linguists of the Romanian Academy reacted by declaring that all the Moldovan words were also Romanian words; while in Moldova, the head of the Academy of Sciences of Moldova, Ion B?rbu, described the dictionary as a politically motivated "absurdity".
Unlike languages that use alphabets to indicate their pronunciation, Chinese characters have developed from logograms that do not always give hints to their pronunciation. Although the written characters have remained relatively consistent for the last two thousand years, the pronunciation and grammar in different regions have developed to an extent that the varieties of the spoken language are often mutually unintelligible. As a series of migration to the south throughout the history, the regional languages of the south, including Gan, Xiang, Wu, Min, Yue and Hakka often show traces of Old Chinese or Middle Chinese. From the Ming dynasty onward, Beijing has been the capital of China and the dialect spoken in Beijing has had the most prestige among other varieties. With the founding of the Republic of China, Standard Mandarin was designated as the official language, based on the spoken language of Beijing. Since then, other spoken varieties are regarded as fangyan (regional speech). Cantonese is still the most commonly-used language in Guangzhou, Hong Kong, Macau and among some overseas Chinese communities, whereas Hokkien has been accepted in Taiwan as an important local language alongside Mandarin.
One language, Interlingua, was developed so that the languages of Western civilization would act as its dialects. Drawing from such concepts as the international scientific vocabulary and Standard Average European, linguists[who?] developed a theory that the modern Western languages were actually dialects of a hidden or latent language. Researchers at the International Auxiliary Language Association extracted words and affixes that they considered to be part of Interlingua's vocabulary. In theory, speakers of the Western languages would understand written or spoken Interlingua immediately, without prior study, since their own languages were its dialects. This has often turned out to be true, especially, but not solely, for speakers of the Romance languages and educated speakers of English. Interlingua has also been found to assist in the learning of other languages. In one study, Swedish high school students learning Interlingua were able to translate passages from Spanish, Portuguese, and Italian that students of those languages found too difficult to understand. The vocabulary of Interlingua extends beyond the Western language families.
language standard dialect.CS1 maint: ref=harv (link)
Similarly, Bulgarian politicians often argue that Macedonian is simply a dialect of Bulgarian - which is really a way of saying, of course, that they feel Macedonia ought to be part of Bulgaria. From a purely linguistic point of view, however, such arguments are not resolvable, since dialect continua admit of more-or-less but not either-or judgements.
Sociolinguists agree that in such situations the decision as to whether a particular variety of speech constitutes a language or a dialect is always based on political, rather than linguistic criteria (Trudgill 1974:15). A language, in other words, can be defined "as a dialect with an army and a navy" (Nash 1989:6).