|Native to||Yunnan, China|
|1.3 million (2003)|
The Bai language (Bai: Baip?ngvp?zix; simplified Chinese: ; traditional Chinese: ; pinyin: ) is a language spoken in China, primarily in Yunnan province, by the Bai people. The language has over a million speakers and is divided into three or four main dialects. Bai syllables are always open, with a rich set of vowels and eight tones. The tones are divided into two groups with modal and non-modal (tense, harsh or breathy) phonation. There is a small amount of traditional literature written with Chinese characters, Bowen (), as well as a number of recent publications printed with a recently standardized system of romanisation using the Latin alphabet.
The origins of Bai have been obscured by intensive Chinese influence of an extended period. Different scholars have proposed that it is an early offshoot or sister language of Chinese, part of the Loloish branch or a separate group within the Sino-Tibetan family.
Xu and Zhao (1984) divided Bai into three dialects, which may actually be distinct languages: Jianchuan (Central), Dali (Southern), and Bijiang (Northern). Bijiang County has since been renamed as Lushui County. Jianchuan and Dali are closely related, and speakers are reported to be able to understand one another after living together for a month.
The more divergent Northern dialects are spoken by about 15,000 Laemae (lm, Lemei, Lama), a clan numbering about 50,000 people who are partly submerged within the Lisu. They are now designated as two languages by ISO 639-3:
Wang Feng (2012) provides the following classification for 9 Bai dialects:
The affiliation of Bai is obscured by over two millennia of influence from varieties of Chinese, leaving most of its lexicon related to Chinese etyma of various periods. To determine its origin, researchers must first identify and remove from consideration the various layers of loanwords, and then examine the residue. In his survey of the field, Wang notes that early work was hampered by a lack of data on Bai and uncertainties in the reconstruction of early forms of Chinese. Recent authors have suggested that Bai is an early offshoot from Chinese, a sister language to Chinese, or more distantly related (though usually still Sino-Tibetan).
Some of these changes date back to the first centuries AD.
The oldest layer of Bai vocabulary with Chinese cognates, of which Wang lists some 250 words, includes common Bai words that were also common in Classical Chinese, but are not used in modern varieties of Chinese. Its features have been compared with current ideas on Old Chinese phonology:
Sergei Starostin suggests that these facts indicate a split from mainstream Chinese around the 2nd century BC, corresponding to the Western Han period. Wang argues that a few of the correspondences between his reconstructed Proto-Bai and Old Chinese cannot be explained by the Old Chinese forms, and that Chinese and Bai therefore form a Sino-Bai group. However, Gong suggests that at least some of these cases can be accounted for by refining the Proto-Bai reconstruction to take account of complementary distribution within Bai.
Starostin and Zhengzhang Shangfang have separately argued that the oldest Chinese layer accounts for all but an insignificant residue of Bai vocabulary, and that Bai is therefore an early branch of Chinese.
On the other hand, Lee and Sagart (1998) argued that the various layers of Chinese vocabulary are loans, and that when they are removed, a significant non-Chinese residue remains, including 15 entries from the 100-word Swadesh list of basic vocabulary. They suggest that this residue shows similarities with Proto-Loloish.James Matisoff (2001) argued that the comparison with Loloish is less persuasive when considering other Bai varieties than the Jianchuan dialect used by Lee and Sagart, and that it is safer to consider Bai as an independent branch of Sino-Tibetan, though perhaps close to the neighbouring Loloish. Lee and Sagart (2008) refined their analysis, presenting the residue as a non-Chinese form of Sino-Tibetan, though not necessarily Loloish. They also note that this residue includes the Bai vocabulary relating to pig rearing and rice agriculture.
Lee and Sagart's analysis has been further discussed by List (2009). Gong (2015) suggests that the residual layer may be Qiangic, pointing out that the Bai, like the Qiang, call themselves "white", whereas the Lolo use "black".
The Jianchuan dialect has the following consonants, all of which are restricted to syllable-initial position:
The Gongxing and Tuolou dialects retain an older 3-way distinction for stop and affricate initials between voiceless unaspirated, voiceless aspirated and voiced. In the core eastern group, including the standard form of Dali, the voiced initials have become voiceless unaspirated, while other dialects show partial loss of voicing, conditioned by tone in different ways. Some varieties also have an additional uvular nasal [?] that contrasts phonemically with [?].
Jianchuan finals comprise:
All but u, ?o and i?o have contrasting nasalized variants. Dali Bai lacks nasal vowels. Some other varieties retain nasal codas instead of nasalization, though only the Gongxing and Tuolou dialects have a contrast between -n and -?.
The old Bai script used modified Chinese characters, but its use was limited. A new script based on the Latin alphabet was designed in 1958, based on the speech of the urban centre of Xiaguan, even though it was not a typical Southern dialect. The idea of romanization was controversial among Bai elites, and the system saw little use. In a renewed attempt in 1982, language planners used the Jianchuan dialect as a base, because it represented an area with a significant population, almost all of whom spoke Bai. The new script was popular in the Jianchuan area, but was rejected in the more economically advanced area of Dali, which also had the largest number of speakers, albeit living alongside a large number of speakers of Chinese. The script was revised extensively in 1993 to define two variants, representing Jianchuan and Dali respectively, and has since been more widely used.
|Stop||unaspirated||b [p]||d [t]||g [k]|
|aspirated||p [p?]||t [t?]||k [k?]|
|Nasal||m [m]||n [n]||ni [?]||ng [?]|
|Affricate||unaspirated||z [ts]||zh ||j [t?]|
|aspirated||c [ts?]||ch ||q [t]|
|Fricative||voiceless||f [f]||s [s]||sh [?]||x [?]||h [x]|
|voiced||v [v]||ss [z]||r [?]||hh [?]|
|Lateral and semivowel||l [l]||y [j]|
|i [i]||ei [e]||ai/er [?]/||a [?]||ao [?]||o [o]||ou [ou]||u [u]||e [?]||v [v?]|
|iai/ier [i?]/[i]||ia [i?]||iao [iao]||io [io]||iou [iou]||ie [i?]|
|u [ui]||uai/uer [u?]/[u]||ua [u?]||uo [uo]|
The 1993 revision introduced variants ai/er etc, with the former to be used for Jianchuan Bai and the latter for Dali Bai. In Jianchuan, all vowels but ao, iao, uo, ou and iou have nasalized counterparts, denoted by a suffixed n. Dali Bai lacks nasalized vowels.
|Pitch contour and phonation||1982 spelling||1993 spelling||Notes|
|high level (55), modal||-l||-l|
|mid level (33), modal||-x||-x|
|mid falling (31), breathy||-t||-t|
|mid rising (35), modal||-f||-f|
|mid-low falling (21), harsh||(unmarked)||-d|
|high level (55), tense||-rl||-b||Jianchuan only|
|mid-high level (44), tense||-rx||(unmarked)|
|mid-high falling (42), tense||-rt||-p|
|mid falling (32), modal||-p/-z||distinguished in Dali only|