LLMpediaThe first transparent, open encyclopedia generated by LLMs

Chinese language

Generated by DeepSeek V3.2
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: United Nations Hop 3
Expansion Funnel Raw 75 → Dedup 40 → NER 17 → Enqueued 16
1. Extracted75
2. After dedup40 (None)
3. After NER17 (None)
Rejected: 23 (not NE: 23)
4. Enqueued16 (None)
Similarity rejected: 1
Chinese language
NameChinese
Nativename中文, 汉语
FamilycolorSino-Tibetan
StatesChina, Taiwan, Singapore, Malaysia, and other regions of the Chinese diaspora
Speakers~1.3 billion
Fam1Sino-Tibetan
Fam2Sinitic
ScriptChinese characters
Iso1zh
Iso2chi (B) / zho (T)
Iso3zho
Lingua79-AAA
Glottosini1245
GlottorefnameSinitic

Chinese language. It is a group of language varieties that form the Sinitic branch of the Sino-Tibetan language family, spoken predominantly across China, Taiwan, Singapore, and Malaysia. The standardized form, known as Standard Chinese or Mandarin Chinese, is the official language of the People's Republic of China and the Republic of China and one of the four official languages of Singapore. With a history spanning millennia, it is the language with the most native speakers in the world and possesses one of the oldest continuously used writing systems.

History

The earliest attested forms of the language are found in oracle bone inscriptions from the Shang dynasty, used for divination at sites like Yinxu. The classical literary language, known as Classical Chinese, was standardized during the late Zhou dynasty and served as the written lingua franca for administration and literature throughout imperial history, including during the Qin dynasty, Han dynasty, and Tang dynasty. Significant phonological changes occurred between Old Chinese and Middle Chinese, the latter reconstructed from works like the Qieyun rime dictionary. The development of modern vernacular standards was heavily influenced by the Baihua movement and later promoted by figures like Hu Shih and the government of the Republic of China, with Putonghua established as the national standard after the founding of the People's Republic of China.

Classification and varieties

Chinese is not a single uniform language but a macrolanguage comprising many mutually unintelligible varieties, often termed dialects. These are traditionally classified into seven to ten major groups: Mandarin Chinese, which includes Beijing-based Standard Chinese; Wu Chinese, spoken in Shanghai and Zhejiang; Yue Chinese, which includes Cantonese centered on Guangzhou and Hong Kong; Min Chinese, with major branches like Southern Min (including Hokkien and Taiwanese Hokkien) and Eastern Min; Xiang Chinese; Hakka Chinese; and Gan Chinese. Other notable varieties include Jin Chinese, Huizhou Chinese, and Pinghua. The International Organization for Standardization classifies these as top-level languages under the umbrella code "zho".

Phonology

The phonological systems of Chinese varieties are diverse but share tonal and monosyllabic characteristics. Standard Chinese phonology is based on the Beijing dialect and features four primary lexical tones and a neutral tone. Its syllable structure is relatively simple, allowing initials, finals, and a tone, with a typical inventory of around 21 consonants and 16 vowels. In contrast, Cantonese retains a more complex system with six to nine lexical tones and preserves final consonants like -p, -t, and -k from Middle Chinese. Historical sound changes are studied through the framework of rime tables and the comparative method, with significant resources including the Guangyun and other medieval dictionaries.

Grammar

Chinese is broadly characterized as an analytic language with minimal inflection, relying heavily on word order and function words to express grammatical relationships. The basic word order is subject–verb–object (SVO). It features a topic-comment structure and makes extensive use of serial verb constructions. Grammatical functions such as aspect, mood, and possession are indicated by particles like 了, 过, 的, and 吗. Classical Chinese grammar differs significantly, being more highly synthetic and concise. Modern grammar was formally described and influenced by Western linguistics through the work of scholars like Yuen Ren Chao and the publications of the Academia Sinica.

Writing system

The language is written using Chinese characters, logograms that represent morphemes and syllables. The earliest complete texts are found on bronzeware and oracle bones from the Shang dynasty. The script was unified under the Qin dynasty's Li Si, leading to the Clerical script and later the Regular script used today. Two modern standardized forms exist: Simplified Chinese characters, promulgated by the People's Republic of China after the Chinese Civil War, and Traditional Chinese characters, retained in Taiwan, Hong Kong, and Macau. The system is supplemented by phonetic notations like Zhuyin (Bopomofo) used in Taiwan and the Hanyu Pinyin romanization system developed by the People's Republic of China.

Sociolinguistics

The status and use of different varieties are shaped by complex sociopolitical factors. Standard Chinese serves as the medium of instruction in the education system and official communication within the People's Republic of China and Singapore. Other varieties, such as Cantonese, maintain strong cultural and media presence in regions like Guangdong, Hong Kong, and overseas communities in Chinatowns worldwide. Language policy has been a central issue, from the May Fourth Movement's advocacy of Baihua to contemporary efforts in Taiwan and among the Chinese diaspora. The global influence of China has spurred the growth of Confucius Institutes and increased the study of the language internationally.