LLMpediaThe first transparent, open encyclopedia generated by LLMs

Persian language

Generated by DeepSeek V3.2
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Iran Hop 4
Expansion Funnel Raw 103 → Dedup 33 → NER 8 → Enqueued 7
1. Extracted103
2. After dedup33 (None)
3. After NER8 (None)
Rejected: 25 (not NE: 25)
4. Enqueued7 (None)
Similarity rejected: 1
Persian language
NamePersian
Nativenameفارسی
Pronunciation[fɒːɾˈsiː]
StatesIran, Afghanistan, Tajikistan
RegionWestern Asia, Central Asia
EthnicityPersians, Tajiks, Hazaras, Aimaq people
Speakers~80 million
FamilycolorIndo-European
Fam2Indo-Iranian
Fam3Iranian
Fam4Western Iranian
Fam5Southwestern Iranian
ScriptPersian alphabet (Arabic script),, Cyrillic script (in Tajikistan),, Persian Braille
NationIran, Afghanistan (as Dari), Tajikistan (as Tajik)
Iso1fa
Iso2per (B) / fas (T)
Iso3fas
Glottofars1254
GlottorefnamePersian
Lingua58-AAC (Wider Persian) , > 58-AAC-c (Central Persian)

Persian language. It is an Indo-Iranian tongue belonging to the larger Indo-European family and holds official status in Iran, Afghanistan, and Tajikistan. With a rich literary history spanning over a millennium, it has served as a prestigious lingua franca across South Asia, Central Asia, and the Middle East. Its modern standard forms are known as Farsi in Iran, Dari in Afghanistan, and Tajik in Tajikistan.

History

The history is categorized into three major stages: Old Persian, Middle Persian, and New Persian. Old Persian is attested primarily in the cuneiform inscriptions of the Achaemenid Empire, such as those at Behistun Inscription commissioned by Darius the Great. Middle Persian, or Pahlavi, emerged during the era of the Sasanian Empire and was used for Zoroastrian religious texts like the Bundahishn. The transition to New Persian began after the Muslim conquest of Persia, heavily influenced by Arabic vocabulary and the Arabic script, and flourished under Samanid patronage with early masters like Rudaki. The classical period produced monumental figures such as Ferdowsi (author of the Shahnameh), Hafez, Saadi, Rumi, and Omar Khayyam, whose works profoundly influenced cultures from the Ottoman Empire to Mughal Empire.

Geographic distribution

It is the official language of Iran, where it is the primary tongue for the majority, and co-official in Afghanistan (as Dari) and Tajikistan (as Tajik). Significant native communities exist in Uzbekistan, Bahrain, Iraq, and Pakistan, as well as through diaspora populations in North America, Western Europe, and Israel. Historically, it served as an administrative and literary language across vast regions, including the Indian subcontinent under the Delhi Sultanate and Mughal Empire, and in the Caucasus during periods of Safavid and Qajar rule. Its influence is evident in numerous Turkic languages like Turkish and Uzbek, as well as in Indo-Aryan languages such as Urdu and Punjabi.

Classification and dialects

As a member of the Indo-European family, it falls within the Iranian branch of the Indo-Iranian group. Its primary dialect continuum consists of Iranian Persian (Farsi), Afghan Persian (Dari), and Tajik Persian (Tajik), which are largely mutually intelligible. Regional dialects within Iran include Tehrani, Shirazi, Isfahani, and Mashhadi, while notable varieties in Afghanistan encompass Kabuli and Herati Dari. Other related but distinct Iranian languages include Kurdish, Balochi, Pashto, and the now-extinct Sogdian.

Phonology

The sound inventory typically includes six vowels and 23 consonants, though systems vary between standards, such as the distinction maintained in Dari and Tajik. Notable features include the presence of uvular stops like /q/ (ق) and the voiced velar fricative /ɣ/ (غ). Stress is generally dynamic and falls on the final syllable of a word stem. Tajik phonology, influenced by Russian and neighboring Turkic languages, exhibits additional vowel distinctions and uses the Cyrillic script. Historical sound changes from Middle Persian include the loss of certain initial consonants and the development of the ezāfe construction.

Grammar

It is a predominantly SOV (subject-object-verb) language with a relatively simple inflectional system, lacking grammatical gender and definite articles. Nouns are not marked for case but use a construct phrase (ezāfe) to indicate relationships. Verb conjugation involves marking for tense, aspect, and mood, utilizing prefixes and suffixes, with a notable feature being the use of light verbs combined with nouns or adjectives. The language employs a distinction in formality through pronouns and verb forms, and makes extensive use of subjunctive and imperative moods.

Vocabulary

The core lexicon derives from Middle Persian, but it contains a substantial proportion of loanwords from Arabic, especially in fields like philosophy, science, and jurisprudence. It has also contributed many words to languages like Urdu, Turkish, Armenian, and Georgian, and borrowed from French, Russian, and English in the modern era. Classical poetry, particularly the works of Hafez and Saadi, remains a foundational source for proverbs and eloquent expressions. The Academy of Persian Language and Literature actively works to coin new terms from native roots, promoting words over foreign equivalents.

Writing system

In Iran and Afghanistan, it is written in a modified Arabic script known as the Persian alphabet, which is cursive and written from right to left, with additional letters for sounds not found in Arabic. Since the 1940s, Tajik has been written in an extended Cyrillic Cyrillie (a Cyrillicicic scriptic scripticic script), a scriptic scriptic scriptic scriptic script a Persian a script a scripticic script a Persian a scripticic scriptic a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a script a a a a a a script a script a a a a script a script a script a script a script a script a script a script a a script a script a a script a script a script a script a script a script a script a a a a a a a script a a a a a a script a a a a a a a a a and Cyrill a a a a a a a a a a a a a a a a a a and Cyr a a a a a a a a a a a a a a a a a a a a script a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a and Cyr a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a and C a a a a a a a a a a a a a a a a a a a a a alexander a a a a a a a a a a a a a a a a] a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a andC a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a]