LLMpediaThe first transparent, open encyclopedia generated by LLMs

Arabic script

Generated by DeepSeek V3.2
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Arabic language Hop 4
Expansion Funnel Raw 111 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted111
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
Arabic script
Arabic script
Serg!o. · Public domain · source
NameArabic script
TypeAbjad
LanguagesArabic, Persian, Urdu, and many others
Time4th century CE to present
Fam1Egyptian hieroglyphs
Fam2Proto-Sinaitic script
Fam3Phoenician alphabet
Fam4Aramaic alphabet
Fam5Nabataean script
ChildrenN'Ko, Perso-Arabic, Ottoman Turkish alphabet, Kurdo-Arabic, Shahmukhi, Sindhi, Urdu alphabet, Jawi script, Pegon script, Xiao'erjing
Unicode[https://www.unicode.org/charts/PDF/U0600.pdf U+0600–U+06FF], [https://www.unicode.org/charts/PDF/U0750.pdf U+0750–U+077F], [https://www.unicode.org/charts/PDF/U08A0.pdf U+08A0–U+08FF], [https://www.unicode.org/charts/PDF/FB50.pdf U+FB50–U+FDFF], [https://www.unicode.org/charts/PDF/FE70.pdf U+FE70–U+FEFF]
Iso15924Arab

Arabic script is a writing system used for the Arabic language and has been widely adopted for numerous other languages across Asia and Africa. It is an abjad, primarily denoting consonants, and is written from right to left in a cursive style. The script's evolution from earlier Semitic writing systems and its subsequent adaptation for languages like Persian and Urdu have made it one of the most widely used scripts in the world, deeply intertwined with the spread of Islam and Islamic culture.

History and development

The script's origins are traced to the Nabataean script, an Aramaic-derived script used in the Nabataean Kingdom around Petra. By the 4th century CE, a distinct early form emerged, evidenced by inscriptions like the Zabad inscription and the Jabal Ramm inscription. Its definitive standardization is linked to the need to record the Qur'an following the death of the Prophet Muhammad, with early efforts during the reign of Uthman ibn Affan. Key developments included the introduction of diacritic marks by Abu al-Aswad al-Du'ali to preserve correct pronunciation, and the systematic dotting system pioneered by Al-Khalil ibn Ahmad al-Farahidi and Nasr ibn Asim to distinguish between similar letter shapes. The script flourished under the Abbasid Caliphate, with centers of learning in Baghdad and Kufa refining the Kufic and later Naskh styles.

Characteristics and structure

As an abjad, the script typically represents long vowels using the letters ا (*alif*), و (*waw*), and ي (*ya*), while short vowels are indicated by optional diacritical marks like fatḥah and kasrah. A defining feature is cursive writing, where most letters connect to adjacent ones, requiring up to four contextual forms: isolated, initial, medial, and final. The Arabic alphabet consists of 28 basic letters, with additional characters like پ (*pe*) and چ (*che*) created for non-Arabic languages. Core orthographic rules govern the writing of the definite article ال (*al-*) and the non-connecting behavior of certain letters such as د (*dal*). The script is also the basis for writing systems like the Persian alphabet and the Urdu alphabet.

Use in writing systems

Primarily, it is used for the Arabic language, serving as the official script in nations from Morocco to Oman. It is the liturgical script for Islam, used for the Qur'an, Hadith, and all classical religious texts. Beyond Arabic, it is the official script for Persian in Iran and Dari in Afghanistan, for Urdu in Pakistan, and for Kashmiri in Jammu and Kashmir. It has historically been used for administrative languages like Ottoman Turkish under the Ottoman Empire and for literary traditions such as Sindhi literature and Punjabi literature in the Shahmukhi variant. In regions like Southeast Asia, it appears in the Jawi script for Malay and the Pegon script for Javanese.

Adaptations for other languages

Adapting the abjad to languages with different phonetic inventories necessitated innovations. The Persian alphabet added letters like گ (*gaf*) for sounds not in Arabic, a model followed by languages like Urdu and Kurdish. In the Indian subcontinent, scripts such as Shahmukhi and the Sindhi alphabet incorporated extensive diacritical systems and new characters to represent Indo-Aryan phonemes. In Africa, adaptations include the Ajami script for languages like Hausa and Swahili, and the Wadaad's writing for Somali. In Central Asia, it was used for Uyghur before the adoption of the Uyghur Perso-Arabic script, and historically for Chagatai. Other significant adaptations are Xiao'erjing for Sinitic languages and the Mooré alphabet in Burkina Faso.

Typography and calligraphy

Islamic calligraphy is a major art form, with classical styles including the angular Kufic script, used in early Qur'an manuscripts and architecture like the Dome of the Rock, and the cursive Naskh, which became the standard for print. Ornate styles developed for decorative purposes, such as the flowing Thuluth on mosque decorations, the intricate Muhaqqaq, and the complex Nastaʿlīq, championed by Mir Ali Tabrizi and predominant in Persian and Urdu contexts. The Diwani script was developed in the courts of the Ottoman Empire, while Maghrebi script styles are distinctive to North Africa. Modern typography faces challenges in digital rendering due to the script's contextual letterforms, addressed by technologies like the Arabic Calligraphic Engine and fonts from designers such as Nadine Chahine.

Digital encoding and representation

The script is comprehensively supported in the Unicode standard, with primary ranges in the Arabic block and the Arabic Supplement block. Critical for proper display are the Arabic Presentation Forms blocks, which include ligatures for phrases like بسم الله الرحمن الرحيم. Digital implementation relies on complex text layout engines, such as Uniscribe on Microsoft Windows and Harfbuzz in open-source systems, to manage character joining. The International Organization for Standardization defines codes in ISO/IEC 8859-6 and ISO 15924, which assigns the code "Arab". Key encoding initiatives have been driven by organizations like the Unicode Consortium and companies including Microsoft and Apple, ensuring support across operating systems and the World Wide Web.