LLMpediaThe first transparent, open encyclopedia generated by LLMs

Rohingya language

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Rohingya people Hop 4
Expansion Funnel Raw 77 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted77
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
Rohingya language
NameRohingya
AltnameArakanese Bengali
StatesMyanmar, Bangladesh, Saudi Arabia, Malaysia, United Kingdom, United States
RegionRakhine State, Cox's Bazar, Karachi, Jeddah, Kuala Lumpur, London, New York
Speakers500,000–1,000,000 (est.)
FamilycolorIndo-European
Fam2Indo-Aryan
Fam3Eastern
Fam4Bengali–Assamese
ScriptHanifi Rohingya, Arabic script, Latin script, Burmese script (historical)
Iso3rhg

Rohingya language Rohingya is an Indo-Aryan language spoken primarily by the Rohingya people of Rakhine State and the Rohingya diaspora. It is related to Bengali language, Assamese language, and Chittagonian language and has been the subject of debates involving United Nations High Commissioner for Refugees, Amnesty International, and Human Rights Watch concerning its speakers' rights. The language functions as a marker of ethnic identity in contexts involving Myanmar, Bangladesh, Saudi Arabia, Malaysia, and migrant communities in United Kingdom and United States.

Classification and linguistic features

Rohingya belongs to the Eastern branch of the Indo-Aryan languages within the Indo-European language family, sharing features with Bengali language, Assamese language, Sylheti language, and Chittagonian language. Comparative work references typological frameworks developed in studies of Sino-Tibetan languages contact and substrate effects from Tibeto-Burman languages spoken in Rakhine State and Chittagong Division. Its alignment, argument structure, and morphosyntactic patterns have been examined alongside research on Hindustani language, Odia language, and Marathi language to situate Rohingya within South Asian grammatical continua. Phonological traits show influence from regional languages and borrowings traceable to contact with Arabic language, Persian language, English language, and Burmese language.

History and historical development

The language evolved in the coastal and lowland zones of what is now Rakhine State and adjacent areas of Chittagong Division through centuries of trade and migration involving Bengal Sultanate, Arakan Kingdom, and later British Raj administration. Historical contacts with Arab traders, Persianate elites, and Portuguese Empire agents left lexical strata similar to those documented in histories of Bengal Presidency and Burma Province. Colonial-era censuses and ethnolinguistic surveys conducted under the British Empire and institutions such as the India Office and Royal Asiatic Society recorded population movements that influenced dialect differentiation. Twentieth-century events including WWII campaigns in Burma Campaign (World War II), policies of the Union of Burma, and displacement during communal tensions connect the language's trajectory to regional political histories such as those involving Aung San, Ne Win, and international responses by UNESCO and International Criminal Court-adjacent advocacy.

Geographic distribution and speakers

Major concentrations of speakers are in Rakhine State (formerly Arakan) and refugee settlements in Cox's Bazar District, with diaspora populations in Karachi, Jeddah, Kuala Lumpur, London, New York City, Toronto, and Dhaka. Census and demographic studies by agencies like UNHCR and NGOs including Médecins Sans Frontières and International Organization for Migration highlight mobility linked to crises involving Rohingya conflict and displacement episodes documented alongside operations of Bangladesh Armed Forces and regional actors. Community organizations such as the Rohingya Refugee Council and advocacy groups in United Kingdom and United States foster language maintenance among second-generation migrants, with cultural transmission occurring in religious institutions like local masjid networks and community centers connected to diasporic ties with Mecca pilgrimages.

Phonology and writing systems

Phonemic inventory includes vowels and consonants paralleling those of Bengali language and Chittagonian language while showing phonetic influence from Burmese language and Arabic language loanwords. Tone-like intonational patterns sometimes mirror areal prosody observed in Southeast Asian languages studies. Multiple scripts are in use: the indigenous Hanifi Rohingya script developed by Mohammed Hanif (often rendered in community literature), adaptations of the Arabic alphabet in religious and cultural texts, Latin transliterations used by diaspora media and digital platforms, and occasional use of Burmese script in historical records. Script choice correlates with networks tied to institutions such as Darul Uloom seminaries, transnational publishers, and online outlets hosted by organizations in Malaysia and United Kingdom.

Grammar and vocabulary

Grammatical structure shows SOV order consistent with Indo-Aryan languages like Hindi language and Bengali language, with postpositional constructions and verb morphology that mark tense, aspect, and mood comparable to patterns in Assamese language and Sylheti language. Pronoun systems reflect person and politeness distinctions paralleling those analyzed in studies of Urdu language and Bengali registers. Lexicon includes indigenous Eastern Indic roots alongside stratified borrowings from Arabic language (religious vocabulary), Persian language (administrative and literary terms), English language (modern terminology), and Burmese language (administrative and environmental terms). Language contact phenomena produce code-switching patterns similar to diaspora repertoires documented for speakers of Gujarati language and Punjabi language in urban settings.

Sociolinguistic status and language policy

The language's status is contested in national and international arenas, intersecting with citizenship policies in Myanmar—including laws enacted during administrations of Ne Win and subsequent governments—and refugee responses coordinated by Bangladesh and agencies like UNHCR and IOM. Educational access and literacy initiatives are promoted by NGOs such as Save the Children and BRAC and religious education providers linked to Al-Azhar-affiliated networks and local madrasas. Media outlets, cultural organizations, and academic researchers at institutions like University of Oxford, University of Cambridge, Dhaka University, and SOAS University of London document language use while advocacy groups including Amnesty International and Human Rights Watch frame linguistic rights within human rights campaigns. Language revitalization efforts confront policy constraints and displacement dynamics, engaging transnational funding from foundations and partnerships with universities and NGOs to support orthography standardization, literacy programs, and digital corpora initiatives.

Category:Indo-Aryan languages