Generated by GPT-5-mini| Romani language | |
|---|---|
| Name | Romani |
| Altname | Romany |
| Nativename | Romani chib |
| States | India, Pakistan, Turkey, Greece, Spain, Romania, Bulgaria, Hungary, Slovakia, Czech Republic, Serbia, Croatia, Bosnia and Herzegovina, North Macedonia, Albania, Montenegro, Moldova, Ukraine, Russia, Poland, Lithuania, Latvia, Estonia, Germany, France, United Kingdom, Ireland, United States, Canada, Brazil, Argentina, Australia |
| Region | Europe, Americas, Australasia, South Asia, Middle East |
| Speakers | Several hundred thousand to several million (estimates vary) |
| Familycolor | Indo-European |
| Fam2 | Indo-Iranian |
| Fam3 | Indo-Aryan |
| Fam4 | Central Zone (Hindi) |
| Iso1 | rmn |
| Glotto | roma1323 |
Romani language is an Indo-Aryan language historically associated with the Romani people and dispersed widely across Europe, the Americas, and Australasia. It shares deep genealogical ties with languages of the Indian subcontinent and has accrued extensive contact influence from languages such as Greek, Bulgarian, Turkish, German, Spanish and Romanian. Its speakers participate in diverse national contexts including those of United Kingdom, France, Germany, Hungary, Romania, Bulgaria and the United States.
Romani is classified within the Indo-European languages family, specifically the Indo-Iranian languages branch and the Indo-Aryan languages subgroup, showing affinities to Hindi, Punjabi, Marathi and Sanskrit. Comparative lexicon and phonological features link Romani to Central and Northwestern Indian varieties encountered in regions such as Rajasthan and Punjab, and historical migration narratives intersect with medieval contacts involving the Byzantine Empire, Seljuk Empire, and later the Ottoman Empire. Genetic studies correlate with migrations across Central Asia, the Caucasus, and into Europe during the first millennium CE, paralleling linguistic divergence documented by scholars associated with institutions like the Max Planck Institute for the Science of Human History and universities including Cambridge University and University of Oxford.
Romani is a dialect continuum traditionally divided into major groups—often labeled after diaspora subgroups such as the Sinti, Kalderash, Lovari, Yenish-adjacent communities, and regional clusters in Spain and the Balkans. Dialect classification draws on features shared with host languages like Greek in the Greece and Turkish in Turkey, and reflects political borders of states such as Hungary, Slovakia, Czech Republic, Poland, and Ukraine. Fieldwork by scholars at institutions such as the School of Oriental and African Studies and the University of Manchester highlights lexical, morphological and phonological splits among Northern, Balkan, Vlax, and Southern groups.
Romani phonology preserves retroflex consonants and aspirated stops reminiscent of Sanskrit-derived patterns while also exhibiting palatalization influenced by languages like Slovak and Czech. Grammatical structure shows nominal cases, verbal tense/aspect systems, and postpositional constructions comparable to Indo-Aryan languages; many dialects maintain ergative alignments and rich inflectional paradigms paralleling features studied in comparative Indo-European research. Structural analyses appear in literature associated with scholars at University of Cambridge and Harvard University and are discussed in proceedings of associations such as the Linguistic Society of America.
Romani core vocabulary retains items cognate with Sanskrit, Prakrit and modern Indic tongues like Gujarati and Marathi, while exhibiting major borrowings from Persian and Arabic via early contacts. Subsequent centuries of settlement introduced lexical strata from Greek in the Byzantine Empire, Turkish during Ottoman rule, Slavic languages such as Bulgarian and Serbian in the Balkans, Romance languages like Romanian and Spanish in Southeastern and Western Europe, and German across Central Europe. Loanwords from institutions such as the Catholic Church and concepts tied to legal regimes in states like France and Austria appear in regional registers.
Romani has no single standardized orthography; community practices employ scripts native to national contexts, including the Latin script, Greek alphabet, and Cyrillic script depending on countries such as Spain, Greece, Russia and Serbia. Orthographic proposals have been developed by organizations like the European Roma Rights Centre and academics at Central European University and University of Southampton, aiming to reconcile dialectal variation and to enable education publishing, media production, and legal recognition in parliaments such as those of Romania and Hungary.
Sociolinguistic conditions vary widely: in some regions Romani is threatened by language shift to national languages like English, French and Spanish; in others it remains actively transmitted within communities such as those in parts of Romania and Bulgaria. Discrimination, policies of assimilation enacted historically in states including Czechoslovakia and Yugoslavia and contemporary debates in bodies like the European Union affect intergenerational transmission. Documentation efforts by NGOs such as Amnesty International and research programs funded by the European Commission address language rights, sociolinguistic vitality, and recognition in education and media.
Revitalization initiatives include community-led schooling, bilingual materials, cultural festivals, and digital resources produced by organizations such as the Roma Education Fund, collaborations with universities like University College London, and projects supported by agencies including the Council of Europe. Curriculum development, teacher training, and standardized orthography proposals aim to integrate Romani into formal education systems in countries such as Hungary, Spain, Portugal, and Sweden while NGOs and activists advocate for legal protections under frameworks like the European Charter for Regional or Minority Languages and national minority laws.