LLMpediaThe first transparent, open encyclopedia generated by LLMs

Cedilla

Generated by DeepSeek V3.2
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Portuguese language Hop 4
Expansion Funnel Raw 93 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted93
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
Cedilla
Mark◌̧
NameCedilla
UnicodeU+0327

Cedilla. A cedilla is a hooked diacritical mark placed under certain letters, most notably the letter C, to modify its pronunciation. Originating from a diminutive form of the letter Z in Medieval Spanish, it is used in several modern orthographies to indicate a sibilant sound where a hard sound would otherwise be expected. The mark is a fundamental component of the alphabets of languages such as French, Portuguese, and Catalan, and also appears in some transcription systems and Latin-script alphabets.

Etymology and history

The term "cedilla" is derived from the Spanish name for the letter Z, *zeda* or *ceta*, with the diminutive suffix *-illa*, thus meaning "little Z". Its earliest form appeared in Visigothic script during the Middle Ages in the Iberian Peninsula, where scribes wrote a small Z beneath the letter C to denote the /ts/ sound that had evolved from Latin in Old Spanish. This practice was later adopted and standardized by the French printer Geoffroy Tory in the 16th century, influencing its spread across Europe. The modern hooked form became firmly established through the typographical work of Claude Garamond and the Académie française, which codified its use in French orthography.

Usage in various languages

In French, the cedilla is used exclusively with the letter C (making Ç) to indicate a /s/ sound before the vowels A, O, and U, as in the word *français*. In Portuguese, it appears under C to denote a /s/ sound before A, O, and U, as in *açor*, and is a required feature in the Portuguese Language Orthographic Agreement of 1990. The Catalan orthography employs it similarly, as in *Barça*, a common abbreviation for FC Barcelona. It is also used in the Turkish and Azerbaijani alphabets, under both C and S, though these represent distinct phonemes from the Latin-script traditions of Western Europe. Some transcription systems, such as those for Indigenous American languages developed by the Summer Institute of Linguistics, also utilize the cedilla.

Typographical form and encoding

The standard typographical form of the cedilla is a small, downward-curving hook attached to the bottom of a letter, notably distinguishable from the comma. In digital text encoding, the Unicode standard assigns the combining cedilla the code point U+0327. Precomposed characters are also available, such as Ç (U+00C7 for uppercase and U+00E7 for lowercase). Historically, some typefaces and digital fonts, particularly those influenced by Central European traditions, have rendered the mark as a comma below, a distinction addressed in Unicode with separate code points like U+0326. The International Organization for Standardization includes the cedilla in standards such as ISO/IEC 8859-1.

The cedilla is part of a family of diacritical marks that modify letter pronunciation. A closely related mark is the ogonek, used in languages like Lithuanian and some First Nations languages in Canada, which curves to the right. The comma below, used in Romanian under S and T (as in Ș and Ț), is often typographically similar but semantically distinct. Other under-diacritics include the dot below found in Vietnamese and the caron or háček, which can appear above or below letters in various Latin-script alphabets. The International Phonetic Alphabet employs a subscript cedilla to denote nasalization or pharyngealization.

Keyboard input and technical considerations

On most keyboards for Latin-script alphabets, the cedilla can be input using specific key combinations. In Microsoft Windows, holding the Alt key and typing 0199 or 0231 produces Ç or ç. On Apple macOS systems, it is typically typed by pressing Option+C. The compose key sequence on Linux systems often involves Compose + , + C. In HTML and XML, the characters can be represented with the numeric character references `Ç` and `ç` or the named entities `Ç` and `ç`. Proper rendering depends on font support, which is nearly universal in modern operating systems and web browsers due to widespread adoption of Unicode fonts like Arial and Times New Roman.

Category:Diacritics Category:Orthography Category:Typography