Cedilla


The cedilla is a diacritic of the Latin alphabet. In French, it is used only under the letter C, both in uppercase and lowercase forms: Ç, ç.
It is also used in several other languages under different letters. It bears a visual resemblance to the numeral 5 with its upper stroke removed.
Historically, the Spanish cedilla was placed only under a c derived, among other possibilities, from a Latin c that had undergone palatalization. It then formed the letter ç, originally pronounced /ts/ and later /s/.

History

The modern grapheme of the cedilla derives from medieval Gothic script or Visigothic script. The use of this sign arose from the limitations of the Latin alphabet. Its name comes from Spanish and appeared in the 17th century, meaning "little z".
Under the letter c, the handwritten cedilla developed through three successive forms: a diacritic z, then a z with a cedilla, and finally the modern c with cedilla. By contrast, the evolution of the e caudata is considered unrelated to that of the cedilla.

In manuscripts

''C with cedilla''

The palatal phoneme /ts/ of the Romance languages derives from a Latin /k/ c that was palatalized and then assibilated. Before vowels that would otherwise trigger a non-palatalized pronunciation, scribes used various spellings to indicate the "new" pronunciation: simply c, ce, or cz.
Thus ceo and czo were read /tso/: the diacritic e and z prevented the reading /ko/.
This latter notation appears in French as early as the first literary manuscript in the French language, the Sequence of Saint Eulalia, where it occurs only once, in verse 21.
According to Greimas, the neuter demonstrative ço appears in the Sequence of Saint Eulalia.
However, Greimas gives only the form ço for this text, although the manuscript, rediscovered in the 19th century, contains no cedilla in any of its 29 verses. Moreover, the manuscript dates to 881 rather than broadly to the "10th century".
By contrast, analyzes the form czo as follows:
The z in czo is thus interpreted as a diacritic z which, once placed beneath the c, would become the cedilla.
The Visigothic script is indeed thought to have abbreviated this grapheme around the 11th century in Spain. Initially, the c was written above the z in its form ʒ; later, the c regained its full size while was reduced to a subscript sign. Thus, the Spanish word lanʒa /lantsa/ came to be written lança. The usefulness of such a sign, and an early attempt to systematize the notation of /ts/, led to the extension of the cedilla before the vowels i and e. This was later regarded as a form of hypercorrection, since c alone was sufficient.
Maria Selig confirms this Visigothic origin:
Selig also notes that the diacritic spread across Europe more slowly than its phonetic value, and was in some cases "reappropriated" by different languages to represent sounds unrelated to its original function.
In French, according to Jean Dubois, the cedilla appears "as early as the 8th century in Visigothic manuscripts, but was little used by scribes, who preferred to add an extra letter to indicate the sibilant sound of c ".
Accordingly, in the manuscripts of The Song of Roland, the cedilla is not used, although modern transcriptions add it for ease of reading.

Paleographic cedilled ''e'' (''e caudata'')

A form resembling a cedilla can therefore be found beneath the letter e in medieval manuscripts, with usage attested as early as the 6th century in uncial script. The resulting letter is known as e caudata. It more or less frequently replaces the Latin digraph ae. This digraph generally represented an open , derived from the Classical Latin diphthong, which was monophthongized from the 2nd century onward.
This usage continued in manuscripts until the 18th century but did not survive the advent of printing:
It is noteworthy that this letter, represented here as ę or ȩ, has been preserved in Romance philological transcription, whereas the digraph ae has been retained in the transcription of Germanic languages. ę was used in manuscripts of Old English written in Insular Irish uncial.
Although this sign is often referred to as a "cedilla", this is an anachronism: it has no connection with the letter z, and it more likely derives from a subscript a.
This cedilla-like mark, whose use varied before the spread of printing, can therefore serve as an indicator for the dating of manuscripts by palaeographers. For example, according to the Dictionnaire de paléographie by Louis Mas Latrie, "manuscripts in which one finds the cedilled e rather than œ must be placed between five and seven hundred years ago", that is, between 1150 and 1350:

Early printing

Manuscript usage was taken up in printing, first by Spanish and Portuguese printers, and then imitated by the French printer Geoffroy Tory. According to Auguste Bernard, as early as 1509, "Tory proposed writing with a cedilla the penultimate e of the third person plural of the perfect tense of verbs of the third conjugation in order to distinguish it from the infinitive," following the model already used shortly before 1509 in the Psalterium quintuplex. If Bernard's account is followed, the cedilla would therefore have been used in Latin printing by Tory from the very beginning of the 16th century.
The cedilla in French, in the form of c-cedilla, was first explicitly advocated in 1529 by the same author, in the introduction to his book , published in 1529.
Its subtitle clearly expresses its purpose: l'art et la science de la due et vraie proportion de la lettre. This work is, moreover, the first typographical treatise written in French:
This defense of the cedilla was not immediately put into practice. In Tory's system, the cedilla was intended to mark /s/. The cedilla formed part of Geoffroy Tory's typographical innovations, whose aim was likely to facilitate the commercialization of the first books printed in French rather than Latin.
He used the cedilla in French for the first time in Le sacre et coronnement de la royne by Guillaume Bochetel, published in 1531.
According to many authors, Tory generalized the use of c-cedilla in his edition of L'Adolescence Clémentine by Clément Marot, the fourth edition of the work, published in 1533. The book had first appeared on 12 August 1532 in Paris, published by Roffet, without cedillas, and then on 7 June 1533 by Tory, this time with cedillas.
In reality, Tory had already introduced the cedilla at the beginning of 1530 in his pamphlet Le sacre et le coronnement de la Royne, imprime par le commandement du Roy nostre Sire, where it appears three times, in the words façon, commença, and Luçon.
The 1533 edition of L'Adolescence Clémentine nevertheless represents the first true generalization of the cedilla in a work that enjoyed success and was intended for a relatively large print run for the period. Tory justified the use of the cedilla in the introduction to this edition using the same arguments already advanced in Champ fleury:
The practical application of Tory's orthographic system is irregular: apostrophes are missing in par faulte dadvis, and oddly placed in combien q'uil—likely a typographical error. As Bernard observes, this was the first work in which Tory applied his orthographic system, and the inexperience of his compositors is evident in the mistakes made by omission or transposition.
From this point onward, the cedilla was adopted by all printers. Before this, supporters of etymological orthography wrote francoys. Usage initially remained unstable. For example, in the Œuvres poétiques of Louise Labé, one finds the cedilla in aperçu but not in perſa, which is instead written with an s to avoid perca.
From there, the use of the "c with a tail" spread throughout France, but it was not until the 17th century that its use became truly common.
In Spanish, the cedilla was abandoned in the 18th century, while /ts/ had simplified to /s/ between the 14th and 16th centuries and then to /θ/ in the 17th century. Other related languages nevertheless retained it.

After the Renaissance

The introduction of such a character in written French was an effective and broadly accepted way of definitively resolving the problem of the ambiguous pronunciation of the Latin letter c. Indeed, when c precedes a, o, or u, it is pronounced /k/; when it precedes any other vowel, it is pronounced /s/. The sign therefore makes it possible to preserve links with the past and to maintain the graphic coherence of the language by making spelling less ambiguous. The presence of a cedilla in a word or form keeps visible the relationships with the etymon and with derived forms or related forms.
For Albert Dauzat, "the simplification of an irrational orthography was in keeping with the tendencies of the 17th century, enamoured of clarity and reason. Many writers called for reform ". The cedilla therefore became a stake in the many projects for orthographic reform of the French language.

T-cedilla in French

With regard to these attempts at orthographic reform, the history of the t-cedilla in French is exemplary.
In 1663, in Rome la ridicule, Caprice by Saint-Amant, the printer and proofreader for the Elzeviers in Amsterdam, Simon Moinet, used the cedilla under the letter t in French.
In 1766,, preacher to the queen, proposed the use of the cedilla under t to distinguish cases where it is read /t/ from those where it is pronounced /s/:
Ambroise Firmin-Didot, in his Observations sur l'orthographe, ou ortografie, française, proposed to the Académie française a similar reform project aiming to introduce a t-cedilla, ţ, in words where t is pronounced /s/ before i. This would have eliminated a large number of irregularities in spelling. One would thus have written: les adopţions, pestilenciel, il différencie, il balbuţie.
In fact, as the author himself notes, the grammarians of Port-Royal had already proposed such an improvement before him. The project ultimately remained a dead letter.

''Açhille'', ''çhien'', ''çheval'': the proposals of Nicolas Beauzée

In the same spirit as that of Firmin-Didot, the generalization of c- and t-cedilla was defended by an Enlightenment grammarian such as Nicolas Beauzée. Thus, according to the 19th-century encyclopedist B. Jullien:
Moreover, it would have been possible to write the words lança and français using the letter s, since the phoneme /ts/ no longer existed at the time of the borrowing of the cedilla. The phoneme had even merged with the other /s/ sounds. However, it was the visual and etymologizing appearance of the word that prevailed. The spelling *lansa would have introduced an awkward alternation: *il lansa ~ ils lancèrent. In other languages, such as Spanish, the spelling of a conjugated verb may be inconsistent: one now writes lanzar, thus "cutting oneself off" from the Latin etymology lanceare, which was more explicitly reflected in lançar.
In addition to maintaining visual etymological coherence, the cedilla also makes it possible, in certain cases, to resolve spelling problems for the sound /s/ derived from /k/. For example, reçu retains a link with recevoir, but above all could not be written in any other way: *resu would be read /ʁəzy/ and *ressu /resy/. The same applies to leçon and other words in which a schwa is followed by the phoneme /s/. In other cases, plain c without a cedilla is retained. The retention of c in such words is explained by an orthographic archaism: the Latin or French etymon remains visible, allowing greater visual coherence by preserving a link between the cedilla-marked derived form and the root from which it originates. In this way, lança and lançons remain clearly and visually connected to the root lanc- /lɑ̃s/ of lancer, lance, etc. Likewise, reçu retains a link with recevoir. Conversely, when the sound /k/ must be obtained before the graphic vowels e, i, and y, a u is used as a diacritic letter following c: accueil.
Used as a diacritic detached from its original c, the cedilla was extended to other letters in other languages from the 19th century onward.

Chronology of the appearance of the cedilla

  • Before the 9th century, occurrences of the Visigothic cedilla, which was shortened to ç in the 11th century.
  • In parallel, the palaeographic cedilled e is attested as early as the 6th century.
  • 9th century – Cantilène de sainte Eulalie: a hapax of the diacritic z, intended to be shortened to ç, appears in the form czo.
  • 1480 – Birth of Geoffroy Tory in Bourges.
  • Before 1500 – Spanish and Portuguese printers create typefaces for the cedilla; these enter France via Toulouse.
  • 1509 – Tory innovates in Latin printing.
  • 1529 – Tory argues for the introduction of the cedilla into French in Champ fleury.
  • Early 1530 – Tory introduces the cedilla in Le sacre et le coronnement de la Royne, imprime par le commandement du Roy nostre Sire.
  • June 7, 1533 – Publication of the fourth edition of L'Adolescence clémentine by Tory, representing a major dissemination of the cedilla.
  • October 1533 – Death of Geoffroy Tory.
  • 18th century – The cedilla disappears from Spanish; it is used by all printers in France. Nicolas Beauzée proposes its generalization in place of s. Numerous spelling reform attempts follow: some call for abandoning the cedilla, others for generalizing it. However, this diacritic, successfully established by Tory shortly before his death in 1533, has retained essentially the same rules of use down to the present day.

    Etymology

Although the cedilla appeared in French manuscripts as early as the 9th century and in French printing from 1530 onward, the word cédille itself is attested only in 1611, in the altered form cerille, and then as cédille in 1654–1655. The word cerilla had, however, already been borrowed from Spanish in 1492, and the form cedilla is attested in 1558. In Spanish, cedilla means "little z" and is the diminutive of the name of the letter z in Spanish, zeda, itself derived from the Latin zeta, from Greek zêta, "the sixth letter of the Greek alphabet". Greek zêta is itself "borrowed from Phoenician ".
In his article in the Encyclopédie, and later in his Œuvres, the term cedilla was mistakenly interpreted by Dumarsais in French as meaning "little c" rather than "little z", due to the shape of the cedilla:

Current usage

Romance languages

In French, Catalan, Occitan, and Portuguese, the Hispanic cedilla is used under the letter c to indicate /s/ before a, o, and u. In Catalan and Occitan, is also used word-finally to indicate /s, for example in dolç.
Friulian uses a cedilled c to represent.

Romanian

In Romanian, the diacritic plays a much more prominent role: Ș ș , and Ț ț . After having been written using so-called Glagolitic characters of Church Slavonic until the 19th century, Romanian has since been written in the Latin alphabet. Its orthography then drew partly on Italian and French models, and partly, especially with regard to letters bearing diacritics, on transliteration practices close to those of the Balkan linguistic area. The most recent major reforms date from 1953, followed by more chaotic changes after the end of communism. Modern Romanian normally uses two letters with a subscript comma.
In 2003, the Romanian Academy specified that the letters ș and ț share the same diacritic: a comma placed a short distance beneath the letters s and t, rather than a cedilla.
Because the ISO/IEC 8859-2 and Unicode standards initially treated the Romanian subscript comma as merely a graphic variant of the cedilla, cedilled s became widespread in computing, especially since it also exists in Turkish. Cedilled t, however, has most often continued to be represented as a t with a subscript comma, primarily for aesthetic reasons. As a result, modern fonts most often display an s with a cedilla and a t with a cedilla shaped like a comma.
Unicode now distinguishes the two characters, as shown in the illustration. The characters named "Latin capital letter S with comma below" and "Latin small letter s with comma below", as well as "Latin capital letter T with comma below" and "Latin small letter t with comma below", are preferred in careful typography.
For alphabetical sorting, the two Romanian letters with subscript comma are considered distinct letters, ordered after s and t.

Turkic languages

Both letters have been used in the orthography of Turkish since the romanization adopted on November 1, 1928. They are regarded as distinct letters, ordered respectively after c and s, and not as variants of those letters. The use of ç for may have been inspired by Albanian usage, while ş appears to follow Romanian practice.
The Turkmen alphabet, adopted in 1991 following the independence of Turkmenistan, is largely inspired by Western alphabets, and particularly by Turkish. As in Turkish, it includes Çç and Şş.

Azerbaijani

In Azerbaijani, the cedilla is used, for example, in içmək and danışmak .

Tatar

In the Tatar Latin alphabet Jaᶇalif or Yañalatinitsa, which was adopted in 1999 and is commonly used on the Internet, two letters with a cedilla are employed:
In the literary Tatar language, the letter ç is pronounced, while c is. In the western and southern parts of the Tatar-speaking area, ç is, or in the north, and c is.
In Siberia, in the eastern part of the Tatar-speaking area, ç is, and c is.

Albanian

In the current orthography of Albanian, adopted in 1908 at the Congress of Monastir, the letter ç is used to represent.

Latvian

  • Ģģ
  • Ķķ
  • Ļļ
  • Ņņ
  • Ŗŗ
Latvian uses a cedilla in the form of a "subscript comma" to indicate the palatalization of the consonants /g/, /k/, /l/, /n/, and /r/, written as ģ, ķ, ļ, ņ, and ŗ. For reasons of legibility, this diacritic is placed above the lowercase g, where it may take several forms, including a curved quotation mark, an inverted comma, or an acute accent. For the uppercase G, where legibility is not an issue, the diacritic remains below: Ģ.
As the pronunciation of r and ŗ is no longer distinguished in standard Latvian, the latter letter was removed from the orthography during the years of Soviet occupation. This reform was generally not accepted by Latvians in exile. After Latvia regained independence in 1991, ŗ was nevertheless not reinstated in the official orthography.
Latvian orthography, derived from German, introduced cedillas and ogoneks in order to enrich an alphabet of German origin that was insufficient to represent all Latvian sounds. Thus, Ģ, Ķ, Ļ, and Ņ still denote the palatalized equivalents of G, K, L, and N. Until the beginning of the 20th century, Latvian orthography was highly irregular.

Other alphabets

Some recently created alphabets directly inspired by the Latin alphabet have added numerous diacritics to address mismatches between sounds and letters. A well-known example is Vietnamese, which does not use the cedilla. By contrast, the Marshallese alphabet does include it, and is often cited as a notable example of an alphabet devised by linguists studying the language.

Kurdish

In Kurdish, examples include şer and piçûk.

Marshallese

  • Ļ, ļ
  • Ņ, ņ
Marshallese is written using a Latin alphabet that includes several unusual cedilled letters: l, m, n, and o, namely ļ, , ņ, and . Of these, only l and n exist as precomposed Unicode characters. The others must be composed using the combining cedilla U+0327. Care should be taken not to encode o with cedilla as o with an ogonek.
According to a foundational grammar available online, ļ would correspond to, to , ņ to , and to a type of long /oː/. These values are not confirmed by a study of Marshallese phonology, which does not discuss the current orthography.

Cameroonian languages

The General Alphabet of Cameroonian Languages recommends avoiding diacritics above graphemes to modify phonetic value, reserving that position for tone marking. Diacritics below graphemes are therefore preferred for phonetic modification. The cedilla is one such diacritic, indicating nasalization in practice, notably in Dii, Kako, Karang, Maka, Mbodomo, Mundani, Pana, and Vute.
Nasalized vowels marked with a cedilla include:
  • A̧, a̧
  • Ȩ, ȩ
  • Ɛ̧, ɛ̧
  • Ə̧, ə̧
  • I̧, i̧
  • Ɨ̧, ɨ̧
  • O̧, o̧
  • Ɔ̧, ɔ̧
  • U̧, u̧

    Kinande

In Kinande, the cedilla is used to indicate advanced tongue root articulation in vowels, notably i and u:
  • I̧, i̧
  • U̧, u̧

    Indigenous languages of the Americas

In the orthographies developed by the New Tribes Mission for Jodï, Maco, and Piaroa, the cedilla is used to indicate nasalized vowels.

Languages with ogoneks

The cedilla should not be confused with the ogonek, which is not discussed in this article. Languages such as Navajo, Apache, Polish, and, as in the example below, Lithuanian, do not use cedillas but ogoneks:
  • Ą, ą
  • Ę, ę
  • Į, į
  • Ų, ų

    Phonetic transcription

In the International Phonetic Alphabet, represents the voiceless palatal fricative. This sound does not occur in French.
Alan Timberlake uses the cedilla to indicate consonant palatalization in a Russian grammar published in 2004: p̧ b̧ ţ ḑ ķ ģ ç̆ ʒ̧̆ ş ş̆ x̧ v̧ z̧ z̧̆ m̧ ņ ļ ŗ.

ASCII and ISO 646 transcription

Basic ASCII does not include letters with diacritics. At a time when it was often the only available code page, some users simulated the cedilla by placing a comma after the letter; for example, writing c,a for ça.
However, national variants of ISO 646 used the few non-invariant positions of the standard to encode additional punctuation marks and diacritics:
  • The French version encodes the lowercase c with cedilla at position 124, replacing the | character of the American version.
  • An earlier French version required the use of the backspace control character to overstrike characters and simulate the addition of a diacritic, except for letters already encoded with diacritics in the national variant; thus the cedilla could be encoded as following an uppercase C.
  • The Spanish, Catalan, and Basque versions of ISO 646 encode uppercase and lowercase c with cedilla at positions 93 and 125 respectively, replacing the ASCII characters ] and '''