Languages of science


Languages of science are vehicular languages used by one or several scientific communities for international communication. According to the science historian Michael Gordin, scientific languages are "either specific forms of a given language that are used in conducting science, or they are the set of distinct languages in which science is done." These two meanings are different, since the first describes a distinct prose in a given language, while the second describes which languages are used in mainstream science.
File:13-11-02-olb-by-RalfR-03.jpg|thumb|Science library of Upper Lusatia in Görlitz, Germany
Until the 19th century, classical languages—such as Latin, Classical Arabic, Sanskrit, Classical Malay and Classical Chinese—were commonly used across Afro-Eurasia for international scientific communication. A combination of structural factors, the emergence of nation-states in Europe, the Industrial Revolution, and the expansion of colonization entailed the global use of three European national languages: French, German, and English. Yet new languages of science, such as Russian and Italian, had started to emerge by the end of the 19th century—to the point that international scientific organizations began promoting the use of constructed languages such as Esperanto as a non-national global standard.
After the First World War, English gradually outpaced French and German; it became the leading language of science, but not the only international standard. Research in the Soviet Union rapidly expanded in the years after the Second World War, and access to Russian journals became a major policy issue in the United States, prompting the early development of machine translation. In the last decades of the 20th century, an increasing number of scientific publications were written primarily in English, in part due to the preeminence of English-speaking scientific infrastructure, indexes, and metrics such as the Science Citation Index. Local languages remain largely relevant for science in major countries and world regions such as China, Latin America, and Indonesia. Disciplines and fields of study with a significant degree of public engagement—such as social sciences, environmental studies, and medicine—have also maintained the relevance of local languages.
File:Open science multilingual.png|thumb|upright=1.35|Multilingualism is a core feature of open science, according to UNESCO.
The development of open science has revived the debate over linguistic diversity in science, as social and local impact has become an important objective of open science infrastructure and platforms. In 2019, 120 international research organizations cosigned the Helsinki Initiative on Multilingualism in Scholarly Communication; they also called for supporting multilingualism and the development of an "infrastructure of scholarly communication in national languages". In 2021, UNESCO's Recommendation for Open Science included "linguistic diversity" as one of the core features of open science, since this diversity aims to "make multilingual scientific knowledge openly available, accessible and reusable for everyone." In 2022, the Council of the European Union officially supported "initiatives to promote multilingualism" in science, such as the Helsinki Initiative.

History

From classical languages to vernaculars

Until the 19th century, classical languages played an instrumental role in the diffusion of languages in Europe, Asia, and North Africa.
In Europe, starting in the 12th century, Latin was the primary language of religion, law, and administration until the Early Modern period. It became a language of science "through its encounter with Arabic"; during the Renaissance of the 12th century, a large corpus of Arabic scholarly texts was translated into Latin, so that it would be available in the emerging network of European universities and centers of knowledge. In this process, the Latin language changed and acquired the specific features of scholastic Latin through numerous lexical and even syntactic borrowings from Greek and Arabic. The use of scientific Latin persisted long after the replacement of Latin by vernacular languages in most European administrations: "Latin's status as a language of science rested on the contrast it made with the use of the vernacular in other contexts" and created "a European community of learning" entirely distinct from the local communities where the scholars lived. Latin was never the sole language of science and education. Beyond local publications, vernaculars quite early attained the status of international scientific languages, which could be expected to be understood and translated across Europe. In the mid-16th century, a significant amount of printed output in France was in Italian.
In India and South Asia, Sanskrit was a leading vehicular language for science. Sanskrit was remodeled even more radically than Latin for scientific communication, as it shifted "toward ever more complex noun forms to encompass the kinds of abstractions demanded by scientific and mathematical thinking." Classical Chinese held a similarly prestigious position in East Asia, being largely adopted by scientific and Buddhist communities beyond the Chinese Empire, notably in Japan and Korea.
Classical languages declined throughout Eurasia during the second millennium. Sanskrit was increasingly marginalized after the 13th century. Until the end of the 17th century in Europe, Latin resisted displacement by vernacular languages: although medical books in the 16th century began to use French as well, this trend was reversed after 1597, and most medical literature in France remained accessible only in Latin until the 1680s. In 1670, as many books were printed in Latin as in German in the German states; in 1787, such books accounted for no more than 10% of the total. At this point, Latin's decline became irreversible: since ever fewer European scholars were conversant with the language, publications using it dwindled, and there was reduced incentive to maintain linguistic training in Latin.
The emergence of scientific journals was both a symptom and a cause of the declining use of a classical language. The first two modern scientific journals were published simultaneously in 1665: the Journal des Sçavans in France and the Philosophical Transactions of the Royal Society in England. Both journals used the local vernacular, which "made perfect historical sense", as both the Kingdom of France and the Kingdom of England were engaged in an active policy of linguistic promotion of their language standard.

European and auxiliary languages (1800–1920)

The gradual disuse of Latin opened an uneasy transition period, as more and more works were accessible only in local languages. Many national European languages held the potential to become a language of science within a specific research field: some scholars "took measures to learn Swedish so they could follow the work of Bergman and his compatriots."
Language preferences and use across scientific communities were gradually consolidated into a triumvirate or triad of dominant languages of science: French, English, and German. While each language could be expected to be understood for international scientific communication, each also followed "different functional distributions evident in various scientific fields". French had been almost acknowledged as the international standard for European science in the late 18th century, and it remained "essential" throughout the 19th century. German became a major scientific language during the 19th century, since it "covered portions of the physical sciences, particularly physics and chemistry, in addition to mathematics and medicine." English was used largely by researchers and engineers because of the seminal contribution of English technology to the Industrial Revolution.
In the years preceding the First World War, the linguistic diversity of scientific publications increased significantly. The emergence of modern nationalities and early decolonization movements created new incentives to publish scientific knowledge in one's national language. Russian was one of the most successful developments as a new language of science. During the 1860s and 1870s, Russian researchers in chemistry and other physical sciences ceased publishing in German in favor of local periodicals, following major work in adapting and creating names for scientific concepts or elements. A controversy over the meaning of Dmitri Mendeleev's periodic table contributed to acknowledging original publications in Russian in global scientific debate: the original version was deemed more authoritative than its first "imperfect" translation in German.
Linguistic diversity became framed as a structural problem that ultimately limited the spread of scientific knowledge. In 1924, the linguist Roland Grubb Kent underlined that scientific communication could soon be significantly disrupted by the use of as many as "twenty" languages of science:
The definition of an auxiliary language for science became a major issue discussed in emerging international scientific institutions. On January 17, 1901, the newly established International Association of Academies created the Delegation for the Adoption of an International Auxiliary Language "with support from 310 member organizations". This delegation was tasked with finding an auxiliary language that could be used for "scientific and philosophical exchanges", and it could not be any "national language". In the context of increased nationalistic tensions, any of the dominant languages of science would have appeared as a partisan choice. The delegation consequently had a limited set of options: these included the unlikely revival of a classical language such as Latin, or a new constructed language such as Volapük, Idiom Neutral, or Esperanto.
Throughout the first part of the 20th century, Esperanto was seriously considered as a potential international language of science. As late as 1954, UNESCO passed a recommendation to promote the use of Esperanto for scientific communication. In contrast with Idiom Neutral—or the simplified version of Latin, Interlingua—Esperanto was not conceived primarily as a scientific language. Yet, by the early 1900s, Esperanto was by far the most successful constructed language, with a large international community and numerous dedicated publications. Starting in 1904, the Internacia Science Revuo aimed to adapt Esperanto to the specific needs of scientific communication. The development of a specialized technical vocabulary was a challenging task, since Esperanto's extensive derivation system made it complicated to directly import words commonly used in German, French, or English scientific publications. In 1907, the Delegation for the Adoption of an International Auxiliary Language seemed close to retaining Esperanto as its preferred language. Nevertheless, significant criticism was still addressed at a few remaining complexities of the language, as well as its lack of scientific purpose and technical vocabulary. Unexpectedly, the delegation supported a new variant of Esperanto, Ido, which was submitted late in the process by an unknown contributor. While this decision was framed as a compromise between the Esperantist and the anti-Esperantist factions, it ultimately disappointed all proponents of an international medium for scientific communication, and it durably harmed the adoption of constructed languages in academic circles.