Subject–object–verb word order


In linguistic typology, a subject–object–verb language is one in which the subject, object, and verb of a sentence always or usually appear in that order. If English were SOV, "Sam apples ate" would be an ordinary sentence, as opposed to the actual Standard English "Sam ate apples" which is subject–verb–object.
The term is often loosely used for ergative languages like Adyghe and Basque that in fact have agents instead of subjects.

Incidence

Among natural languages with a word order preference, SOV is the most common type.
Languages that have SOV structure include:
Other languages allow for an SOV structure under certain conditions:
A rare example of SOV word order in English is "I thee wed " in the wedding vow "With this ring, I thee wed."

Properties

SOV languages have a strong tendency to use postpositions rather than prepositions, to place auxiliary verbs after the action verb, to place genitive noun phrases before the possessed noun, to place a name before a title or honorific and to have subordinators appear at the end of subordinate clauses. They have a weaker but significant tendency to place demonstrative adjectives before the nouns they modify. Relative clauses preceding the nouns to which they refer usually signals SOV word order, but the reverse does not hold: SOV languages feature prenominal and postnominal relative clauses roughly equally. SOV languages also seem to exhibit a tendency towards using a time–manner–place ordering of adpositional phrases.
In linguistic typology, one can usefully distinguish two types of SOV languages in terms of their type of marking:
  1. dependent-marking has case markers to distinguish the subject and the object, which allows it to use the variant OSV word order without ambiguity. This type usually places adjectives and numerals before the nouns they modify, and is exclusively suffixing without prefixes. SOV languages of this first type include Japanese and Tamil.
  2. head-marking distinguishes subject and object by affixes on the verb rather than markers on the nouns. It also differs from the dependent-marking SOV language in using prefixes as well as suffixes, usually for tense and possession. Adjectives in this type are much more verb-like than in dependent-marking SOV languages, and hence they usually follow the nouns. In most SOV languages with a significant level of head-marking or verb-like adjectives, numerals and related quantifiers also follow the nouns they modify. Languages of this type include Navajo and Seri.
In practice, of course, the distinction between these two types is far from sharp. Many SOV languages are substantially double-marking and tend to exhibit properties intermediate between the two idealised types above.
Many languages that have shifted to SVO word order from earlier SOV retain the properties; for example, the Finnish language, which retains a high usage of postpositions.

Examples

Afroasiatic languages

SOV word order is generally found in the Afroasiatic branches of the Ethiopian language area.

Somali

generally uses the subject–object–verb structure when speaking formally.

Ainu

The following example is from Hokkaido Ainu.

Austroasiatic languages

Many Austroasiatic languages in South Asia exhibit SOV word order typical to the region.

Basque

Basque in short sentences, usually, subject or agent–object–verb; in long sentences, usually, subject or agent-verb-objects:
Anyway, both in spoken and written Basque, the pure SOV sentence is perfectly valid, just as the OVS order. In questions, fronting the verb in a VSO sentence is also correct, together with the usual SOV order or the SVO one.

Dravidian languages

The Dravidian languages commonly exhibit or prefer SOV order.

Malayalam

  • Pustakam̥ + -e = pustakatte

Tamil

Tamil being a strongly head-final language, the basic word-order is SOV. However, since it is highly inflected, word order is flexible and is used for pragmatic purposes. That is, fronting a word in a sentence adds emphasis on it; for instance, a VSO order would indicate greater emphasis on the verb, the action, than on the subject or the object. However, such word-orders are highly marked, and the basic order remains SOV.

Georgian

The Georgian language is not extremely rigid with regards to word order, but is typically either SOV or SVO.

Greenlandic

The default word order for simple clauses in Greenlandic is SOV.

Indo-European languages

SOV word order is quite common among Indo-European languages, leading to a common hypothesis that this reflects the original preferred word order of the ancestral Proto-Indo-European language. However, the question remains unsettled.

Albanian

has free word order, but generally prefers SVO. SOV occurs only in poetic language.

Armenian

generally prefers SOV.

Germanic languages

Linguistic consensus holds that the Proto-Germanic language had free word order but preferred SOV. While some Germanic languages have transitioned to SVO, SOV remains a feature of some major modern Germanic languages, including German and Dutch. However, these modern SOV Germanic languages also exhibit V2 word order, which supersedes the "default" SOV such that many sentences are rendered subject-verb-object.
Dutch
is SOV combined with V2 word order. The non-finite verb remains in final position, but the finite verb is moved to the second position. Simple verbs look like SVO, non-finite verbs and compound verbs follow this pattern:
Pure SOV order is found in subordinate clauses:
German
is SOV combined with V2 word order. The non-finite verb remains in final position, but the finite verb is moved to the second position. Simple verbs look like SVO, compound verbs follow this pattern:
The word order changes also depending on whether the phrase is a main clause or a dependent clause. In dependent clauses, the word order is always entirely SOV :
Gothic
The Gothic language, an extinct East Germanic language, had free word order, but SOV constructions were common.

Greek (Classical)

Ancient Greek had free word order but generally preferred SOV sentences:
This is distinct from Modern Greek, where SVO is preferred.

Indo-Aryan languages

, the oldest known of the Indo-Aryan languages, was an inflected language and very flexible in word order, allowing all possible word combinations. Its descendant, Classical Sanskrit, shared this feature but generally preferred SOV sentences.
Most later Indo-Aryan languages continue to prefer SOV word order, for example:
Bengali:
Hajong:
re is a particle that indicates the accusative case and 'sei' indicates past tense declarative. Here, e is pronounced as the 'i' in 'girl' and 'ei' is pronounced as the 'ay' in 'say'.
Hindi:
Marathi:
Nepali:
Odia:
Urdu:
This preference is not fixed in all Indo-Aryan languages. Punjabi, for instance, may be characterised as following a Subject—Object—Verb typology overall, but some flexibility is permitted, and this tendency does not follow in sentences involving personal pronouns. Examples are shown here in both Shahmukhi and Gurmukhi. The word forms used reflect those typical of spoken language. For Shahmukhi, vocalised forms with vowel diacritics have been used to explicitly indicate the forms used; in typical writing these are omitted in most words where regular patterns allow this information to be inferred contextually.
The following sentence exhibits the typical SOV word order tendency. The verb phrase is in retrospective perfect participle form, indicating completion of the action, and takes on the feminine plural suffixes in agreement with the gender and number of the object. The subject here is a masculine plural form; in this context it does not require agreement from the verb.
By contrast, in the following sentence the person involved, referred to by a first-person pronoun, is the object rather than the subject. The significance of people as a semantic category takes precedent over the SOV word order tendency, and the person is typically first even in sentences where that person is the object. The pronoun "mainū̃" has the postposition "nū̃" agglutinated to it, approximately meaning "to." Abstract concepts like desires and emotions typically come "to" people as agentive subjects.
The copula in Punjabi is extraverbal in function. While it can constitute the predicate of a sentence on its own, it does not enter the verb phrase when used alongside a full lexical verb. Instead, it acts as a marker of existence remote to or near to the situation. Some western dialects such as Pothohari have forms of the copula to indicate occurrence of a situation in the future.
However, some Indo-Aryan languages exhibit V2 word order in combination with SOV, most prominently Kashmiri. The non-finite verb remains in final position, but the finite part of the verb appears in second position. Simple verbs look like SVO, whereas auxiliated verbs are discontinuous and adhere to this pattern:
Given that Kashmiri is a V2 language, if the word tsũũţh 'apple' comes first then the subject kuur 'girl' must follow the auxiliary chhi 'is': tsũũţh chhi kuur khyevaan
Also, the word order changes depending on whether the phrase is in a main clause or in certain kinds of dependent clause. For instance, in relative clauses, the word order is SOVAux:

Iranian languages

The Iranian languages almost uniformly exhibit SOV word order:
Kurdish :
Kurdish :
Ossetian:
Pashto:
Persian:
Talysh:
The Zaza language usually uses a subject–object-verb structure, but it sometimes uses subject-verb-object too.

Italic languages

Latin
was an inflected language and had a very flexible word order and sentence structure, but the most usual word order in formal prose was SOV.
Again, there are multiple valid translations that do not affect the overall analysis.
Romance languages
Although their common ancestor Latin had free word order and preferred SOV, the modern Romance languages lost the Latin declension that enabled free word order and in general require subject-verb-object structures. However, remnants of SOV remain, particularly the clitic object pronouns common in Romance grammar. For instance, in French:
And Portuguese:
And in Spanish:
Contrast this with the SVO structure of a sentence with an explicit object :
The SOV tendency can also be seen when using auxiliary verbs, e.g. in Italian:
SOV also appears in Portuguese using a temporal adverb, optionally with the negative:
And in a suffix construction for the future and conditional tenses:
SVO form: Eu hei-de fazê-lo amanhã or eu farei o mesmo amanhã

Japanese

The basic principle in Japanese word order is that modifiers come before what they modify. For example, in the sentence "こんな夢を見た。", the direct object "こんな" modifies the verb "見た". Beyond this, the order of the elements in a sentence is relatively free. However, because the topic/subject is typically found in sentence-initial position and the verb is typically in sentence-final position, Japanese is considered an SOV language.
A closely related quality of the language is that it is broadly head-final.

Korean

–가/–이 -ga/-i is a particle that indicates the subject. –를/–을 -eul is a particle that indicates the object.
na "I" is changed to 내– nae- before –가 -ga, and the verb stem 열– yeol- is changed to 여– yeo- before –ㄴ다 -nda.

Quechua

Quechuan languages have standard SOV word order. The following example is from Bolivian Quechua.

Sino-Tibetan languages

SOV is believed to have been the "default" order of the protolanguage of the Sino-Tibetan family. Most Sino-Tibetan languages exhibit SOV order; however, the largest sub-branch of the family, the Sinitic or Chinese languages, are uniformly SVO, with some SOV-derived features.

Burmese

is an analytic language.

Chinese

Generally, Chinese varieties all feature SVO word order. However, especially in Standard Mandarin, SOV is tolerated as well. There is even a special particle 把 (bǎ) used to form an SOV sentence.
The following example that uses 把 is controversially labelled as SOV. 把 may be interpreted as a verb, meaning "to hold". However, it does not mean to hold something literally or physically. Rather, the object is held figuratively, and then another verb is acted on the object.
SOV structure is widely used in railway contact in order to clarify the objective of the order.

Tungusic languages

The Tungusic languages exhibit SOV word order by default.

Turkic languages

The Turkic languages all exhibit flexibility in word order, so any order is possible. However, the SOV order is the "default" one that does not connote particular emphasis on any part of the sentence; alternate orders are possible, but are used for emphasis. For instance, in Turkish, the following is the "default" way of saying "Murat ate the apple":
However, this sentence could also be constructed as OSV, OVS, VSO, VOS, or SVO, to indicate the relative importance of the subject, object, or the verb.
Similarly, in Uzbek this SOV sentence is neutral:
But the sentence can be changed into OSV as well to change the emphasis.
The same holds in Kazakh, where the below is neutral:
But an OSV sentence can be used to change the emphasis.
Other examples of SOV sentences in Turkic:
Azerbaijani:
Kyrgyz:

Uralic languages

The "idealized" profile of the Uralic languages has subject–verb–object word order. However, some Uralic languages, including the Samoyedic languages, some Sámi languages and Hungarian, exhibit SOV.
The protolanguage of the Uralic language family is understood to have exhibited SOV order.

Hungarian

word order is free, although the meaning slightly changes. Almost all permutations of the following sample are valid, but with stress on different parts of the meaning.