Hyphen
The hyphen '' is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation.
The hyphen is sometimes confused with dashes, which are wider, or with the minus sign, which is also wider and usually drawn a little higher to match the crossbar in the plus sign.
As an orthographic concept, the hyphen is a single entity. In character encoding for use with computers, it is represented in Unicode by any of several characters. These include the dual-use hyphen-minus, the soft hyphen, the nonbreaking hyphen, and an unambiguous form known familiarly as the "Unicode hyphen", shown at the top of the infobox on this page. The character most often used to represent a hyphen is called the "hyphen-minus" in the Unicode specification because it is also used as a minus sign. The name derives from its name in the original ASCII standard, where it was called "hyphen".
Etymology
The word is derived from Ancient Greek ὑφ' ἕν, contracted from ὑπό ἕν, "in one". An ὑφέν was an undertie-like sign written [|below] two adjacent letters to indicate that they belong to the same word when it was necessary to avoid ambiguity, before word spacing was practiced.History
The first known documentation of the hyphen is in the grammatical works of Dionysius Thrax. At the time hyphenation was joining two words that would otherwise be read separately by a low tie mark between the two words. In Greek these marks were known as enotikon, officially romanized as a hyphen.With the introduction of letter spacing in the Middle Ages, the hyphen, still written beneath the text, reversed its meaning. Scribes used the mark to connect two words that had been incorrectly separated by a space. This era also saw the introduction of the marginal hyphen, for words broken across lines.
The modern format of the hyphen originated with Johannes Gutenberg of Mainz, Germany, with the publication of his 42-line Bible. His tools did not allow for a sublinear hyphen, and he thus moved it to the middle of the line. Examination of an original copy on vellum in the U. S. Library of Congress shows that Gutenberg's movable type was set justified in a uniform style, 42 equal lines per page. The Gutenberg printing press required words composed of individual movable type to be secured within a rigid, nonprinting frame. To ensure each line fit the frame uniformly, Gutenberg addressed differences in line length by inserting a hyphen at the end of a line at the right-hand margin. This interrupted the letters in the last word, requiring the remaining letters be carried over to the start of the line below. His double hyphen, , appears throughout the Bible as a short, double line inclined to the right at a 60-degree angle.
Use in English
The English language does not have definitive hyphenation rules, though various style guides provide detailed usage recommendations and have a significant amount of overlap in what they advise. Hyphens are mostly used to break single words into parts or to join ordinarily separate words into single words. Spaces are not placed between a hyphen and either of the elements it connects except when using a suspended or "hanging" hyphen that stands in for a repeated word. Style conventions that apply to hyphens have evolved to support ease of reading in complex constructions; editors often accept deviations if they aid rather than hinder easy comprehension.The use of the hyphen in English compound nouns and verbs has, in general, been steadily declining. Compounds that might once have been hyphenated are increasingly left with spaces or are combined into one word. Reflecting this changing usage, in 2007, the sixth edition of the Shorter Oxford English Dictionary removed the hyphens from 16,000 entries, such as fig-leaf, pot-belly, and pigeon-hole. The increasing prevalence of computer technology and the advent of the Internet have given rise to a subset of common nouns that might have been hyphenated in the past.
Despite decreased use, hyphenation remains the norm in certain compound-modifier constructions and, among some authors, with certain prefixes. Hyphenation is also routinely used as part of syllabification in justified texts to avoid unsightly spacing.
Separating
Justification and line-wrapping
When flowing text, it is sometimes preferable to break a word into two so that it continues on another line rather than moving the entire word to the next line. The word may be divided at the nearest break point between syllables and a hyphen inserted to indicate that the letters form a word fragment, rather than a full word. This allows more efficient use of paper, allows flush appearance of right-side margins without oddly large word spaces, and decreases the problem of rivers. This kind of hyphenation is most useful when the width of the column is very narrow. For example:Rules for correct hyphenation vary between languages, and may be complex, and they can interact with other orthographic and typesetting practices. Hyphenation algorithms, when employed in concert with dictionaries, are sufficient for all but the most formal texts.
It may be necessary to distinguish an incidental line-break hyphen from one integral to a word being mentioned or present in an original text being quoted, not only to control its word wrap behavior but also to differentiate appearance. Webster's Third New International Dictionary and the Chambers Dictionary use a double hyphen for integral hyphens and a single hyphen for line-breaks, whereas Kromhout's Afrikaans–English dictionary uses the opposite convention. The Concise Oxford Dictionary suggested repeating an integral hyphen at the start of the following line.
Prefixes and suffixes
es and suffixes are sometimes hyphenated, especially when the unhyphenated spelling resembles another word or when the affixation is deemed misinterpretable, ambiguous, or somehow "odd-looking". However, the unhyphenated style, which is also called closed up or solid, is usually preferred, particularly when the derivative has been relatively familiarized or popularized through extensive use in various contexts. As a rule of thumb, affixes are not hyphenated unless the lack of a hyphen would hurt clarity.The hyphen may be used between vowel letters to indicate that they do not form a digraph. Some words have both hyphenated and unhyphenated variants: /deescalate, /cooperation, /reexamine, /deemphasize, and so on. Words often lose their hyphen as they become more common, such as email instead of '. When there are tripled letters, the hyphenated variant of these words is often more common.
Closed-up style is avoided in some cases: possible homographs, such as recreation versus ', retreat versus ', and ' versus unionized ; combinations with proper nouns or adjectives ; acronyms ; or numbers. Although is still hyphenated by both Dorland's and Merriam-Webster's Medical, the solid styling is a common variant, particularly among oncologists and geneticists.
A diaeresis may also be used in a like fashion, either to separate and mark off monographs or to signalize a vocalic terminal e. This use of the diaeresis peaked in the late 19th and early 20th centuries, but it was never applied extensively across the language: only a handful of diaereses, including coöperation and Brontë, are encountered with any appreciable frequency in English; thus reëxamine, reïterate, deëmphasize, etc. are seldom encountered. In borrowings from Modern French, whose orthography utilizes the diaeresis as a means to differentiate graphemes, various English dictionaries list the dieresis as optional despite the juxtaposition of a and i.
Syllabification and spelling
Hyphens are occasionally used to denote syllabification, as in syl-la-bi-fi-ca-tion. Various British and North American dictionaries use an interpunct, for this purpose, as in syl·la·bi·fi·ca·tion. This practice allows the hyphen to be reserved exclusively for instances where a true hyphen is intended – for example, self-con·scious, un·self-con·scious, and long-stand·ing. Similarly, hyphens may be used to indicate the spelling of a word, as in W-O-R-D to represent word.In nineteenth-century American literature, hyphens were also used irregularly to divide syllables in words from indigenous North American languages, without regard for etymology or pronunciation, such as "Shuh-shuh-gah" in The Song of Hiawatha. This usage is now rare and deprecated, except in some place names such as Ah-gwah-ching.
Joining
Compound modifiers
s are groups of two or more words that jointly modify the meaning of another word. When a compound modifier other than an adverb–adjective combination appears before a term, the compound modifier is often hyphenated to prevent misunderstanding, such as in player or paintings. Without the hyphen, there is potential confusion about whether the writer means a "player of American football" or an "American player of football" and whether the writer means paintings that are "little celebrated" or "celebrated paintings" that are little. Compound modifiers can extend to three or more words, as in candy, and can be adverbial as well as adjectival. However, if the compound is a familiar one, it is usually unhyphenated. For example, some style guides prefer the construction high school students, to students. Although the expression is technically ambiguous, it would normally be formulated differently if other than the first meaning were intended. Noun–noun compound modifiers may also be written without a hyphen when no confusion is likely: grade point average and department store manager.When a compound modifier follows the term to which it applies, a hyphen is typically not used if the compound is a temporary compound. For example, "that gentleman is well respected", not "that gentleman is well-respected"; or "a patient-centered approach was used" but "the approach was patient centered." But permanent compounds, found as headwords in dictionaries, are treated as invariable, so if they are hyphenated in the cited dictionary, the hyphenation will be used in both attributive and predicative positions. For example, "A cost-effective method was used" and "The method was cost-effective". When one of the parts of the modifier is a proper noun or a proper adjective, there is no hyphen.
When the first modifier in a compound is an adverb ending in -ly, various style guides advise no hyphen. However, some do allow for this use. For example, The Economist Style Guide advises: "Adverbs do not need to be linked to participles or adjectives by hyphens in simple constructions... Less common adverbs, including all those that end -ly, are less likely to need hyphens." In the 19th century, it was common to hyphenate adverb–adjective modifiers with the adverb ending in -ly. However, this has become rare. For example, wholly owned subsidiary and quickly moving vehicle are unambiguous, because the adverbs clearly modify the adjectives: "quickly" cannot modify "vehicle".
However, if an adverb can also function as an adjective, then a hyphen may be or should be used for clarity, depending on the style guide. For example, the phrase reasons is distinguished from more important reasons, where more is an adjective. Similarly, scenery is distinct from more beautiful scenery. A few short and common words—such as well, ill, little, and much—attract special attention in this category. The hyphen in "well- noun", such as in "well-differentiated cells", might reasonably be judged superfluous, yet plenty of style guides call for it. Because early has both adverbial and adjectival senses, its hyphenation can attract attention; some editors, due to comparison with advanced-stage disease and adult-onset disease, like the parallelism of early-stage disease and early-onset disease. Similarly, the hyphen in little-celebrated paintings clarifies that one is not speaking of little paintings.
Hyphens are usually used to connect numbers and words in modifying phrases. Such is the case when used to describe dimensional measurements of weight, size, and time, under the rationale that, like other compound modifiers, they take hyphens in attributive position, although not in predicative position. This is applied whether numerals or words are used for the numbers. Thus woman and woman or wingspan and wingspan, but the woman is 28 years old and a wingspan of 32 feet. However, with symbols for SI units —in contrast to the names of these units —the numerical value is always separated from it with a space: a 25 kg sphere. When the unit names are spelled out, this recommendation does not apply: a sphere, a roll of film.
In spelled-out fractions, hyphens are usually used when the fraction is used as an adjective but not when it is used as a noun: thus majority and portion but I drank two thirds of the bottle or I kept three quarters of it for myself. However, at least one major style guide hyphenates spelled-out fractions invariably.
In English, an en dash,, sometimes replaces the hyphen in hyphenated compounds if either of its constituent parts is already hyphenated or contains a space. A commonly used alternative style is the hyphenated string.
Object–verbal-noun compounds
When an object is compounded with a verbal noun, such as egg-beater, the result is sometimes hyphenated. Some authors do this consistently, others only for disambiguation; in this case, egg-beater, egg beater, and eggbeater are all common.An example of an ambiguous phrase appears in they stood near a group of alien lovers, which without a hyphen implies that they stood near a group of lovers who were aliens; they stood near a group of alien-lovers clarifies that they stood near a group of people who loved aliens, as "alien" can be either an adjective or a noun. On the other hand, in the phrase a hungry pizza-lover, the hyphen will often be omitted, as "pizza" cannot be an adjective and the phrase is therefore unambiguous.
Similarly, a man-eating shark is nearly the opposite of a man eating shark; the first refers to a shark that eats people, and the second to a man who eats shark meat. A government-monitoring program is a program that monitors the government, whereas a government monitoring program is a government program that monitors something else.
Personal names
Some married couples compose a new surname for their new family by combining their two surnames with a hyphen. Jane Doe and John Smith might become Jane and John Smith-Doe, or Doe-Smith, for instance. In some countries only the woman hyphenates her birth surname, appending her husband's surname.With already-hyphenated names, some parts are typically dropped. For example, Aaron Johnson and Samantha Taylor-Wood became Aaron Taylor-Johnson and Sam Taylor-Johnson. Not all hyphenated surnames are the result of marriage. For example Julia Louis-Dreyfus is a descendant of Louis Lemlé Dreyfus whose son was Léopold Louis-Dreyfus.
Hyphens are often used in romanization of Korean names, such as Ban Ki-moon and Cho Jung-ho.
Other compounds
Connecting hyphens are used in a large number of miscellaneous compounds, other than modifiers, such as in lily-of-the-valley, cock-a-hoop, clever-clever, tittle-tattle and orang-utan. Use is often dictated by convention rather than fixed rules, and hyphenation styles may vary between authors; for example, orang-utan is also written as orangutan or orang utan, and lily-of-the-valley may be hyphenated or not.Suspended hyphens
A suspended hyphen may be used when a single base word is used with separate, consecutive, hyphenated words that are connected by "and", "or", or "to". For example, short-term and long-term plans may be written as short- and long-term plans. This usage is now common and specifically recommended in some style guides. Suspended hyphens are also used, though less commonly, when the base word comes first, such as in "investor-owned and ". Uses such as "applied and sociolinguistics" are frowned upon; the Indiana University style guide uses this example and says "Do not 'take a shortcut' when the first expression is ordinarily open". This is different, however, from instances where prefixes that are normally closed up are used suspensively. For example, ' becomes ' when suspended. Some editors prefer to avoid suspending such pairs, choosing instead to write out both words in full.Other uses
A hyphen may be used to connect groups of numbers, such as in dates, telephone numbers or sports scores.It can also be used to indicate a range of values, although many styles prefer an en dash.
It is sometimes used to hide letters in words, as in "G-d", although an en dash can be used as well.
It is often used in reduplication.
Due to their similar appearances, hyphens are sometimes mistakenly used where an en dash or em dash would be more appropriate.
Varied meanings
Some stark examples of semantic changes caused by the placement of hyphens to mark attributive phrases:Disease-causing poor nutrition is poor nutrition that causes disease.- * Disease causing poor nutrition is a disease that causes poor nutrition.
- A hard-working man is a man who works hard.
- * A hard working man is a working man who is tough; however, this sense of hard is rarely used before other adjectives.
- A man-eating shark is a shark that eats humans.
- * A man eating shark is a man who is eating shark meat.Three-hundred-year-old trees are an indeterminate number of trees that are each 300 years old.
- * Three hundred-year-old trees are three trees that are each 100 years old.
- * Three hundred year-old trees are 300 trees that are each a year old.
Use in computing
Hyphen-minuses
In the ASCII character encoding, the hyphen is character 4510. As Unicode is identical to ASCII for all encodings up to 12710, the number 4510 is also assigned to this character in Unicode, where it is denoted as. Unicode has, in addition, other encodings for minus and hyphen characters: and, respectively. The unambiguous at U+2010 is generally inconvenient to enter on most keyboards and the glyphs for this hyphen and the hyphen-minus are identical in most fonts. Consequently, use of the hyphen-minus as the hyphen character is very common. Even the Unicode Standard regularly uses the hyphen-minus rather than the U+2010 hyphen.The hyphen-minus has limited use in indicating subtraction; for example, compare and — in most typefaces, the glyph for hyphen-minus will not have the optimal width, thickness, or vertical position, whereas the minus character is typically designed so that it does. Nevertheless, in many spreadsheet and programming applications the hyphen-minus must be typed to indicate subtraction, as use of the Unicode minus sign will not be recognised.
The hyphen-minus is often used instead of dashes or minus signs in situations where the latter characters are unavailable, where they take effort to enter, or when the writer is unaware of the distinction. Consequently, some writers use two or three hyphen-minuses to represent an em dash. In the TeX typesetting languages, a single hyphen-minus renders a hyphen, a single hyphen-minus in math mode renders a minus sign, two hyphen-minuses renders an en dash, and three hyphen-minuses renders an em dash.
The hyphen-minus character is also often used when specifying command-line options. The character is usually followed by one or more letters that indicate specific actions. Typically it is called a dash or switch in this context. Various implementations of the getopt function to parse command-line options additionally allow the use of two hyphen-minus characters,, to specify long option names that are more descriptive than their single-letter equivalents. Another use of hyphens is that employed by programs written with pipelining in mind: a single hyphen may be recognized in lieu of a filename, with the hyphen then serving as an indicator that a standard stream, instead of a file, is to be worked with.
Soft and hard hyphens
Although software can often automatically make decisions on when to hyphenate a word at a line break, it is also sometimes useful for the user to be able to insert cues for those decisions. For this purpose, the concept of a soft hyphen was introduced, allowing such manual specification of a place where a hyphenated break is allowed but not forced. That is, it does not force a line break in an inconvenient place when the text is later reflowed.Soft hyphens are inserted into the text at the positions where hyphenation may occur. It can be a tedious task to insert the soft hyphens by hand, and tools using hyphenation algorithms are available that do this automatically. Current modules of the Cascading Style Sheets standard provide language-specific hyphenation dictionaries.
In contrast, a hyphen that is always displayed and printed is called a "hard hyphen". This can be a Unicode hyphen, a hyphen-minus, or a nonbreaking hyphen. Confusingly, the term is sometimes limited to nonbreaking hyphens.
Nonbreaking hyphens
The word segmentation rules of most text systems consider a hyphen to be a word boundary and a valid point at which to break a line when flowing text. This is not always desirable; it could lead to ambiguity, it can split off an ending as in "nth", and it is inappropriate in some languages other than English. The non-breaking hyphen, nonbreaking hyphen, or no-break hyphen looks identical to the regular hyphen but word processors do not break words at it. The nonbreaking space exists for similar reasons."Unicode hyphen"
Because the conventional hyphen-minus mark on keyboards is ambiguous, in addition the Unicode consortium allocated codepoints for an unambiguous minus and an unambiguous hyphen. The Unicode hyphen is seldom used. Even the Unicode Standard uses U+002D instead of U+2010 in its text.Use in date notation
Use of hyphens to delineate the parts of a written date is specified in the international standard ISO 8601. Thus, for example, 1789-07-14 is the standard way of writing the date of Bastille Day. This standard has been transposed as European Standard EN 28601 and has been incorporated into various national typographic style guides. Now all official European Union documents use this style. This is also the typical date format used in large parts of Europe and Asia, although sometimes with other separators than the hyphen.This method has gained influence within North America, as most common computer file systems make the use of slashes in file names difficult or impossible. MS-DOS, OS/2 and Windows use to introduce and separate switches to shell commands, and on both Windows and Unix-like systems slashes in a filename introduce subdirectories which may not be desirable. Besides encouraging use of dashes, the Y-M-D order and zero-padding of numbers less than 10 are also copied from ISO 8601 to make the filenames sort by date order.
Unicode
Unicode has multiple hyphen characters:- , a character of multiple uses
- for medieval texts
Unicode distinguishes the hyphen from the general interpunct. The characters below do not have the Unicode property of "Hyphen" despite their names:
*