Skeletal formula
The skeletal formula, line-angle formula, bond-line formula or shorthand formula of an organic compound is a type of minimalist structural formula representing a molecule's atoms, bonds and some details of its geometry. The lines in a skeletal formula represent bonds between carbon atoms, unless labelled with another element. Labels are optional for carbon atoms, and the hydrogen atoms attached to them.
An early form of this representation was first developed by organic chemist August Kekulé, while the modern form is closely related to and influenced by the Lewis structure of molecules and their valence electrons. Hence they are sometimes termed Kekulé structures or Lewis–Kekulé structures. Skeletal formulas have become ubiquitous in organic chemistry, partly because they are relatively quick and simple to draw, and also because the curved arrow notation used for discussions of reaction mechanisms and electron delocalization can be readily superimposed.
Several other ways of depicting chemical structures are also commonly used in organic chemistry. For example, conformational structures look similar to skeletal formulae and are used to depict the approximate positions of atoms in 3D space, as a perspective drawing. Other types of representation, such as Newman projection, Haworth projection or Fischer projection, also look somewhat similar to skeletal formulae. However, there are slight differences in the conventions used, and the reader needs to be aware of them in order to understand the structural details encoded in the depiction. While skeletal and conformational structures are also used in organometallic and inorganic chemistry, the conventions employed also differ somewhat.
The skeleton
Terminology
The skeletal structure of an organic compound is the series of atoms bonded together that form the essential structure of the compound. The skeleton can consist of chains, branches and/or rings of bonded atoms. Skeletal atoms other than carbon or hydrogen are called heteroatoms.The skeleton has hydrogen and/or various substituents bonded to its atoms. Hydrogen is the most common non-carbon atom that is bonded to carbon and, for simplicity, is not explicitly drawn. In addition, carbon atoms are not generally labelled as such directly, whereas heteroatoms are always explicitly noted as such
Heteroatoms and other groups of atoms that give rise to relatively high rates of chemical reactivity, or introduce specific and interesting characteristics in the spectra of compounds are called functional groups, as they give the molecule a function. Heteroatoms and functional groups are collectively called "substituents", as they are considered to be a substitute for the hydrogen atom that would be present in the parent hydrocarbon of the organic compound.
Basic structure
As in Lewis structures, covalent bonds are indicated by line segments, with a doubled or tripled line segment indicating double or triple bonding, respectively. Likewise, skeletal formulae indicate formal charges associated with each atom, with lone pairs usually being optional. In fact, skeletal formulae can be thought of as abbreviated Lewis structures that observe the following simplifications:- Carbon atoms are represented by the vertices of line segments. For clarity, methyl groups are often explicitly written out as Me or CH3, while cumulene carbons are frequently represented by a heavy center dot.
- Hydrogen atoms attached to carbon are implied. An unlabeled vertex is understood to represent a carbon attached to the number of hydrogens required to satisfy the octet rule, while a vertex labeled with a formal charge and/or nonbonding electron is understood to have the number of hydrogen atoms required to give the carbon atom these indicated properties. Optionally, acetylenic and formyl hydrogens can be shown explicitly for the sake of clarity.
- Hydrogen atoms attached to a heteroatom are shown explicitly. The heteroatom and hydrogen atoms attached thereto are usually shown as a single group without explicitly showing the hydrogen–heteroatom bond. Heteroatoms with simple alkyl or aryl substituents, like methoxy or dimethylamino, are sometimes shown in the same way, by analogy.
- Lone pairs on carbene carbons must be indicated explicitly while lone pairs in other cases are optional and are shown only for emphasis. In contrast, formal charges and unpaired electrons on main-group elements are always explicitly shown.
Contemporary graphical conventions
Since skeletal structures were introduced in the latter half of the 19th century, their appearance has undergone considerable evolution. The graphical conventions in use today date to the 1980s. Thanks to the adoption of the ChemDraw software package as a de facto industry standard, these conventions have been nearly universal in the chemical literature since the late 1990s. A few minor conventional variations, especially with respect to the use of stereobonds, continue to exist as a result of differing US, UK and European practice, or as a matter of personal preference. As another minor variation between authors, formal charges can be shown with the plus or minus sign in a circle or without the circle. The set of conventions that are followed by most authors is given below, along with illustrative examples.Implicit carbon and hydrogen atoms
For example, the skeletal formula of hexane is shown below. The carbon atom labeled C1 appears to have only one bond, so there must also be three hydrogens bonded to it, in order to make its total number of bonds four. The carbon atom labelled C3 has two bonds to other carbons and is therefore bonded to two hydrogen atoms as well. A Lewis structure and ball-and-stick model of the actual molecular structure of hexane, as determined by X-ray crystallography, are shown for comparison.It does not matter which end of the chain one starts numbering from, as long as consistency is maintained when drawing diagrams. The condensed formula or the IUPAC name will confirm the orientation. Some molecules will become familiar regardless of the orientation.
Explicit heteroatoms and hydrogen atoms
All atoms that are not carbon or hydrogen are signified by their chemical symbol, for instance Cl for chlorine, O for oxygen, Na for sodium, and so forth. In the context of organic chemistry, these atoms are commonly known as heteroatoms.Any hydrogen atoms bonded to heteroatoms are drawn explicitly. In ethanol, C2H5OH, for instance, the hydrogen atom bonded to oxygen is denoted by the symbol H, whereas the hydrogen atoms which are bonded to carbon atoms are not shown directly.
Lines representing heteroatom-hydrogen bonds are usually omitted for clarity and compactness, so a functional group like the hydroxyl group is most often written −OH instead of −O−H. These bonds are sometimes drawn out in full in order to accentuate their presence when they participate in reaction mechanisms.
Shown below for comparison are a skeletal formula, its Lewis structure and its ball-and-stick model of the actual 3D structure of the ethanol molecule in the gas phase, as determined by microwave spectroscopy.
Pseudoelement symbols
There are also symbols that appear to be chemical element symbols, but represent certain very common substituents or indicate an unspecified member of a group of elements. These are called pseudoelement symbols or organic elements and are treated like univalent "elements" in skeletal formulae. A list of common pseudoelement symbols:General symbols
- X for any halogen atom
- L or Ln for a ligand or ligands
- M or Met for any metal atom
- E or El for any electrophile
- Nu for any nucleophile
- Z for conjugating electron-withdrawing groups
- D for deuterium
- T for tritium
Alkyl groups
- R for any alkyl group or even any organyl group
- Me for the methyl group
- Et for the ethyl group
- Pr, n-Pr, or ''nPr for the propyl group
- i''-Pr or ''iPr for the isopropyl group
- All for the allyl group
- Bu, n''-Bu or ''nBu for the butyl group
- i''-Bu or ''iBu for the isobutyl group
- s''-Bu or ''sBu for the secondary butyl group
- t''-Bu or ''tBu for the tertiary butyl group
- Pn for the pentyl group
- Np or Neo for the neopentyl group
- Cy or Chx for the cyclohexyl group
- Ad for the 1-adamantyl group
- Tr or'' Trt for the trityl group
Aromatic and unsaturated substituents
- Ar for any aromatic substituent
- Het for any heteroaromatic substituent
- Bn or Bzl for the benzyl group
- Dipp for the 2,6-diisopropylphenyl group
- Mes for the mesityl group
- Ph, Φ, or φ for the phenyl group
- Tol for the tolyl group, usually the para isomer
- Is or Tipp for the 2,4,6-triisopropylphenyl group
- An for the anisyl group, usually the para isomer
- Cp for the cyclopentadienyl group
- Cp* for the pentamethylcyclopentadienyl group
- Vi for the vinyl group