Stereoscopy
Stereoscopy, also called stereoscopics or stereo imaging, refers to making images appear 3D. The most popular kind of stereoscopy is two-view stereoscopy, which creates partial depth perception in an image from a set of two two-dimensional images by using binocular disparity. The word stereoscopy derives. Any stereoscopic image is called a stereogram. Originally, stereogram referred to a pair of two-dimensional images that could be viewed using a stereoscope.
Most stereoscopic methods present a pair of two-dimensional images to the viewer. The left image is presented to the left eye and the right image is presented to the right eye. When viewed, the human brain perceives the images as a single 3D view, giving the viewer the perception of 3D depth. However, the 3D effect lacks proper focal depth, which gives rise to the vergence-accommodation conflict.
Stereoscopy is distinguished from other types of 3D displays that display an image in three full dimensions, allowing the observer to increase information about the 3-dimensional objects being displayed by head and eye movements.
Background
Stereoscopy creates the impression of three-dimensional depth. The most popular type of stereoscopy, two-view stereoscopy, creates the impression from a pair of two-dimensional images. Human vision, including the perception of depth, is a complex process, which only begins with the acquisition of visual information taken in through the eyes; much processing happens in the brain, as it strives to make sense of the raw information. One of the functions that occur within the brain as it interprets what the eyes see is assessing the relative distances of objects from the viewer, and the depth dimension of those objects. The cues that the brain uses to gauge relative distances and depth in a perceived scene include:- Binocular disparity - The differences in images in each eye caused by the horizontal distance between the two eyes
- Vergence
- Accommodation
- Occlusion
- Subtended visual angle of an object of known size
- Linear perspective
- Vertical position
- Haze or contrast, saturation, and color, greater distance generally being associated with greater haze, desaturation, and a shift toward blue
- Change in size of textured pattern detail
Although the term "3D" is ubiquitously used, the presentation of dual 2D images is distinctly different from displaying an image in three full dimensions by using a volumetric display, or recreating a light field. The most notable difference is that, in the case of two-view 3D displays, the observer's head and eye movement do not change the information received about the 3-dimensional objects being viewed. Volumetric display do not have this limitation.
Most 3D displays use the two-view stereoscopic method to convey images. The method was first invented by Sir Charles Wheatstone in 1838,
and improved by Sir David Brewster who made the first portable 3D viewing device.
Wheatstone originally used his stereoscope with drawings because photography was not yet available, yet his original paper seems to foresee the development of a realistic imaging method:
For the purposes of illustration I have employed only outline figures, for had either shading or colouring been introduced it might be supposed that the effect was wholly or in part due to these circumstances, whereas by leaving them out of consideration no room is left to doubt that the entire effect of relief is owing to the simultaneous perception of the two monocular projections, one on each retina. But if it be required to obtain the most faithful resemblances of real objects, shadowing and colouring may properly be employed to heighten the effects. Careful attention would enable an artist to draw and paint the two component pictures, so as to present to the mind of the observer, in the resultant perception, perfect identity with the object represented. Flowers, crystals, busts, vases, instruments of various kinds, &c., might thus be represented so as not to be distinguished by sight from the real objects themselves.
Stereoscopy is used in photogrammetry and also for entertainment through the production of stereograms. Stereoscopy is useful in viewing images rendered from large multi-dimensional data sets such as are produced by experimental data. Modern industrial three-dimensional photography may use 3D scanners to detect and record three-dimensional information. The three-dimensional depth information can be reconstructed from two images using a computer by correlating the pixels in the left and right images. Solving the Correspondence problem in the field of Computer Vision aims to create meaningful depth information from two images.
Visual requirements
There are 3 components of binocular vision involved in viewing two-view stereo images:- Simultaneous perception
- Fusion
- Stereopsis
File:Niagra Falls Suspension Bridge by Saul Davis.jpg|thumb|Saul Davis, New Suspension Bridge, Niagara Falls, Canada,, albumen print stereograph, , National Gallery of Art Library, Washington, DC
Stereoscopic viewing may be artificially created by the viewer's brain, as demonstrated with the Van Hare Effect, where the brain perceives stereo images even when the paired photographs are identical. This "false dimensionality" results from the developed stereoacuity in the brain, allowing the viewer to fill in depth information even when few if any 3D cues are actually available in the paired images.
Side-by-side
Traditional stereoscopic photography consists of creating a 3D illusion starting from a pair of 2D images, a stereogram. The easiest way to enhance depth perception in the brain is to provide the eyes of the viewer with two different images, representing two perspectives of the same object, with a minor deviation equal or nearly equal to the perspectives that both eyes naturally receive in binocular vision.File:Strereoscope- Western Wall Jerusalem.jpg|thumbnail|A stereoscopic pair of images and a combined anaglyph that colors one perspective red and the other cyan.
To avoid eyestrain and distortion, each of the two 2D images should be presented to the viewer so that any object at infinite distance is perceived by the eye as being straight ahead, the viewer's eyes being neither crossed nor diverging. When the picture contains no object at infinite distance, such as a horizon or a cloud, the pictures should be spaced correspondingly closer together.
The advantages of side-by-side viewers is the lack of diminution of brightness, allowing the presentation of images at very high resolution and in full spectrum color, simplicity in creation, and little or no additional image processing is required. Under some circumstances, such as when a pair of images is presented for freeviewing, no device or additional optical equipment is needed.
The principal disadvantage of side-by-side viewers is that large image displays are not practical and resolution is limited by the lesser of the display medium or human eye. This is because as the dimensions of an image are increased, either the viewing apparatus or viewer themselves must move proportionately further away from it in order to view it comfortably. Moving closer to an image in order to see more detail would only be possible with viewing equipment that adjusted to the difference. Image:3dviewer.gif|thumb|Printable cross eye viewer
Freeviewing
Freeviewing is viewing a side-by-side image pair without using a viewing device.Two methods are available to freeview: parallel viewing and cross-eyed viewing.
- The parallel viewing method uses an image pair with the left-eye image on the left and the right-eye image on the right. The fused three-dimensional image appears larger and more distant than the two actual images, making it possible to convincingly simulate a life-size scene. The viewer attempts to look through the images with the eyes substantially parallel, as if looking at the actual scene. This can be difficult with normal vision because eye focus and binocular convergence are habitually coordinated. One approach to decoupling the two functions is to view the image pair extremely close up with completely relaxed eyes, making no attempt to focus clearly but simply achieving comfortable stereoscopic fusion of the two blurry images by the "look-through" approach, and only then exerting the effort to focus them more clearly, increasing the viewing distance as necessary. Regardless of the approach used or the image medium, for comfortable viewing and stereoscopic accuracy the size and spacing of the images should be such that the corresponding points of very distant objects in the scene are separated by the same distance as the viewer's eyes, but not more; the average interocular distance is about 63 mm. Viewing much more widely separated images is possible, but because the eyes never diverge in normal use it usually requires some previous training and tends to cause eye strain.
- The cross-eyed viewing method swaps the left and right eye images so that they will be correctly seen cross-eyed, the left eye viewing the image on the right and vice versa. The fused three-dimensional image appears to be smaller and closer than the actual images, so that large objects and scenes appear miniaturized. This method is usually easier for freeviewing novices. As an aid to fusion, a fingertip can be placed just below the division between the two images, then slowly brought straight toward the viewer's eyes, keeping the eyes directed at the fingertip; at a certain distance, a fused three-dimensional image should seem to be hovering just above the finger. Alternatively, a piece of paper with a small opening cut into it can be used in a similar manner; when correctly positioned between the image pair and the viewer's eyes, it will seem to frame a small three-dimensional image.
Stereoscopically fusing two separate images without the aid of mirrors or prisms while simultaneously keeping them in sharp focus without the aid of suitable viewing lenses inevitably requires an unnatural combination of eye vergence and accommodation. Simple freeviewing therefore cannot accurately reproduce the physiological depth cues of the real-world viewing experience. Different individuals may experience differing degrees of ease and comfort in achieving fusion and good focus, as well as differing tendencies to eye fatigue or strain.