Stereoscopy, stereoscopic imaging or 3-D (three-dimensional) imaging is any technique capable of recording three-dimensional visual information or creating the illusion of depth in an image. The illusion of depth in a photograph, movie, or other two-dimensional image is created by presenting a slightly different image to each eye. Many 3D displays use this method to convey images. It was first invented by Sir Charles Wheatstone in 1840. Stereoscopy is used in photogrammetry and also for entertainment through the production of stereograms. Stereoscopy is useful in viewing images rendered from large multi-dimensional data sets such as are produced by experimental data. Modern industrial three dimensional photography may use 3D scanners to detect and record 3 dimensional information. The three-dimensional depth information can be reconstructed from two images using a computer by corresponding the pixels in the left and right images. Solving the Correspondence problem in the field of Computer Vision aims to create meaningful depth information from two images.
Traditional stereoscopic photography consists of creating a 3-D illusion starting from a pair of 2-D images. The easiest way to create depth perception in the brain is to provide the eyes of the viewer with two different images, representing two perspectives of the same object, with a minor deviation similar to the perspectives that both eyes naturally receive in binocular vision. If eyestrain and distortion are to be avoided, each of the two 2-D images preferably should be presented to each eye of the viewer so that any object at infinite distance seen by the viewer should be perceived by that eye while it is oriented straight ahead, the viewer's eyes being neither crossed nor diverging. When the picture contains no object at infinite distance, such as a horizon or a cloud, the pictures should be spaced correspondingly closer together.
Little or no additional image processing is required. Under some circumstances, such as when a pair of images is presented for crossed or diverged eye viewing, no device or additional optical equipment is needed.
The principal advantages of side-by-side viewers is that there is no diminution of brightness so images may be presented at very high resolution and in full spectrum color. The ghosting associated with polarized projection or when color filtering is used is totally eliminated. The images are discretely presented to the eyes and visual center of the brain, with no co-mingling of the views. The recent advent of wider HD and computer flat screens has made wider 3D digital images practical in this side by side mode, which hitherto has been used mainly with paired photos or in print form.
Two separate images are printed side-by-side. When viewed without a stereoscopic viewer the user is required to force his eyes either to cross, or to diverge, so that the two images appear to be three. Then as each eye sees a different image, the effect of depth is achieved in the central image of the three.
The stereoscope offers several advantages:
Stereograms cards are frequently used by orthoptists and vision therapists in the treatment of many binocular vision and accommodative disorders.
In the 1940s, a modified and miniaturized variation of this technology was introduced as the View-Master. Pairs of stereo views are printed on translucent film which is then mounted around the edge of a cardboard disk, images of each pair being diametrically opposite. A lever is used to move the disk so as to present the next image pair. A series of seven views can thus be seen on each card when it was inserted into the View-Master viewer. These viewers were available in many forms both non-lighted and self-lighted and may still be found today. One type of material presented is children's fairy tale story scenes or brief stories using popular cartoon characters. These use photographs of three dimensional model sets and characters. Another type of material is a series of scenic views associated with some tourist destination, typically sold at gift shops located at the attraction.
Another important development in the late 1940s was the introduction of the Stereo Realist camera and viewer system. Using color slide film, this equipment made stereo photography available to the masses and caused a surge in its popularity. The Stereo Realist and competing products can still be found (in estate sales and elsewhere) and utilized today.
Low-cost folding cardboard viewers with plastic lenses have been used to view images from a sliding card and have been used by computer technical groups as part of annual convention proceedings. These have been supplanted by the DVD recording and display on a television set. By exhibiting moving images of rotating objects a three dimensional effect is obtained through other than stereoscopic means.
An advantage offered by transparency viewing is that a wider field of view may be presented since images, being illuminated from the rear, may be placed much closer to the lenses. Note that with simple viewers the images are limited in size as they must be adjacent and so the field of view is determined by the distance between each lens and its corresponding image.
Good quality wide angle lenses are quite expensive and they are not found in most stereo viewers.
The user typically wears a helmet or glasses with two small LCD or OLED displays with magnifying lenses, one for each eye. The technology can be used to show stereo films, images or games, but it can also be used to create a virtual display. Head-mounted displays may also be coupled with head-tracking devices, allowing the user to "look around" the virtual world by moving their head, eliminating the need for a separate controller. Performing this update quickly enough to avoid inducing nausea in the user requires a great amount of computer image processing. If six axis position sensing (direction and position) is used then wearer may move about within the limitations of the equipment used. Owing to rapid advancements in computer graphics and the continuing miniaturization of video and other equipment these devices are beginning to become available at more reasonable cost.
Head-mounted or wearable glasses may be used to view a see-through image imposed upon the real world view, creating what is called augmented reality. This is done by reflecting the video images through partially reflective mirrors. The real world view is seen through the mirrors' reflective surface. Experimental systems have been used for gaming, where virtual opponents may peek from real windows as a player moves about. This type of system is expected to have wide application in the maintenance of complex systems, as it can give a technician what is effectively "x-ray vision" by combining computer graphics rendering of hidden elements with the technician's natural vision. Additionally, technical data and schematic diagrams may be delivered to this same equipment, eliminating the need to obtain and carry bulky paper documents.
Augmented stereoscopic vision is also expected to have applications in surgery, as it allows the combination of radiographic data (CAT scans and MRI imaging) with the surgeon's vision. These are not real 3-D glasses
There are several commercial systems offering products like the above, and one can also put one together by oneself using instructions on the GeoWall Consortium site – they refer to such a system as a GeoWall.
Real D Cinema System (used recently with the sterescopic Disney movie, "Chicken Little 3D") uses electronically driven circular polarizers that alternate between left- and right- handedness, and does so in sync with the left or right image being displayed by the (digital) movie projector.
Anaglyph images have seen a recent resurgence due to the presentation of images on the internet. Where traditionally, this has been a largely black & white format, recent digital camera and processing advances have brought very acceptable color images to the internet and DVD field. With the online availability of low cost paper glasses with improved red-cyan filters, and even better plastic framed glasses, the field is growing fast. Scientific images, where depth perception is useful, include the presentation of complex multi-dimensional data sets and stereographic images from (for example) the surface of Mars, but due to recent release of 3D DVDs, they are increasingly used for entertainment. Anaglyph images are much easier to view than either parallel sighting or crossed eye stereograms, although the latter types offer bright and accurate color rendering,particularly in the red component, which is muted, or desaturated with even the best color anaglyphs. A compensating technique, commonly known as Anachrome, which uses a slightly more transparent cyan filter in the patented glasses associated with the technique. Process reconfigures the typical anaglyph image to have less parallax.
A recent variation on the anaglyph technique is called "Anachrome method". This approach is an attempt to provide images that look fairly normal without glasses as 2D images to be "compatible" for posting in conventional websites or magazines. The 3D effect is generally more subtle, as the images are shot with a narrower stereo base, (the distance between the camera lenses). Pains are taken to adjust for a better overlay fit of the two images, which are layered one on top of another. Only a few pixels of non-registration give the depth cues. The range of color is perhaps three times wider in Anachrome due to the deliberate passage of a small amount of the red information through the cyan filter. Warmer tones can be boosted, and this provides warmer skin tones and vividness.
The Red Eye Method practically eliminates the ghosting caused by the slow decay of the green and blue P22-type phosphors typically used in conventional CRT monitors. This method relies solely on the red component of the RGB image being displayed, with the green and blue component of the image being suppressed.
The claimed benefit of this method include: little or no ghosting; excellent image contrast; sharp image detail; simple method implementation; and being easy for the accommodation of the eyes.
More recently, random-dot autostereograms have been created using computers to hide the different images in a field of apparently random noise, so that until viewed by diverging the eyes, the subject of the image remains a mystery. A popular example of this is the Magic Eye series, a collection of stereograms based on distorted colorful and interesting patterns instead of random noise.
The widely accepted explanation of the apparent motion with depth is that a reduction in retinal illumination (relative to the fellow eye) yields a corresponding delay in signal transmission, imparting instantaneous spatial disparity to moving objects. This occurs because the eye, and hence the brain, respond more quickly to brighter objects than to dimmer ones.
So, if the brightness of the pendulum is greater in the left eye than in the right, the retinal signals from the left eye will reach the brain slightly ahead of those from the right eye making it seem like the pendulum seen by the right eye is lagging behind its counterpart in the left eye. This difference in position over time is interpreted by the brain as motion with depth: No motion, no depth.
The ultimate effect of this, with appropriate scene composition, is the illusion of motion with depth. Object motion must be maintained for most conditions and is effective only for very limited "real-world" scenes.
Another technique, for example used by the X3D company, is simply to cover the LCD with two layers, the first being closer to the LCD than the second, by some millimeters. The two layers are transparent with black strips, each strip about one millimeter wide. One layer has its strips about ten degrees to the left, the other to the right. This allows seeing different pixels depending on the viewer's position.
This method, possibly the simplest stereogram viewing technique, is to simply alternate between the left and right images of a stereogram. In a web browser, this can easily be accomplished with an animated .gif image, flash applet or a specialized java applet. Most people can get a crude sense of dimensionality from such images, due to persistence of vision and parallax. Closing one eye and moving the head from side-to-side helps to understand why this works. Objects that are closer appear to move more than those further away.
This effect may also be observed by a passenger in a vehicle or low-flying aircraft, where distant hills or tall buildings appear in three-dimensional relief, a view not seen by a static observer as the distance is beyond the range of effective binocular vision.
Advantages of the wiggle viewing method include:
Disadvantages of the "wiggle" method:
Most wiggle images use only two images, leading to an annoyingly jerky image. A smoother image, more akin to a motion picture image where the camera is moved back and forth, can be composed by using several intermediate images (perhaps with synthetic motion blur) and longer image residency at the end images to allow inspection of details.
Although the "wiggle" method is an excellent way of previewing stereoscopic images, it cannot actually be considered a true three-dimensional stereoscopic format. To experience binocular depth perception as made possible with true stereoscopic formats, each eyeball must be presented with a different image at the same time – this is not the case with "wiggling" stereo. The apparent "stereo like effect" comes from syncing the timing of the wiggle and the amount of parallax to the processing done by the visual cortex. Three or five images with good parallax produce a much better effect than simple left and right images. Recent research indicates that binocular vision is the results of timing and phase disparities in the visual cortex. See the references below on binocular disparity as well as timing and phase differences in a process called binocular rivalry. By timing the wiggle just right and using the right amount of parallax we can fool the neural cortex into registering the sensation of binocular vision.
How far you are intending to view the pictures from requires a certain separation between the cameras. This separation is called stereo base or stereo base line and results from the ratio of the distance to the image to the distance between your eyes. The mean interpupillary distance (IPD) is 63 mm (about 2.5 inches), but varies with age, race and gender. The vast majority of adults have IPDs in the range 50–75 mm. Almost all adults are in the range 45–80 mm. The minimum IPD for children as young as five is around 40 mm.
For example if you are going to view a stereo image on your computer monitor from a distance of 1000 mm you will have an eye to view ratio of 1000/63 or about 16. To set your cameras the correct distance apart you take the distance to the subject (say a person at a distance from the cameras of 3 metres) and divide by 16 which gives you a stereo base of 188 mm between the cameras.
If you intend to view the stereo image from the same distance as it is captured (e.g. a subject photographed three meters away, projected on a movie screen at a distance from the viewer of three meters) then the stereo base separation will be the same as the distance between the viewers eyes (about 63 mm).
In the 1950s, stereoscopic photography regained popularity when a number of manufacturers began introducing stereoscopic cameras to the public. The new cameras were developed to use 135 film, which had gained popularity after the close of World War II. Many of the conventional cameras used the film for 35 mm transparency slides, and the new stereoscopic cameras utilized the film to make stereoscopic slides. The Stereo Realist camera was the most popular, and the 35mm picture format became the standard by which other stereo cameras were designed. The stereoscopic cameras were marketed with special viewers that allowed for the use of such slides, which were similar to View-Master reels but offered a much larger image. With these cameras the public could easily create their own stereoscopic memories. Although their popularity has waned somewhat, these cameras are still in use today.
The 1980s saw a minor revival of stereoscopic photography extent when point-and-shoot stereo cameras were introduced. These cameras suffered from poor optics and plastic construction, so they never gained the popularity of the 1950s stereo cameras. Over the last few years they have been improved upon and now produce good images.
The beginning of the 21st century marked the coming of the age of digital photography. Stereo lenses were introduced which could turn an ordinary film camera into a stereo camera by using a special double lens to take two images and direct them through a single lens to capture them side-by-side on the film. Although current digital stereo cameras cost thousands of dollars, cheaper models also exist, for example those produced by the company Loreo. It is also possible to create a twin camera rig, together with a "shepherd" device to synchronize the shutter and flash of the two cameras. By mounting two cameras on a bracket, spaced a bit, with a mechanism to make both take pictures at the same time. Newer cameras are even being used to shoot "step video" 3D slide shows with many pictures almost like a 3D motion picture if viewed properly. A modern camera can take five pictures per second, with images that greatly exceed HDTV resolution.
The side-by-side method is extremely simple to create, but it can be difficult or uncomfortable to view without optical aids. One such aid for non-crossed images is the modern Pokescope. Traditional stereoscopes such as the Holmes can be used as well. Cross view technique now has the simple Perfect-Chroma cross viewing glasses to facilitate viewing.
A single digital camera can also be used if the subject remains perfectly still (such as an object in a museum display). Two exposures are required. The camera can be moved on a sliding bar for offset, or with practice, the photographer can simply shift the camera while holding it straight and level. In practice the hand-held method works very well. This method of taking stereo photos is sometimes referred to as the "Cha-Cha" method.
A good rule of thumb is to shift sideways 1/30th of the distance to the closest subject for 'side by side' display, or just 1/60th if the image is to be also used for color anaglyph or anachrome image display. For example, if you are taking a photo of a person in front of a house, and the person is thirty feet away, then you should move the camera 1 foot between shots.
The stereo effect is not significantly diminished by slight pan or rotation between images. In fact slight rotation inwards (also called 'toe in') can be beneficial. Bear in mind that both images should show the same objects in the scene (just from different angles) - if a tree is on the edge of one image but out of view in the other image, then it will appear in a ghostly, semi-transparent way to the viewer, which is distracting and uncomfortable. Therefore, you can either crop the images so they completely overlap, or you can 'toe-in' the cameras so that the images completely overlap without having to discard any of the images. However, be a little cautious - too much 'toe-in' can cause eye strain for reasons best described here.
For making stereo images of a distant object (e.g., a mountain with foothills), one can separate the camera positions by a larger distance than usual. This will enhance the depth perception of these distant objects, but is not suitable for use when foreground objects are present. In the red-cyan anaglyphed example at right, a ten-meter baseline atop the roof ridge of a house was used to image the mountain. The two foothill ridges are about four miles (6.5 km) distant and are separated in depth from each other and the background. The baseline is still too short to resolve the depth of the two more distant major peaks from each other. Owing to various trees that appeared in only one of the images the final image had to be severely cropped at each side and the bottom.
In the wider image, taken from a different location, a single camera was walked about one hundred feet (30 m) between pictures. The images were converted to monochrome before combination.
|
|