Fermat's principle
Fermat's principle, also known as the principle of least time, is the link between ray optics and wave optics. Fermat's principle states that the path taken by a ray between two given points is the path that can be traveled in the least time.
First proposed by the French mathematician Pierre de Fermat in 1662, as a means of explaining the ordinary law of refraction of light, Fermat's principle was initially controversial because it seemed to ascribe knowledge and intent to nature. Not until the 19th century was it understood that nature's ability to test alternative paths is merely a fundamental property of waves. If points A and B are given, a wavefront expanding from A sweeps all possible ray paths radiating from A, whether they pass through B or not. If the wavefront reaches point B, it sweeps not only the ray path from A to B, but also an infinitude of nearby paths with the same endpoints. Fermat's principle describes any ray that happens to reach point B; there is no implication that the ray "knew" the quickest path or "intended" to take that path.
In its original "strong" form, Fermat's principle states that the path taken by a ray between two given points is the path that can be traveled in the least time. In order to be true in all cases, this statement must be weakened by replacing the "least" time with a time that is "stationary" with respect to variations of the path - so that a deviation in the path causes, at most, a second-order change in the traversal time. To put it loosely, a ray path is surrounded by close paths that can be traversed in very close times. It [|can be shown] that this technical definition corresponds to more intuitive notions of a ray, such as a line of sight or the path of a narrow beam.
For the purpose of comparing traversal times, the time from one point to the next nominated point is taken as if the first point were a point-source. Without this condition, the traversal time would be ambiguous; for example, if the propagation time from to were reckoned from an arbitrary wavefront W containing , that time could be made arbitrarily small by suitably angling the wavefront.
Treating a point on the path as a source is the minimum requirement of Huygens's principle, and is part of the [|explanation] of Fermat's principle. But it [|can also be shown] that the geometric construction by which Huygens tried to apply his own principle is simply an invocation of Fermat's principle. Hence all the conclusions that Huygens drew from that construction - including, without limitation, the laws of rectilinear propagation of light, ordinary reflection, ordinary refraction, and the extraordinary refraction of "Iceland crystal" - are also consequences of Fermat's principle.
Derivation
Sufficient conditions
Let us suppose that:- A disturbance propagates sequentially through a medium, without action at a distance;
- During propagation, the influence of the disturbance at any intermediate point P upon surrounding points has a non-zero angular spread, so that a disturbance originating from any point A arrives at any other point B via an infinitude of paths, by which B receives an infinitude of delayed versions of the disturbance at A; and
- These delayed versions of the disturbance will reinforce each other at B if they are synchronized within some tolerance.
The most obvious example of a stationarity in traversal time is a minimum - that is, a path of least time, as in the "strong" form of Fermat's principle. But that condition is not essential to the argument.
Having established that a path of stationary traversal time is reinforced by a maximally wide corridor of neighboring paths, we still need to explain how this reinforcement corresponds to intuitive notions of a ray. But, for brevity in the explanations, let us first define a ray path as a path of stationary traversal time.
A ray as a signal path (line of sight)
If the corridor of paths reinforcing a ray path from A to B is substantially obstructed, this will significantly alter the disturbance reaching B from A - unlike a similar-sized obstruction outside any such corridor, blocking paths that do not reinforce each other. The former obstruction will significantly disrupt the signal reaching B from A, while the latter will not; thus the ray path marks a signal path. If the signal is visible light, the former obstruction will significantly affect the appearance of an object at A as seen by an observer at B, while the latter will not; so the ray path marks a line of sight.In optical experiments, a line of sight is routinely assumed to be a ray path.
A ray as an energy path (beam)
If the corridor of paths reinforcing a ray path from A to B is substantially obstructed, this will significantly affect the energy reaching B from A - unlike a similar-sized obstruction outside any such corridor. Thus the ray path marks an energy path - as does a beam.Suppose that a wavefront expanding from point A passes point P, which lies on a ray path from point A to point B. By definition, all points on the wavefront have the same propagation time from A. Now let the wavefront be blocked except for a window, centered on P, and small enough to lie within the corridor of paths that reinforce the ray path from A to B. Then all points on the unobstructed portion of the wavefront will have, nearly enough, equal propagation times to B, but not to points in other directions, so that B will be in the direction of peak intensity of the beam admitted through the window. So the ray path marks the beam. And in optical experiments, a beam is routinely considered as a collection of rays or as an approximation to a ray.
Analogies
According to the "strong" form of Fermat's principle, the problem of finding the path of a light ray from point A in a medium of faster propagation, to point B in a medium of slower propagation, is analogous to the problem faced by a lifeguard in deciding where to enter the water in order to reach a drowning swimmer as soon as possible, given that the lifeguard can run faster than he can swim. But that analogy falls short of explaining the behavior of the light, because the lifeguard can think about the problem whereas the light presumably cannot. The discovery that ants are capable of similar calculations does not bridge the gap between the animate and the inanimate.In contrast, the above assumptions to hold for any wavelike disturbance and explain Fermat's principle in purely mechanistic terms, without any imputation of knowledge or purpose.
The principle applies to waves in general, including sound waves in fluids and elastic waves in solids. In a modified form, it even works for matter waves: in quantum mechanics, the classical path of a particle is obtainable by applying Fermat's principle to the associated wave - except that, because the frequency may vary with the path, the stationarity is in the phase shift and not necessarily in the time.
Fermat's principle is most familiar, however, in the case of visible light: it is the link between geometrical optics, which describes certain optical phenomena in terms of rays, and the wave theory of light, which explains the same phenomena on the hypothesis that light consists of waves.
Equivalence to Huygens's construction
In this article we distinguish between Huygens's principle, which states that every point crossed by a traveling wave becomes the source of a secondary wave, and Huygens's construction, which is described below.Let the surface be a wavefront at time, and let the surface be the same wavefront at the later time . Let be a general point on. Then, according to Huygens's construction,
- is the envelope, on the forward side of, of all the secondary wavefronts each of which would expand in time from a point on, and
- if the secondary wavefront expanding from point in time touches the surface at point, then and lie on a ray.
The ray direction given by this construction is the radial direction of the secondary wavefront, and may differ from the normal of the secondary wavefront, and therefore from the normal of the primary wavefront at the point of tangency. Hence the ray velocity, in magnitude and direction, is the radial velocity of an infinitesimal secondary wavefront, and is generally a function of location and direction.
Now let be a point on close to, and let be a point on close to. Then, by the construction,
- the time taken for a secondary wavefront from to reach has at most a second-order dependence on the displacement, and
- the time taken for a secondary wavefront to reach from has at most a second-order dependence on the displacement.
So Huygens's construction implicitly defines a ray path as a path of stationary traversal time between successive positions of a wavefront, the time being reckoned from a point-source on the earlier wavefront. This conclusion remains valid if the secondary wavefronts are reflected or refracted by surfaces of discontinuity in the properties of the medium, provided that the comparison is restricted to the affected paths and the affected portions of the wavefronts.
Fermat's principle, however, is conventionally expressed in point-to-point terms, not wavefront-to-wavefront terms. Accordingly, let us modify the example by supposing that the wavefront which becomes surface at time, and which becomes surface at the later time, is emitted from point at time . Let be a point on , and a point on. And let,,, and be given, so that the problem is to find.
If satisfies Huygens's construction, so that the secondary wavefront from is tangential to at, then is a path of stationary traversal time from to. Adding the fixed time from to, we find that is the path of stationary traversal time from to , in accordance with Fermat's principle. The argument works just as well in the converse direction, provided that has a well-defined tangent plane at. Thus Huygens's construction and Fermat's principle are geometrically equivalent.
Through this equivalence, Fermat's principle sustains Huygens's construction and thence all the conclusions that Huygens was able to draw from that construction. In short, "The laws of geometrical optics may be derived from Fermat's principle". With the exception of the Fermat–Huygens principle itself, these laws are special cases in the sense that they depend on further assumptions about the media. Two of them are mentioned under the next heading.