Floyd–Warshall algorithm

In computer science, the Floyd–Warshall algorithm is an algorithm for finding shortest paths in a directed weighted graph with positive or negative edge weights. A single execution of the algorithm will find the lengths of shortest paths between all pairs of vertices. Although it does not return details of the paths themselves, it is possible to reconstruct the paths with simple modifications to the algorithm. Versions of the algorithm can also be used for finding the transitive closure of a relation, or widest paths between all pairs of vertices in a weighted graph.

History and naming

The Floyd–Warshall algorithm is an example of dynamic programming, and was published in its currently recognized form by Robert Floyd in 1962. However, it is essentially the same as algorithms previously published by Bernard Roy in 1959 and also by Stephen Warshall in 1962 for finding the transitive closure of a graph, and is closely related to Kleene's algorithm for converting a deterministic finite automaton into a regular expression, with the difference being the use of a min-plus semiring. The modern formulation of the algorithm as three nested for-loops was first described by Peter Ingerman, also in 1962.

Algorithm

The Floyd–Warshall algorithm compares many possible paths through the graph between each pair of vertices. It is guaranteed to find all shortest paths and is able to do this with comparisons in a graph, even though there may be edges in the graph. It does so by incrementally improving an estimate on the shortest path between two vertices, until the estimate is optimal.
Consider a graph with vertices numbered 1 through . Further consider a function that returns the length of the shortest possible path from to using vertices only from the set as intermediate points along the way. Now, given this function, our goal is to find the length of the shortest path from each to each using any vertex in. By definition, this is the value, which we will find recursively.
Observe that must be less than or equal to : we have more flexibility if we are allowed to use the vertex. If is in fact less than, then there must be a path from to using the vertices that is shorter than any such path that does not use the vertex. Since there are no negative cycles this path can be decomposed as:
And of course, these must be a shortest such path, otherwise we could further decrease the length. In other words, we have arrived at the recursive formula:
The base case is given by
where denotes the weight of the edge from to if one exists and ∞ otherwise.
These formulas are the heart of the Floyd–Warshall algorithm. The algorithm works by first computing for all pairs for, then, then, and so on. This process continues until, and we have found the shortest path for all pairs using any intermediate vertices. Pseudocode for this basic version follows.

Pseudocode

let dist be a |V| × |V| array of minimum distances initialized to ∞
for each edge do
dist = w // The weight of the edge
for each vertex v do
dist = 0
for k from 1 to |V|
for i from 1 to |V|
for j from 1 to |V|
if dist > dist + dist
dist = dist + dist
end if
Note: A common mistake in implementing the Floyd–Warshall algorithm is to misorder the triply nested loops. The incorrect IJK and IKJ algorithms do not give correct solutions for some instance. However, we can prove that if these are repeated three times, we obtain the correct solutions.

Example

The algorithm above is executed on the graph on the left below:
Prior to the first recursion of the outer loop, labeled above, the only known paths correspond to the single edges in the graph. At, paths that go through the vertex 1 are found: in particular, the path is found, replacing the path which has fewer edges but is longer. At, paths going through the vertices are found. The red and blue boxes show how the path is assembled from the two known paths and encountered in previous iterations, with 2 in the intersection. The path is not considered, because is the shortest path encountered so far from 2 to 3. At, paths going through the vertices are found. Finally, at, all shortest paths are found.
The distance matrix at each iteration of, with the updated distances in bold, will be:

Behavior with negative cycles

A negative cycle is a cycle whose edges sum to a negative value. There is no shortest path between any pair of vertices, which form part of a negative cycle, because path-lengths from to can be arbitrarily small. For numerically meaningful output, the Floyd–Warshall algorithm assumes that there are no negative cycles. Nevertheless, if there are negative cycles, the Floyd–Warshall algorithm can be used to detect them. The intuition is as follows:

The Floyd–Warshall algorithm iteratively revises path lengths between all pairs of vertices, including where ;
Initially, the length of the path is zero;
A path can only improve upon this if it has length less than zero, i.e. denotes a negative cycle;
Thus, after the algorithm, will be negative if there exists a negative-length path from back to.

Hence, to detect negative cycles using the Floyd–Warshall algorithm, one can inspect the diagonal of the path matrix, and the presence of a negative number indicates that the graph contains at least one negative cycle. However, when a negative cycle is present, during the execution of the algorithm exponentially large numbers on the order of can appear, where is the largest absolute value edge weight in the graph. To avoid integer underflow problems, one should check for a negative cycle within the innermost for loop of the algorithm.

Path reconstruction

The Floyd–Warshall algorithm typically only provides the lengths of the paths between all pairs of vertices. With simple modifications, it is possible to create a method to reconstruct the actual path between any two endpoint vertices. While one may be inclined to store the actual path from each vertex to each other vertex, this is not necessary, and in fact, is very costly in terms of memory. Instead, we can use the shortest-path tree, which can be calculated for each node in time using memory, and allows us to efficiently reconstruct a directed path between any two connected vertices.

Pseudocode

The array holds the penultimate vertex on the path from to :
let dist be a array of minimum distances initialized to
let prev be a array of vertex indices initialized to null
procedure FloydWarshallWithPathReconstruction is
for each edge do
dist = w // The weight of the edge
prev = u
for each vertex v do
dist = 0
prev = v
for k from 1 to |V| do // standard Floyd-Warshall implementation
for i from 1 to |V|
for j from 1 to |V|
if dist > dist + dist then
dist = dist + dist
prev = prev
procedure Path is
if prev = null then
return
path =
while u ≠ v do
v = prev
path.prepend
return path

Time complexity

Let be, the number of vertices. To find all of
from those of
requires [big theta|] operations. Since we begin with
and compute the sequence of matrices,,,, each having a cost of,
the total time complexity of the algorithm is.

Applications and generalizations

The Floyd–Warshall algorithm can be used to solve the following problems:

Transitive closure of directed graphs. In Warshall's original formulation of the algorithm, the graph is unweighted and represented by a Boolean adjacency matrix. Then the addition operation is replaced by logical conjunction and the minimum operation by logical disjunction.
Finding a regular expression denoting the regular language accepted by a finite automaton
Inversion of real matrices
Optimal routing. In this application one is interested in finding the path with the maximum flow between two vertices. This means that, rather than taking minima as in the pseudocode above, one instead takes maxima. The edge weights represent fixed constraints on flow. Path weights represent bottlenecks; so the addition operation above is replaced by the minimum operation.
Fast computation of Pathfinder networks.
Widest paths/Maximum bandwidth paths
Computing canonical form of difference bound matrices
Computing the similarity between graphs
Transitive closure in AND/OR/threshold graphs.

Implementations

Implementations are available for many programming languages.

For C++, in the library
For C#, at
For C#, at
For Java, in the library
For JavaScript, in the Cytoscape library
For Julia, in the package
For MATLAB, in the package
For Perl, in the module
For Python, in the SciPy library or NetworkX library
For R, in packages and
For C, a pthreads,, implementation including a SQLite interface to the data at

Comparison with other shortest path algorithms

For graphs with non-negative edge weights, Dijkstra's algorithm can be used to find all shortest paths from a single vertex with running time. Thus, running Dijkstra starting at each vertex takes time. Since, this yields a worst-case running time of repeated Dijkstra of. While this matches the asymptotic worst-case running time of the Floyd-Warshall algorithm, the constants involved matter quite a lot. When a graph is dense, the Floyd-Warshall algorithm tends to perform better in practice. When the graph is sparse, Dijkstra tends to dominate.
For sparse graphs with negative edges but no negative cycles, Johnson's algorithm can be used, with the same asymptotic running time as the repeated Dijkstra approach.
There are also known algorithms using fast matrix multiplication to speed up all-pairs shortest path computation in dense graphs, but these typically make extra assumptions on the edge weights. In addition, because of the high constant factors in their running time, they would only provide a speedup over the Floyd–Warshall algorithm for very large graphs.