Bitonic sorter
Bitonic mergesort is a parallel algorithm for sorting. It is also used as a construction method for building a sorting network. The algorithm was devised by Ken Batcher. The resulting sorting networks consist of comparators and have a delay of, where is the number of items to be sorted. This makes it a popular choice for sorting large numbers of elements on an architecture which itself contains a large number of parallel execution units running in lockstep, such as a typical GPU.
A sorted sequence is a monotone sequence---that is, a sequence which is either non-decreasing or non-increasing. A sequence is bitonic when it consists of a non-decreasing sequence followed by a non-increasing sequence, i.e. when there exists an index for which
A bitonic sorter can only sort inputs that are bitonic. Bitonic sorters can be used to build a bitonic sort network that can sort arbitrary sequences by using the bitonic sorter with a sort-by-merge scheme, in which partial solutions are merged using bigger sorters.
The following sections present the algorithm in its original formulation, which requires an input sequence whose length is a perfect powers of two. We will therefore let be the integer for which, meaning that the bitonic sorters may be enumerated in order of increasing size by considering the successive values.
Bitonic Sorter
A bitonic sorter for is simply a comparator. This is illustrated by the given box layout, in which X and Y represent the inputs, while H and L represent the higher and lower outputs, respectively.With the sorter for, we can recursively create a sorter of higher order.
For example, consider the following bitonic sorter.
The bitonic sorter consists of two layers: a recombination layer, which recombines the bitonic inputs
into two new bitonic sequences that are each half as long as the original sequence, and a bitonic sort layer
consisting of two bitonic sorters of order, each of which sorts one of the two bitonic sequences
produced by the previous layer.
This structure may be extended recursively for higher values of by ensuring that each comparator always accepts one input from each of the two halves of the bitonic sequence it is meant to help sort. The following illustration depicts these connections schematically.
As you can see, elements of the first half of the input sequence are pairwise compared against corresponding elements of the last
half of the input sequence. Comparing each element of the subsequence
with the element of the other subsequence at the respective index
produces two bitonic subsequences. These two bitonic series can then be fed into the next lower-order bitonic sorter. This can
be done because all elements in the red sequence are guaranteed to be higher than all elements in the blue series.
Correctness of the bitonic sorter
Ken Batcher provided some mathematical proof sketch in his paper.With out loss of generality the bitonic input sequence is assumed to be with. With out loss of generality the sequence can be reversed therefore, we can assume.
Case 1: If then every element of the two subsequences are smaller. In this case and with and therefore and are trivially bitonic.
Case 2: Otherwise there exists a such that the element of the first sub-sequence is bigger than of the second sub-sequence while it is the opposite for. This means that and are true for a specific.
Therefore, we now know that:
1. For the sequences are and
2. For the sequences are defined as the opposite of 1, with and
In the original paper he then claims the following inequalities result from those definitions:
Following from 1:
- For that
- For that
- For that
- For that
- For that
From the paper claims follows that the sequences and are in fact bitonic.
Bitonic Sorting Networks (Bitonic Merge Sort)
A bitonic sorting network is created by using several bitonic sorters. Thesebitonic sorters are recursively used to create two monotonic sequences, one decreasing and one increasing, which are then put into the next stage. This creates
a bitonic series for the next stage, which can then use this bitonic series as a
monotonic series for the next stage. Consider the following example for an
bitonic sort network.
The bitonic sorting network for can be created by using a bitonic
sorter and two sorters. The two sorters create a decreasingly or
increasingly sorted sequence in order to create a bitonic input for the bitonic sorter.
Bitonic sorting networks of a lower order are mostly used for the two pre-sorters;
therefore, a recursive definition of a bitonic sorting network from bitonic sorters
can be described. In the above example, the two bitonic sorting networks are
networks; hence, they are just a comparator. The following figure shows
the overall scheme.
This overall scheme requires the sorter to have an input of sequence that is a power of two. There are, however, possibilities to mitigate this by, for example, using sentinel values.
Pseudocode
The following pseudocode describes the sorting process. In the code,a is the array to be sorted, low is the index of the first item in the sub-array to be sorted, k and count is the number items in the sub-array that are being sorted in this function call. direction is a boolean value that determines whether the sub-array is being sorted into ascending / descending order.The function call
bitonicSort is used to sort a, where n is the number of items in a.function bitonicMerge is
if count > 1 THEN
k ← count / 2
// Compare and swap elements across the halves
for i ← low to low + k - 1 do
// determine if two elements of an are out of order in relation to the direction of sorting.
if '''OR THEN
swap a with a
// Recursively merge both halves
bitonicMerge
bitonicMerge
// This only works when input size is a power of 2.
function bitonicSort is
if count > 1 THEN
k ← count / 2
// Sort first/second half into ascending/descending order
bitonicSort
bitonicSort
// Merge entire sequence in desired order
bitonicMerge'''
Complexity
In this section we assume that our sorter has input elements as previously.Each recursion in a bitonic sorting network adds a sorter of order, which consists of bitonic sorter and the next recursion. As both sub-sorters can be done in parallel, only one level is added for each level in both sub-sorters. Each bitonic sorter has, therefore, one recombination layer and a lower-order bitonic sorter for its recursion. This results in levels per bitonic sorter. Therefore, we can describe this construction's levels as the following sum:.
This sum can be reduced using the Gauss sum formula
Therefore, the number of levels in which each comparison can be done in parallel is given by. Which gives us assuming comparisons can be performed in parallel.
Although the absolute number of comparisons is typically higher than Batcher's odd-even sort, many of the consecutive operations in a bitonic sort retain a locality of reference, making implementations more cache-friendly and typically more efficient in practice.