I'm curious if O(n log n) is the best a linked list can do.
-
Just so that you know, O(n log n) is the lower bound for comparison-based sorts. There are non-comparison-based sorts that can give O(n) performance (e.g. counting sort), but they require additional constraints on the data. – MAK
-
See also stackoverflow.com/q/19738326/12597 – Ian Boyd
-
Note that O(n log(n)) is equivalent to O(c n log(n)), where c is a constant. So one O(n log(n)) algorithm could be three times as fast as another O(n log(n)) algorithm. The question is asking for the fastest algorithm. – tstdmy
15 Answers
It is reasonable to expect that you cannot do any better than O(N log N) in running time.
However, the interesting part is to investigate whether you can sort it in place, stably, what its worst-case behavior is, and so on.
Simon Tatham, of PuTTY fame, explains how to sort a linked list with merge sort. He concludes with the following comments:
Like any self-respecting sort algorithm, this has running time O(N log N). Because this is Mergesort, the worst-case running time is still O(N log N); there are no pathological cases.
Auxiliary storage requirement is small and constant (i.e. a few variables within the sorting routine). Thanks to the inherently different behaviour of linked lists from arrays, this Mergesort implementation avoids the O(N) auxiliary storage cost normally associated with the algorithm.
There is also an example implementation in C that works for both singly and doubly linked lists.
As @Jørgen Fogh mentions below, big-O notation may hide some constant factors that can cause one algorithm to perform better because of memory locality, because of a low number of items, etc.
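For illustration, here is a minimal top-down mergesort sketch for a singly linked list. It is not Tatham's code (his version is bottom-up and uses constant auxiliary storage, while this one uses O(log N) recursion depth), and the node type and field names are assumptions:

#include <stddef.h>

struct node { int value; struct node *next; };   /* assumed node layout */

/* Merge two sorted lists; stable because ties are taken from `a` first. */
static struct node *merge(struct node *a, struct node *b)
{
    struct node *head = NULL, **tail = &head;
    while (a && b) {
        if (a->value <= b->value) { *tail = a; a = a->next; }
        else                      { *tail = b; b = b->next; }
        tail = &(*tail)->next;
    }
    *tail = a ? a : b;
    return head;
}

/* Top-down mergesort: split with slow/fast pointers, sort halves, merge. */
struct node *merge_sort(struct node *list)
{
    if (!list || !list->next)
        return list;
    struct node *slow = list, *fast = list->next;
    while (fast && fast->next) { slow = slow->next; fast = fast->next->next; }
    struct node *second = slow->next;
    slow->next = NULL;                  /* cut the list into two halves */
    return merge(merge_sort(list), merge_sort(second));
}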
4 Comments
listsort, you'll see you can switch by using the parameter int is_double.
listsort C code that supports only singly-linked lists
Depending on a number of factors, it may actually be faster to copy the list to an array and then use Quicksort.
The reason this might be faster is that an array has much better cache performance than a linked list. If the nodes in the list are dispersed in memory, you may be generating cache misses all over the place. Then again, if the array is large you will get cache misses anyway.
Mergesort parallelises better, so it may be a better choice if that is what you want. It is also much faster if you perform it directly on the linked list.
Since both algorithms run in O(n * log n), making an informed decision would involve profiling them both on the machine you would like to run them on.
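A rough sketch of the copy-to-array approach (the node layout and the helper name sort_via_array are assumptions, and note that qsort is not guaranteed to be stable):

#include <stdlib.h>

struct node { int value; struct node *next; };   /* assumed node layout */

static int cmp_node(const void *a, const void *b)
{
    const struct node *x = *(struct node *const *)a;
    const struct node *y = *(struct node *const *)b;
    return (x->value > y->value) - (x->value < y->value);
}

/* Copy node pointers into an array, qsort the array, relink the nodes. */
struct node *sort_via_array(struct node *list, size_t n)
{
    if (n < 2) return list;
    struct node **v = malloc(n * sizeof *v);
    if (!v) return list;                     /* allocation failed: leave unsorted */
    size_t i = 0;
    for (struct node *p = list; p; p = p->next) v[i++] = p;
    qsort(v, n, sizeof *v, cmp_node);
    for (i = 0; i + 1 < n; i++) v[i]->next = v[i + 1];   /* relink in order */
    v[n - 1]->next = NULL;
    struct node *head = v[0];
    free(v);
    return head;
}

The caller is assumed to know the length n; otherwise one extra O(n) pass to count the nodes is needed first.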
Update
I decided to test my hypothesis and wrote a C program which measured the time (using clock()) taken to sort a linked list of ints. I tried with a linked list where each node was allocated with malloc() and a linked list where the nodes were laid out linearly in an array, so the cache performance would be better. I compared these with the built-in qsort, which included copying everything from a fragmented list to an array and copying the result back again. Each algorithm was run on the same 10 data sets and the results were averaged.
These are the results:
| N | merge sort (fragmented) | Array w/qsort | merge sort (packed) |
|---|---|---|---|
| 1,000 | <1 ms | <1 ms | <1 ms |
| 100,000 | 39 ms | 25 ms | 9 ms |
| 1,000,000 | 1,162 ms | 420 ms | 112 ms |
| 100,000,000 | 364,797 ms | 61,166 ms | 16,525 ms |
Conclusion
At least on my machine, copying into an array is well worth it to improve the cache performance, since you rarely have a completely packed linked list in real life. It should be noted that my machine has a 2.8GHz Phenom II, but only 0.6GHz RAM, so the cache is very important.
10 Comments
This is a nice little paper on this topic. His empirical conclusion is that Treesort is best, followed by Quicksort and Mergesort. Sediment sort, bubble sort, and selection sort perform very badly.
A COMPARATIVE STUDY OF LINKED LIST SORTING ALGORITHMS by Ching-Kuang Shene
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.31.9981
Comments
Comparison sorts (i.e. ones based on comparing elements) cannot possibly be faster than n log n. It doesn't matter what the underlying data structure is. See Wikipedia.
Other kinds of sort that take advantage of there being lots of identical elements in the list (such as the counting sort), or some expected distribution of elements in the list, are faster, though I can't think of any that work particularly well on a linked list.
Comments
As stated many times, the lower bound on comparison-based sorting for general data is going to be O(n log n). To briefly resummarize the argument: there are n! different orderings a list can start in, so any comparison tree that distinguishes all n! possible outcomes needs a height of at least log(n!), and log(n!) is Θ(n log n) (e.g. by Stirling's approximation), which gives the n log n lower bound.
So, for general data on a linked list, the best possible sort that will work on any data that can compare two objects is going to be O(n log n). However, if you have a more limited domain of things to work in, you can improve the time it takes (at least proportionally to n). For instance, if you are working with integers no larger than some value, you could use Counting Sort or Radix Sort, as these use the specific objects you're sorting to reduce the complexity in proportion to n. Be careful, though: these add some other terms to the complexity that you may not consider. For instance, Counting Sort and Radix Sort both add factors based on the size of the numbers you're sorting (O(n + k), where k is the size of the largest number, for Counting Sort).
Also, if you happen to have objects that have a perfect hash (or at least a hash that maps all values differently), you could try using a counting or radix sort on their hash functions.
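As a sketch of the counting-sort idea for a bounded integer domain: this assumes the key is the entire payload, so node values can simply be rewritten in place, and it assumes a small bound k on the keys (node layout and bound are assumptions):

#include <stdlib.h>

struct node { unsigned value; struct node *next; };   /* assumed node layout */

/* Counting sort for keys in [0, k]: count occurrences, then rewrite the node
 * values in ascending order. O(n + k) time, O(k) extra space. Only valid
 * when the key is the whole payload (no satellite data to carry along). */
void counting_sort(struct node *list, unsigned k)
{
    size_t *count = calloc((size_t)k + 1, sizeof *count);
    if (!count)
        return;                           /* allocation failed: leave list as is */
    for (struct node *p = list; p; p = p->next)
        count[p->value]++;
    struct node *p = list;
    for (unsigned v = 0; v <= k; v++)
        for (size_t c = count[v]; c > 0; c--) {
            p->value = v;
            p = p->next;
        }
    free(count);
}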
Comments
Not a direct answer to your question, but if you use a Skip List, it is already sorted and has O(log N) search time.
1 Comment
O(lg N) search time - but not guaranteed, as skip lists rely on randomness. If you are receiving untrusted input, be sure the supplier of the input cannot predict your RNG, or they could send you data that triggers its worst-case performance.
A radix sort is particularly suited to a linked list, since it's easy to make a table of head pointers corresponding to each possible value of a digit.
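A sketch of that idea as an LSD radix sort on 32-bit unsigned keys, one byte per pass, with per-digit head/tail tables (the node layout is an assumption):

#include <stdint.h>
#include <stddef.h>

struct node { uint32_t key; struct node *next; };   /* assumed node layout */

/* LSD radix sort, 8 bits per pass: distribute nodes into 256 buckets by the
 * current byte of the key, then concatenate the buckets in order. Stable,
 * O(n) per pass, 4 passes for 32-bit keys. */
struct node *radix_sort(struct node *list)
{
    for (int shift = 0; shift < 32; shift += 8) {
        struct node *head[256] = {0}, *tail[256] = {0};
        for (struct node *p = list, *next; p; p = next) {
            next = p->next;
            unsigned d = (p->key >> shift) & 0xFF;
            p->next = NULL;
            if (tail[d]) tail[d]->next = p; else head[d] = p;
            tail[d] = p;
        }
        list = NULL;
        struct node **link = &list;
        for (int d = 0; d < 256; d++)
            if (head[d]) { *link = head[d]; link = &tail[d]->next; }
    }
    return list;
}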
1 Comment
Merge sort doesn't require O(1) access and is O(n ln n). No known algorithms for sorting general data are better than O(n ln n).
The special-data algorithms such as radix sort (limits size of data) or histogram sort (counts discrete data) could sort a linked list with a lower growth function, as long as you use a different structure with O(1) access as temporary storage.
Another class of special data is a comparison sort of an almost-sorted list with k elements out of order. This can be sorted in O(kn) operations.
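As an illustration (not from the answer), an insertion sort with a tail check shows where the O(kn) bound comes from: elements already in order are appended in O(1), and only the k misplaced elements trigger a scan of the sorted part (node layout is an assumption):

struct node { int value; struct node *next; };   /* assumed node layout */

/* Insertion sort with a tail check: in-order elements are appended in O(1);
 * each out-of-order element costs at most one O(n) scan, so a list with k
 * misplaced elements sorts in roughly O(n + k*n) time. */
struct node *insertion_sort(struct node *list)
{
    struct node *head = NULL, *tail = NULL;
    while (list) {
        struct node *p = list;
        list = list->next;
        if (!head) {                               /* first element */
            head = tail = p;
            p->next = NULL;
        } else if (p->value >= tail->value) {      /* common, in-order case */
            tail->next = p;
            tail = p;
            p->next = NULL;
        } else {                                   /* out of order: scan for spot */
            struct node **link = &head;
            while ((*link)->value <= p->value)     /* <= keeps the sort stable */
                link = &(*link)->next;
            p->next = *link;
            *link = p;
        }
    }
    return head;
}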
Copying the list to an array and back would be O(N), so any sorting algorithm can be used if space is not an issue.
For example, given a linked list containing uint8_t, this code will sort it in O(N) time using a histogram sort:
#include <stdio.h>
#include <stdint.h>
#include <stdlib.h>
typedef struct _list list_t;
struct _list {
uint8_t value;
list_t *next;
};
list_t* sort_list ( list_t* list )
{
list_t* heads[257] = {0};
list_t* tails[257] = {0};
// O(N) loop
for ( list_t* it = list; it != 0; it = it -> next ) {
if ( heads[ it -> value ] == 0 ) {
heads[ it -> value ] = it;
} else {
tails[ it -> value ] -> next = it;
}
tails[ it -> value ] = it;
}
list_t* result = 0;
// constant time loop
for ( size_t i = 256; i-- > 0; ) { /* buckets 255 down to 0 */
if ( tails[i] ) {
tails[i] -> next = result;
result = heads[i];
}
}
return result;
}
list_t* make_list ( char* string )
{
list_t head = { 0 }; /* head.next must start NULL for an empty string */
for ( list_t* it = &head; *string; it = it -> next, ++string ) {
it -> next = malloc ( sizeof ( list_t ) );
it -> next -> value = ( uint8_t ) * string;
it -> next -> next = 0;
}
return head.next;
}
void free_list ( list_t* list )
{
for ( list_t* it = list; it != 0; ) {
list_t* next = it -> next;
free ( it );
it = next;
}
}
void print_list ( list_t* list )
{
printf ( "[ " );
if ( list ) {
printf ( "%c", list -> value );
for ( list_t* it = list -> next; it != 0; it = it -> next )
printf ( ", %c", it -> value );
}
printf ( " ]\n" );
}
int main ( int nargs, char** args )
{
list_t* list = make_list ( nargs > 1 ? args[1] : "wibble" );
print_list ( list );
list_t* sorted = sort_list ( list );
print_list ( sorted );
free_list ( sorted ); /* sorted holds all the nodes after sorting */
}
6 Comments
O(n lg n) would not be comparison-based (eg, radix sort). By definition, comparison sort applies to any domain that has a total order (ie, can be compared).
Here's an implementation that traverses the list just once, collecting runs, then schedules the merges in the same way that mergesort does.
Complexity is O(n log m) where n is the number of items and m is the number of runs. Best case is O(n) (if the data is already sorted) and worst case is O(n log n) as expected.
It requires O(log m) temporary memory; the sort is done in-place on the lists.
(Updated below. A commenter makes a good point that I should describe it here.)
The gist of the algorithm is:
while list not empty
accumulate a run from the start of the list
merge the run with a stack of merges that simulate mergesort's recursion
merge all remaining items on the stack
Accumulating runs doesn't require much explanation, but it's good to take the opportunity to accumulate both ascending runs and descending runs (reversed). Here it prepends items smaller than the head of the run and appends items greater than or equal to the end of the run. (Note that prepending should use strict less-than to preserve sort stability.)
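A rough C sketch of one way to peel off such a run (the node layout is an assumption, and the answer's own implementation may differ):

struct node { int value; struct node *next; };   /* assumed node layout */

/* Peel one run off the front of *listp (assumed non-empty) and return it as a
 * sorted list. Appends items >= the run's tail; prepends items strictly
 * smaller than the run's head, so a descending input run comes out reversed
 * and stability is preserved by the strict less-than on the prepend. */
struct node *take_run(struct node **listp)
{
    struct node *head = *listp, *tail = head, *p;
    *listp = head->next;
    head->next = NULL;
    while ((p = *listp) != NULL) {
        if (p->value >= tail->value) {        /* extend ascending run */
            *listp = p->next;
            tail->next = p;
            p->next = NULL;
            tail = p;
        } else if (p->value < head->value) {  /* extend descending run */
            *listp = p->next;
            p->next = head;
            head = p;
        } else {
            break;                            /* run ends here */
        }
    }
    return head;
}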
It's easiest to just paste the merging code here:
int i = 0;
for ( ; i < stack.size(); ++i) {
if (!stack[i])
break;
run = merge(run, stack[i], comp);
stack[i] = nullptr;
}
if (i < stack.size()) {
stack[i] = run;
} else {
stack.push_back(run);
}
Consider sorting the list (d a g i b e c f j h) (ignoring runs). The stack states proceed as follows:
[ ]
[ (d) ]
[ () (a d) ]
[ (g) (a d) ]
[ () () (a d g i) ]
[ (b) () (a d g i) ]
[ () (b e) (a d g i) ]
[ (c) (b e) (a d g i) ]
[ () () () (a b c d e f g i) ]
[ (j) () () (a b c d e f g i) ]
[ () (h j) () (a b c d e f g i) ]
Then, finally, merge all these lists.
Note that the number of items (runs) at stack[i] is either zero or 2^i and the stack size is bounded by 1+log2(nruns). Each element is merged once per stack level, hence O(n log m) comparisons. There's a passing similarity to Timsort here, though Timsort maintains its stack using something like a Fibonacci sequence, whereas this uses powers of two.
Accumulating runs takes advantage of any already sorted data so that best case complexity is O(n) for an already sorted list (one run). Since we're accumulating both ascending and descending runs, runs will always be at least length 2. (This reduces the maximum stack depth by at least one, paying for the cost of finding the runs in the first place.) Worst case complexity is O(n log n), as expected, for data that is highly randomized.
(Um... Second update.)
Or just see Wikipedia on bottom-up mergesort.
1 Comment
O(log m) additional memory should not be needed - just add runs to two lists alternately until one is empty.
In most cases, the fastest way to sort is to copy the list to an array, or create an array of pointers to the nodes, sort the array, and create a new sorted list from the sorted array. The array sort should be stable; merge sort will be O(n log(n)), or if radix sort can be used, O(n).
For a list-based sort, the accepted answer has significant overhead. Each merge pass scans half of the list to repeatedly set the node pointer q. The code has to check for merge size as well as for NULL during the inner merge loop. I simplified the code from the accepted answer to sort a non-circular singly linked list, with the compare inline rather than being a function call.
I compared the Simon Tatham sort to a bottom up merge sort in 64 bit mode. Each node has a next pointer and a 64 bit unsigned integer for data. The test allocates an array of 2^24 = 16777216 nodes. The nodes are filled with pseudo random data, and the list is sorted to test sorting with sequential nodes. This results in a list with randomly scattered nodes. The nodes are again filled with pseudo random data to test sort again with initially scattered nodes. I ran this test on an old desktop with an Intel 3770K cpu.
| | sequential | scattered |
|---|---|---|
| Simon Tatham | 10.3 seconds | 32.7 seconds |
| Bottom up | 5.8 seconds | 11.3 seconds |
Bottom-up merge sort uses a small (32-entry) array of pointers to nodes to track run boundaries for lists within a linked list. array_of_pointers[i] is either NULL or a pointer to the first node of a sorted run with 2^i nodes. Nodes are merged into the array one node at a time, then the array is merged to create a sorted list.
#include <stddef.h> /* size_t */
#include <stdint.h> /* uint64_t */
typedef struct NODE_{ /* base node struct */
struct NODE_ * next;
uint64_t data;
}NODE;
NODE * Merge(NODE *pNode0, NODE *pNode1)
{
NODE *pMrg;
NODE **ppMrg = &pMrg;
if(pNode0 == NULL) /* if either list empty, */
return pNode1; /* return the other */
if(pNode1 == NULL)
return pNode0;
while(1){ /* merge the lists */
if(pNode0->data <= pNode1->data){
*ppMrg = pNode0;
ppMrg = &(pNode0->next);
pNode0 = *ppMrg;
if(pNode0 == NULL){
*ppMrg = pNode1;
break;}}
else{
*ppMrg = pNode1;
ppMrg = &(pNode1->next);
pNode1 = *ppMrg;
if(pNode1 == NULL){
*ppMrg = pNode0;
break;}}}
return pMrg;
}
/* size of internal array */
#define ASZ 32
NODE *MergeSort(NODE * pNode)
{
NODE *apNode[ASZ] = {0}; /* array of lists */
NODE *pNext;
size_t i;
if(pNode == NULL || pNode->next == NULL)
return pNode;
/* merge nodes into array */
while(pNode){
pNext = pNode->next;
pNode->next = NULL;
for(i = 0; i < ASZ && apNode[i] != NULL; i++){
pNode = Merge(apNode[i], pNode);
apNode[i] = NULL;}
if(i == ASZ) /* don't go past end array */
i--;
apNode[i] = pNode;
pNode = pNext;}
/* merge array into a single list */
pNode = NULL;
for(i = 0; i < ASZ; i++)
pNode = Merge(apNode[i], pNode);
return pNode;
}
Comments
As far as I know, the best sorting algorithm is O(n log n), whatever the container - it's been proved that sorting in the broad sense of the word (mergesort/quicksort etc. style) can't go lower. Using a linked list will not give you a better run time.
The only algorithm which runs in O(n) is a "hack" algorithm which relies on counting values rather than actually sorting.
2 Comments
O(n lg c). If all of your elements are unique, then c >= n, and therefore it takes longer than O(n lg n).
You can copy it into an array and then sort it.
Copying into an array is O(n),
sorting is O(n lg n) (if you use a fast algorithm like merge sort),
copying back to the linked list is O(n) if necessary,
so it is going to be O(n lg n).
Note that if you do not know the number of elements in the linked list, you won't know the size of the array. If you are coding in Java, you can use an ArrayList, for example.
1 Comment
The question is LeetCode #148, and there are plenty of solutions offered in all major languages. Mine is as follows, but I'm wondering about the time complexity. In order to find the middle element, we traverse the complete list each time: the first time n elements are iterated over, the second time 2 * n/2 elements are iterated over, and so on. It seems to be O(n^2) time.
def sort(linked_list: LinkedList[int]) -> LinkedList[int]:
# Return n // 2 element
def middle(head: LinkedList[int]) -> LinkedList[int]:
if not head or not head.next:
return head
slow = head
fast = head.next
while fast and fast.next:
slow = slow.next
fast = fast.next.next
return slow
def merge(head1: LinkedList[int], head2: LinkedList[int]) -> LinkedList[int]:
p1 = head1
p2 = head2
prev = head = None
while p1 and p2:
smaller = p1 if p1.val < p2.val else p2
if not head:
head = smaller
if prev:
prev.next = smaller
prev = smaller
if smaller == p1:
p1 = p1.next
else:
p2 = p2.next
if prev:
prev.next = p1 or p2
else:
head = p1 or p2
return head
def merge_sort(head: LinkedList[int]) -> LinkedList[int]:
if head and head.next:
mid = middle(head)
mid_next = mid.next
# Makes it easier to stop
mid.next = None
return merge(merge_sort(head), merge_sort(mid_next))
else:
return head
return merge_sort(linked_list)
1 Comment
I would suggest trying Shell Sort. Although its computational complexity (with a good gap sequence) is unknown, it has an upper bound of O(n^(4/3)), but this is misleading. In practice Shell Sort on an array is just slightly slower than Quicksort, and it beats most O(n log(n)) sorts. In my tests this is true for N up to at least 20 billion; I could not test beyond that. The code is simple, parallelizable, and suitable for implementation on a linked list (edit: on reflection, most likely not true). I have not tried it on linked lists, only arrays.
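For reference, a minimal array-based Shell sort sketch; the Knuth gap sequence used below is just one common choice, not necessarily the sequence benchmarked in the answer:

#include <stddef.h>

/* Shell sort on a plain int array using Knuth's gap sequence (1, 4, 13, 40, ...).
 * The gap sequence is just one common choice among many. */
void shell_sort(int *a, size_t n)
{
    size_t gap = 1;
    while (gap < n / 3)
        gap = 3 * gap + 1;               /* largest Knuth gap below n */
    for (; gap > 0; gap /= 3) {
        for (size_t i = gap; i < n; i++) {   /* gapped insertion sort */
            int tmp = a[i];
            size_t j = i;
            while (j >= gap && a[j - gap] > tmp) {
                a[j] = a[j - gap];
                j -= gap;
            }
            a[j] = tmp;
        }
    }
}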
2 Comments
Mergesort is the best you can do here.
