Sort Linked Lists 10X Faster Than MergeSort

← Back to Stories (view on slashdot.org)

Sort Linked Lists 10X Faster Than MergeSort

Posted by kdawson on Sunday February 25, 2007 @07:43AM from the out-of-sorts dept.

virusfree tells us about a new algorithm that has been developed that the author claims can sort a linked list up to 10 times faster than MergeSort. "BitFast," a member of the Hash algorithms family, is available in C and C++ under the GPL.

4 of 326 comments (clear)

Min score:

Reason:

Sort:

Re:radix sort vs. comparison sort by Ignorant+Aardvark · 2007-02-25 08:20 · Score: 3, Interesting

I ditto this. Mergesort is a true comparison sort, which means it can sort anything so long as a comparison between two values is defined, e.g. strings (in lexigraphical order), floating point numbers, 1000-digit numbers ... you get the drift. BitFast is a radix sort and only works in cases where the list elements obey certain limitations: for instance, if each list element can be expressed in 32 bits (an int).

--
Cyde Weys Musings - Scrutinizing the inscrutable
Re:Only lists of integers by antifoidulus · 2007-02-25 08:23 · Score: 3, Interesting

Not only integers, more than likely only POSITIVE integers. If you are using 2's complement that would mean that the first bit in all negative numbers would be a 1. Unless I missed something, this algorithm does not take that into account and would sort them as being larger than all positive numbers.

--
Monstar L
Memory leaks... by fprog26 · 2007-02-25 09:41 · Score: 4, Interesting

Inside bitfast.h: long *Tail = new long[65535]; long *Head = new long[65535]; for(n = 0; n < 65535; n++){ Head[n] = 0; Tail[n] = 0; } Memory leak of 128KB for each time the function "Node* BitFast(Node* HeadNode,long run)" is called. MISSING: delete[] Tail; delete[] Head; Furthermore the following is faster or use memset: long Tail[65535] = {0}; long Head[65535] = {0}; Unless you are running this in DOS16, DOS32 with Pharlap or smaller 8051, 68000 processor, you do not need to use new[] or malloc() for this. In InsertToList, Node *tmpP = (Node*)malloc(sizeof(Node)); free(tmpP) missing inside FreeList()... no such function In main.c: Node *L1 = malloc(sizeof(Node)); Node *L2 = malloc(sizeof(Node)); MISSING: free(L1); free(L2); In main.cpp: Node *L1 = new Node; Node *L2 = new Node; Instead use (no need to use heap, use stack memory): Node L1 = {0}, L2 = {0}; Mixing the usage of std::cout and printf() is not recommanded. Use all of the former or the later, else you will have weird display on some run, unless you flush the output buffer. I suggest you use printf() all the way. cout << "Ok!!!!" << "\n\n" << "Merge Sort : " << tim1 << " ms\n" << "BitFast : " << tim2 << " ms\n\n"; becomes: printf("Ok!!!!\n\nMerge Sort : %ld ms\nBitFast : %ld ms\n\n", tim1, tim2); Furthermore, your implementation of link list, and calling Length(Node*) for every equal is highly inefficient, requires O(n). Store the link list length somewhere when inserting. EqualSplit takes O(1.5n) instead of O(0.5n) because of that. Something like: typdef struct LinkList { Node head; int length; } LinkList; As a result, MergeSort recursively call itself, which calls EqualSplit every turn. You should make your header file C friendly, and avoid mixing // with /* */, malloc and new[] No clue why this header is needed: #include <iso646.h> instead of not/and use use ! and && before: if(not p) return NULL; becomes: if(!p) return NULL; Finally, BitFast is not recursive; therefore, you should try to find an implementation of merge sort which is not recursive also, so you save on function calls, if possible. "However, iterative, non-recursive, implementations of merge sort, avoiding method call overhead, are not difficult to code." -- Wikipedia http://en.wikipedia.org/wiki/Merge_so rt Your function should be in a file.c or be inline.
Multiple bugs in the code, wrong measurements ! by wtarreau · 2007-02-25 10:34 · Score: 5, Interesting

Don't use this code !

First, there are multiple bugs in the code itself :

1) everywhere, it is assumed that 2^16 = 65535 and not 65536. Arrays are allocated for this size too, but are used with 65536 entries. (un)fortunately for him, the space is probably allocated anyway so the program does not crash on this.

2) a similar bug in the random number generator carefully avoids dangerous values, because he generates numbers between min and max by doing a modulo on max-min , which prevents max from being produced. This is not a
big deal anyway, just that it hides the bug.

3) he assumes everywhere that "long" is unsigned and "short int" too. This makes the code SegFault on Linux with negative array indexes. The fix is obvious, though.

And most of all : times are measured in optimal conditions. Indeed, this algorithm is
just a radix sort with a large radix (2^16). A radix algorithm is efficient when the number
of elements is high compared to the radix, because at every level, all radix entries have to
be scanned for existing values. In his example, he sorts 4 million elements. His code is faster
than the merge sort on 4 million entries, just as would be any radix sort. But on smaller sets,
it still takes the same time per level, because it has to scan all 65536 entries even if there
are only 100 numbers. The fact is that for less than 10000 values, the mergesort becomes faster in his own code.

When measuring algorithms complexity, we use the O(x) notation. The time per iteration is never
considered. When the time matters (as in his example), we should consider the time per iteration (T), and the total time (t) will follow :

t = T . complexity

In complexity, his algorithm is O(N) on 16 bits, but the time per iteration is very high. Other sorting algorithms are O(N.log(N)). So the total time is :

t1 = T1 . O(N) for his algorithm, and :
t2 = T2 . O(N.log(N)) for the mergesort.

But when T1 > T2.log(N), he looses, which is almost always because log(N) is small for 16 bits, and T2 too because operations are very simple. In his case, T1 is very high because it represents the time required to scan 65536 values.

It's amazing that he fell in such a beginner's trap. I suppose that he's still a student, which would explain the total lack of portability and newbie's bugs in the code.

Well tried anyway :-)

Willy