Is this the simplest (and most surprising) sorting algorithm? (arxiv.org)
621 points by ColinWright on Oct 5, 2021 | 318 comments



This is silly but sort of relevant.

During college junior year (1993) as a physics major I took a class in digital electronics (which included 68000 assembler programming). We had a lab contest to create the fastest sorting routine for a set of random unique numbers between x and y.

I won the contest by setting to 0 a y-x register range of memory and then inserting each number into the range based on the number itself ("27" into register 27, "18" into register 18, etc.). Then I printed out all non-zero numbers in the range.

The other 20 or so students did versions of bubble-sorting and called my solution a cheat. The professor defended my victory as I had not broken any of the rules of the contest...
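
(For the curious, here's roughly what that trick looks like sketched in Python rather than 68000 assembler; the bounds are made up, and it relies on the values being unique and non-zero, as in the original story:)

    def presence_sort(values, x, y):
        # One slot per possible value in [x, y]; 0 means "not seen yet".
        slots = [0] * (y - x + 1)
        for v in values:
            slots[v - x] = v                   # "27" goes into slot 27 - x, etc.
        return [v for v in slots if v != 0]    # read back the non-zero slots in order

    print(presence_sort([27, 18, 23], x=10, y=30))   # [18, 23, 27]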


Haha. Reminds me of a lab where we had to program a 6502 to play some sort of game, rendered on an oscilloscope. My genius friend stayed up all night writing code that rendered the snake by dynamically writing code, rather than the obvious approach of reading the “snake” coordinates and rendering those to the screen.

Our snake code ran at something obscene like 1,000 Hz. So much faster than anyone else’s. The professor couldn’t understand how we did it, so gave a low mark.


>The professor couldn’t understand how we did it, so gave a low mark.

That was genuinely upsetting to read.


In High School 2004 I made a license plate detection system for our IT class project. It was not perfect, granted, but was able to locate license plates in a photograph pretty reliably and do OCR on them.

Teacher was like, nah, that's not possible, and clearly hadn't even read it. Gave me an A just to stop me complaining, still not understanding the project.


My Fortran professor gave me a low mark because I used a lookup table for octal to binary conversion, instead of using division and modulo.


In a teaching lab in 1995, I had to make a digital thermometer using a mcu. I wrote code that dynamically read the voltage drop on the sensor and calculated the temperature. It was fast enough and easy to calibrate. The code was small and easy to understand.

I didn’t pass because the point of the assignment was to use lookup tables, even though that was a more complex approach in this case :(


"It was fast enough and easy to calibrate."

Are you sure about easy to calibrate 8) Did your solution have the same level of accuracy across the range as the MCU would provide?

Bizarrely, you sometimes have to follow the standards because that is what everyone does. I can't think of an example now but there will and have been cases in engineering where the wrong answer is the right one because that is what is done. I know of mensuration devices that "model" some curves with a linear approximation and paper over the errors with cough error estimates and/or keeping within the nearly linear range. That too is fine if everyone understands what is going on.

Accuracy is a funny old thing.

However, I like your approach and it shows you probably understand the principles involved. Ideally, if you are going to be a smart arse like that, then submit two solutions - the prescribed one and your clever one.

You mention a MCU but not what sort of sensor ...


Curious as to what you mean by "dynamically writing code"


Like metaprogramming: creating a string to define a query and then executing the string. In the case above, you dynamically create the program you need to solve the problem and then execute it. This is different from "solving the problem" directly. You are writing a program to write the program that solves the problem.
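
(A trivial illustration of the idea in Python, not the actual 6502 code in question: the program text is built as a string and then executed:)

    # Hypothetical example: build the "program" as a string, then run it.
    def draw(x, y):
        print(f"plot segment at ({x}, {y})")

    coords = [(1, 2), (2, 2), (3, 2)]
    generated = "\n".join(f"draw({x}, {y})" for x, y in coords)

    exec(generated)   # executes the generated draw calls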


Doesn't sound like this kind of complicated string eval would be able to do 1000 Hz, but then again I also have no clue why you could possibly need that for a snake game. On an oscilloscope you'd need to produce vector lines to move from/to, right?


More likely, I would wager that they were just overwriting the instructions to draw with new instructions that just inline everything so it didn't need to fetch out to anything.


Yes. This.


This is a slightly simpler (because of unicity) version of bucket sort or counting sort.


I remember seeing a (gag) variation of counting sort that just provided each element as an argument to `sleep`.
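
(Roughly like this, sketched here in Python with threads rather than the usual shell one-liner; it assumes small non-negative numbers:)

    import threading, time

    def sleep_sort(numbers):
        result = []
        def waiter(n):
            time.sleep(n)          # smaller numbers wake up (and get appended) first
            result.append(n)
        threads = [threading.Thread(target=waiter, args=(n,)) for n in numbers]
        for t in threads: t.start()
        for t in threads: t.join()
        return result

    print(sleep_sort([3, 1, 2]))   # [1, 2, 3] -- eventually, and only probabilistically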



That’s more like heapsort, because the OS will use a heap to implement a priority queue that contains times for when the next process needs to be scheduled.


Brilliant!

Although if it took a long time to enumerate the list of elements…


Pass them in as milliseconds or nanoseconds?


The "set of random unique numbers between x and y" part stands out so strongly that I would be shocked if the whole point wasn't to encourage a non-comparative sort.


> but sort of relevant

Hahahahaha :D

> I won the contest by setting to 0 a y-x register range of memory and then inserting each number into the range based on the number itself ("27" into register 27, "18" into register 18, etc.). Then I printed out all non-zero numbers in the range.

That's really smart. The others were just angry because they couldn't think outside of the box. Nice work!


That's not silly, that's the right way to solve that problem, unless y - x is large enough that you'd benefit from using one bit per number instead of one word. Well, usually calling qsort() would be fast enough, and less code.


That brings back memories! I did the same thing. We had to sort a million integers in the range 0-1000 (exclusive range). I did a counting sort, and beat every other solution handily.

In my case, though, the teacher disqualified my solution since the resulting lists didn't contain the same fixnums as the one I sorted, which I argued was stupid.

In the end I implemented a merge sort with some tricks and won anyway.


The same fixnums!? Fixnums are distinct from bignums in that they're small enough to fit in a machine word and therefore don't need to be objects allocated in the heap... Was this an odd implementation wherein fixnums were put into the heap anyway?


Nope. This was in PLT scheme. Fixnums were eq? to other fixnums, meaning they satisfied the most basic equality.

Arguing was futile. When we started benchmarking very large lists, the counting sort was the only one that did it in reasonable time. Despite being disqualified.


It's not silly. That's basically the "counting sort" and it definitely has uses.


The observation that "random numbers can be sorted in linear time" is actually often useful in algorithm design. A typical application is storing a sorted list of hash values.


Basically pigeon hole or bucket sorting https://en.wikipedia.org/wiki/Pigeonhole_sort which, as another commenter mentioned, are types of non-comparison sorting so you can get near linear times

I think the mainstream sorting algorithms tend to cater towards "mainstream" ordering which tends to be somewhat sorted versus purely random. An example is https://en.wikipedia.org/wiki/Timsort


I have also heard about it as radix sort : http://www.codercorner.com/RadixSortRevisited.htm


This is not the same as radix sort but you're right that radix is a non-comparative sorting algorithm just like the one the parent commenter described.


So, you implemented bucket sort?


Isn’t this usually taught right before sort algorithms? It’s a simple lead-in to the topic and is great for reasoning about memory overhead.


Sounds like an index sort. Your number space must have been fairly dense for this to work.


I like this! Could this be a digital implementation of the analog Spaghetti Sort? [https://en.wikipedia.org/wiki/Spaghetti_sort]


That's just the right solution if you know x and y (or even just x?) ahead of time isn't it?

And don't you mean "27" into register 27-x, ditto 18, etc.?


Or even neither!

Much faster to find upper and lower bounds and essentially construct a hash table than to manually sort a whole list of non-trivial length.


Is it really? Building the hash can be made in linear time but iterating the values in order, especially when there might be gaps in the interval? Could be messy


The overhead of hashing in a hash table makes it worse than sorting algorithms without hashing.


ha, I did exactly the same thing!


High level description of why it sorts:

The inner loop takes an array and an index i in the array, and rearranges the array to put the largest value at index i. This is done in such a way that if the sub-array up to but not including i is sorted, then afterwards the sort will be extended such that the sub-array up to and including i will be sorted.

The outer loop says, just call in to the inner loop with i from 1 to n. This will result in a sorted array, because the initial array of a single element is trivially sorted, and then we'll have by the sort-extension property of the inner loop that the progressively larger sub-arrays of 2, 3, ... up to N elements become sorted.

You can see that the inner loop has the sort extension property by thinking about two cases.

Case 1: If the i-th element is already larger than all previous elements, it won't touch any of the elements before i, and will only potentially make the i-th element bigger, thus clearly extending the sort though the i-th element.

Case 2: If the i-th element is smaller than one of the previous elements, it won't touch any elements until reaching that smaller element; then it will start a series of swaps until it reaches index i, at which point the original value of the i-th element is left inserted in sorted position into the sub-array up to and including i. So at this moment we have the sort extension property. From there, it may increase the value of the i-th element and change later elements, but both of those will preserve the sub-array sort.
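
For reference, here is the algorithm under discussion, transcribed from the paper's pseudocode into Python (0-based indexing, otherwise unchanged):

    def i_cant_believe_it_can_sort(a):
        n = len(a)
        for i in range(n):           # outer loop: grow the sorted prefix
            for j in range(n):       # inner loop: compare position i with every position
                if a[i] < a[j]:
                    a[i], a[j] = a[j], a[i]
        return a

    print(i_cant_believe_it_can_sort([4, 2, 5, 1, 3]))   # [1, 2, 3, 4, 5]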


It is being likened to bubblesort in various places but your description (which I quite like) makes it seem like quicksort with an intentionally bad (worst possible?) bifurcation.


Well, it's not really dividing, sorting, and merging, so I wouldn't say it's too similar to quicksort.

It's basically insertion sort, but with the curious property that the last value of the sorted subarray immediately and always has the correct value, and we then "gently" insert new values into the sorted subarray.

smusamashah linked to a nice sort visualization.

https://xosh.org/VisualizingSorts/sorting.html#IYZwngdgxgBAZ...

Tap the "Insertion Sort" checkbox on the right, and then the "Start" button below that.

You can see how both sorts are slowly constructing a sorted subarray, but this sort has the interesting property that the tail of the sorted subarray immediately obtains its maximum brightness, and new values are gently incorporated into the array, while the insertion sort tail is just the brightest value encountered so far, and the latest value is "abruptly" sifted down into its appropriate place.

It feels like this sort might minimize some kind of "sort entropy" metric? I.e., visually it looks very smooth. Each step along the way to sorted is very small.


> https://xosh.org/VisualizingSorts/sorting.html#IYZwngdgxgBAZ...

That is the most beautiful thing in CS I've ever seen.


This visualization tool is amazing.

Is rain just the atmosphere running bubble sort ;) ?


[Edit: I took a quick glance at the paper and got carried away; the rest of this comment is about a different algorithm apparently known as "exchange sort", and an interesting property of it. The algorithm in the paper is actually different, and indeed more surprising.]

Hey! That's my sorting algorithm! (Well, I'm sure it has been independently discovered many times, but one of those times was by me, before I had learned of O(n log n) sorting; I used it many times, including in the INOI 2004 programming contest.) Ten years ago, in 2011, I asked the following question on math.SE about this algorithm and Catalan numbers, a truly surprising property that is not mentioned in this paper: https://math.stackexchange.com/questions/36022/how-do-the-ca...

> For situations where a quadratic-time sorting algorithm is fast enough, I usually use the following:

    //Given array a[1], ... a[n]
    for i = 1 to n:
        for j = i+1 to n:
            if a[i] > a[j]:
                swap(a[i],a[j])
> It looks like bubble sort, but is closer to selection sort. It is easy to see why it works: in each iteration of the outer loop, `a[i]` is set to be the smallest element of `a[i…n]`.

----

Also: A few years ago I emailed Prof. Richard Stanley of Enumerative Combinatorics fame asking whether it was related to any of the hundreds of problems in his Catalan Numbers list, and he said he didn't know a direct connection at the time (so maybe it's an addition to his list?):

> > permutations equal to the product of all their inversions (in lexicographic order) [multiplying on the left]

> Example: The permutation 1423 has two inversions (24) and (34), and the product (34)(24) is the same as 1423.

> Non-example: The permutation 3412 has four inversions (13), (14), (23) and (24), and the product (24)(23)(14)(13) is 4321 which is not the same as 3412.

> Among the permutations of {1, 2, 3, 4}, there are 14 examples (and 10 non-examples).

And 14 is C_4, the fourth Catalan number.


Not quite, your algorithm has j going from i+1 to n, and swapping when a[i] > a[j].

OP's algorithm has both indexes going from 1 to n, and swapping when a[i] < a[j].

Yours is more efficient and easy to see why it's correct, but OP's is slightly simpler and more surprising.


Oh yes, good point: I got carried away; there is a difference! Simplicity is subjective (I think mine is simpler of course :P), but the algorithm in the post is indeed more surprising.


Your algorithm does appear in Section 3.1 of the linked paper by the name "Exchange Sort"


I actually did the opposite! I was taught this sort in high school (I think the teacher called it shuttle interchange?), and I independently came up with the bubble sort on my own as a more-efficient in-place implementation on disk.

Back in the 90’s for my high-school computer class, we were saving records on disk using TurboPascal and I was trying to sort my file in-place.

The sort was super slow, which I assumed was the disk head having to jump back and forth over larger distances of the array of records when comparing and swapping.

I reasoned if I could keep the compares and swaps between I and I+1, and then sweep those I’s forward, it’s the same number of overall compares but less head jumping with each compare/swap being adjacent records.

My implementation did speed things up quite a bit. I have no idea if my reasoning was right, or if I had some other inadvertent bottleneck. But I was proud of the speedier result.

I never explicitly pointed it out to the teacher, though, and didn’t get full marks on that homework because he said my overall program wasn’t user friendly enough.


Back in the day, I was sorting records on disk on an Atari ST using (probably) Omikron BASIC. I applied a RADIX sort (or a variant) that I saw in some magazine. It worked like a charm on a RAM disk, but once I tried it on a floppy disk, it just took forever.


Your algorithm is in the paper as a comparison, and it is described as a standard exchange sort.


What do you mean, it looks like bubble sort? It is bubble sort.


Bubble sort only swaps adjacent elements. The one I used is a lesser-known sort apparently called "exchange sort".


Oh gosh. Now you made me remember my own Catalan number problem with regards to weird solutions to generating balanced parentheses ahahaha.


Because I have a need to optimize:

    // Given array a[1], ..., a[n]
    for i = 1 to n - 1:
        swapped = false
        for j = i + 1 to n:
                if a[i] > a[j]:
                    swapped = true
                    swap(a[i], a[j])
                if swapped == false:
                    break
This should reduce time complexity to O( (n - 1)^2 ) and stop sorting after the remainder of the array is sorted and tested one time.


> This should reduce time complexity to O( (n - 1)^2 )

O((n-1)^2) = O(n^2).


I see where you're going, but your implementation broke the sort.

As for time complexity, it changes from ϴ(n²) to O(n²). Alternatively, you can drop the O-notation and calculate precisely so as to include constant factor speedups, though your algorithm may end up slower in the worst case.


it doesn't work though

        def swap(lst, a, b):
            tmp = lst[a]
            lst[a] = lst[b]
            lst[b] = tmp

        def optimized_sort(numbers):
            i = 0
            while i < len(numbers):
                swapped = False
                j = i + 1
                while j < len(numbers):
                    if numbers[i] > numbers[j]:
                        swapped = True
                        swap(numbers, i, j)
                    if swapped == False:
                        break
                    j += 1
                i += 1
            return numbers

        numbers  = [1, 4,7,3,7,12,55,3,11,2,9,88,8]
        optimized_sort([*numbers]) => [1, 4, 2, 7, 7, 12, 3, 3, 8, 9, 55, 11, 88]

removing the break gets you the correct order: [1, 2, 3, 3, 4, 7, 7, 8, 9, 11, 12, 55, 88]


Original code would be correct, if not for a few spaces - probably a typo.


This is why I like me some curly braces.

(Whitespace dependent languages suck when it comes to passing around code snippets - it's just the honest truth)


It was just an indentation typo. Swap check should be one level higher.


That would be a decent sort which returns a slightly sorted output.


INOI 2004. this guy's OG.


I remember asking the question on a stack-overflow site in 2015.

https://cs.stackexchange.com/questions/45348/why-does-this-s...

And it was answered successfully by pointing the same invariant they point in the paper.

I remember finding this program via automatic program search, so the algorithm, and the question of why it works, has probably been discovered many times.

If only we had something that would allow us to surface information from the vast amount of data available...


Yes, but you asked a computer science question with computer code instead of "pseudo code and arguments of correctness.", as Rafael kindly reminded you. :)


That comment was a bit of an eye-roller. The post is perfectly clear as it stands.


The original question contained a sizeable chunk of what looks like C#. It was then edited and made much clearer perhaps in a response to the comment.



More like, GistNoesis used 0-based arrays and the paper uses 1-based arrays. “Arguments of correctness” sounds like Rafael basically told GistNoesis to answer themselves, which is of course a widely spread attitude on SE.


Wait a minute! What the hell is ‘automatic program search’? Is it like with Malbolge where the first working programs had to be written through brute-force search?


You define some unit-test cases: input-output pairs.

You define some elementary blocks like in Scratch (Blockly). And you try to randomly assemble the blocks so that they respect the type constraints. And you score your program based on the number of test cases it got right.

For this simple problem, naive random search was all that was needed (fewer than 100k program evaluations).

But for more advanced cases you could use something like Genetic Programming with Koza's Automatically Defined Functions where you build an intermediate library of useful blocks.

Now one would use deep learning and produce source code directly with some tool like Copilot and run it until it passes all tests. Or if you want to get your hands dirty, you can re-implement the old school methods and use a neural network to make the pick decisions instead of a random pick.

Once you have programs that pass all test cases, you can try to prove correctness (that's more something like HOL-Light would do) and whether or not they are equivalent (and equivalent to a formal specification) and optimize them further by transforming the program.
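
(A toy illustration of the naive random-search idea in Python; the blocks, test cases, and target function here are all made up for the example:)

    import random

    # Elementary blocks: small unary functions we are allowed to compose.
    blocks = {"inc": lambda x: x + 1, "dbl": lambda x: x * 2, "sq": lambda x: x * x}

    # Unit-test cases (input-output pairs) for an unknown target, here f(x) = (x + 1) * 2.
    tests = [(0, 2), (1, 4), (3, 8), (5, 12)]

    def score(program):
        """Count how many test cases the candidate program gets right."""
        def run(x):
            for name in program:
                x = blocks[name](x)
            return x
        return sum(run(inp) == out for inp, out in tests)

    best = None
    for _ in range(100_000):                     # naive random search
        candidate = random.choices(list(blocks), k=random.randint(1, 4))
        if score(candidate) == len(tests):
            best = candidate
            break

    print(best)   # e.g. ['inc', 'dbl']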


Interesting, thanks!


I think we should start recognizing more alternate information sources, such as blogs and stack exchange (i.e. attribute scientific credit and recognition to those posts).

Of course traditional scientific publishing has its place (including the care for methodology and source citation)... but many people will search directly those websites instead, so clearly it's an important emerging source of information and discovery.


Write it down or it didn't happen. More directly, write down the result separate from the method or it won't be discovered by the right audience.


Maybe there gotta be a paper or it does not exist xD


Some links to discussions and/or accidental discoveries of this algorithm:

OP (2021): https://arxiv.org/abs/2110.01111

Stack Overflow (2021, linked by 'Xophmeister in this discussion): https://stackoverflow.com/questions/69437526/what-is-this-od...

Stack Overflow (2021, linked from previous): https://stackoverflow.com/revisions/69435932/1

Stack Overflow with 5 downvotes! (2020): https://stackoverflow.com/questions/63214115/what-is-the-nam...

Stack Overflow (2020): https://stackoverflow.com/questions/59802669/pointer-array-s...

Computer Science Stack Exchange (2015, linked by 'GistNoesis in this discussion): https://cs.stackexchange.com/questions/45348/why-does-this-s...

Stack Overflow (2014!): https://stackoverflow.com/questions/25276619/how-should-i-an...

Reddit (2014, same as previous): https://www.reddit.com/r/algorithms/comments/2dect7/how_shou...


Stack Overflow (2020, with incorrect answer): https://stackoverflow.com/questions/61970160/what-is-the-nam...

Stack Overflow with 6 downvotes (2019, with incorrect answer): https://stackoverflow.com/questions/57080923/how-to-understa...

Stack Overflow (2016): https://stackoverflow.com/questions/34745203/using-a-for-loo...

Stack Overflow (2016): https://stackoverflow.com/a/37650932


Wow how did you find all these? Have you been keeping track all these years, or did you somehow search for and find this just now?


All today. I have been trying to carefully craft searches on Google and Stack Overflow. I wish Google Code Search were still around, or that Github code search were better.


https://sourcegraph.com/search ?

I always get it confused with self-hosted https://github.com/CoatiSoftware/Sourcetrail but I think Sourcetrail just shut down.


Thank you!! This is great! I found some more examples as a result:

Github (2021): https://github.com/terror/paragon/blob/master/examples/sort....

Github (2012!): https://github.com/ragnraok/VisualSort/blob/master/sortalg.p...


Github (2014--2016): https://github.com/echavarria/edx-mit-600.1x/blob/master/W05... https://github.com/slgraff/edx-mitx-6.00.1x/blob/master/pset... https://github.com/mbh038/MITx600/blob/master/600.1x/Problem... https://github.com/demidovakatya/courses/blob/master/mitx/mi...

Chegg (year?): https://www.chegg.com/homework-help/questions-and-answers/15...

Apparently this was mentioned in a problem set for MIT edX course 6.00.1x in 2014. Maybe someone then tried to have the internet do their homework for them (this may also be the case for some of the posts in my parent comment):

Math Stack Exchange (2014): https://math.stackexchange.com/questions/876400/best-worst-t...

Note that the question got 3 downvotes, and that the answer is wrong. In particular, the two sorts in the question sort in opposite directions!


Serious question:

What are you searching for to get these results?


[sort "for i in range(n)" "for j in range(n)"] https://www.google.com/search?q=sort+%22for+i+in+range%28n%2...

[sort "for (int i = 0" "for (int j = 0" site:stackoverflow.com] https://www.google.com/search?q=sort+"for+(int+i+%3D+0"+"for...

["modswapsort"] https://www.google.com/search?q="modswapsort"

The last is the function name from one of the earlier results.


IMO this does not approach the simplicity of sleep sort: https://rosettacode.org/wiki/Sorting_algorithms/Sleep_sort

Also I am pretty sure I saw this algorithm before. Doesn't it have a name?


If this doesn't have a name (I didn't see one in the paper), I propose “interview sort”.


All sorts are interview sorts - that's the only context in which they come up, for most programmers.


That's probably true; however, didn't the default sort change between Python 2 and Python 3, when a newer, faster algorithm was found and implemented?


Bogosort is pretty simple, too.

  while not is_sorted(a) do
    shuffle(a)
However the shuffle function hides a surprising amount of complexity. It is nontrivial to write a correct shuffle function. [1]
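
For illustration, the standard Fisher-Yates shuffle sketched in Python (essentially what `random.shuffle` already does); the classic bug is drawing `j` from the full range every time instead of the shrinking prefix, which skews the distribution:

    import random

    def fisher_yates_shuffle(a):
        # Walk from the end; each element trades places with a uniformly
        # chosen element at or before it.
        for i in range(len(a) - 1, 0, -1):
            j = random.randint(0, i)      # note: 0..i inclusive, not 0..len(a)-1
            a[i], a[j] = a[j], a[i]
        return a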

But Bogosort isn't even the worst sorting algorithm out there! There is also Worstsort [2].

[1]: https://en.wikipedia.org/wiki/Fisher%E2%80%93Yates_shuffle

[2]: https://arxiv.org/abs/1406.1077


Shuffling is even less tricky if you're not trying to do it in place, if you're not trying to do it in linear(ithmic) time, or if you're not trying to induce a uniform distribution over all permutations (but rather just want some positive probability for all of them). Relaxing any combination of these conditions works just fine for Bogosort.


It would even suffice to just swap two random elements every iteration!


My gut says that combining that with other sort cycles in some magic ratio could do magical things. I'm quite wrong quite often ofc but the adventure is out there!


Fun fact: Fisher-Yates samples a uniform distribution over permutations and is therefore capable of producing any permutation, so you can sort any array in linear time using Fisher-Yates with its RNG replaced by an appropriate oracle. (So the complexity of sorting an array mostly reduces to the complexity of determining which permutation it's in relative to a sorted version.)


I think most C++ implementations of bogosort use std::next_permutation


You can turn a sort into a shuffle by replacing the comparator with a random coin flip, and you already have defined a sort algorithm, so there's no added code complexity.

    fn bogosort(a, comparator):
      while not is_sorted(a, comparator) do
        bogosort(a, rand)


You can't turn all sorts (not even all comparison sorts!) into a shuffle by replacing the comparator with a random coin flip. In fact your code post is a demonstration of that fact.


a good (even) shuffle is not easy.


Sleep sort is heap sort. The heap is just built into your OS.


Sleep sort is meant as a joke. Obviously, everybody understands there exists no O(1) GP sorting algorithm.


Assuming GP means General Purpose, yes, everyone who took an intro to algorithms should know that no sorting algorithm that relies on comparisons can do better than O(n*log(n)).

But non-comparison algorithms [0] like radix sort [1] that rely on simplifying assumptions are not a joke and can be used in practice in many real-world cases.

[0] https://en.wikipedia.org/wiki/Sorting_algorithm#Non-comparis...

[1] https://en.wikipedia.org/wiki/Radix_sort


No general purpose sorting algorithm, period, can do better than O(n * log(n)), regardless of whether it is comparison-based or not. For a radix sort with a complexity of O(n * w), where w is the key length, w converges to log(n) in the general case, since the number of bits needed to represent n distinct values grows logarithmically with n; hence you arrive back at O(n * log(n)). The O(n * log(n)) is an information-theoretic result that applies even to radix sort; to be more precise, the bound is Θ(log(n!)), which is Θ(n * log(n)).

If you fix w to a constant, then certainly one can claim that radix sort is O(n), but then so are comparison sorts as well.

That's not to say that radix sort isn't useful in practice, just because two sorting algorithms have the same asymptotic bounds doesn't mean they are equal, but it is to say that its performance is a matter of evaluating the constants rather than how it scales.


> since the number of bits needed to represent n values grows logarithmically with n

What you say only applies to sorting n unique/distinct keys. With no prohibition on duplicate keys, n & w vary quite independently. In this more general setting, radix sorting is indeed "really" O(nw) and w could be like 8 bits for some giant data set (EDIT and order is absolutely a useful property even with dups since there can be satellite data with the keys - also possibly bounded!). This is a common talking past each other problem in this area. To your credit you made this assumption explicit. (EDIT: "General purpose" may have some semantic unclarity, I suppose.)

In terms of relevance, everyone has their own, but my personal experience is that most (but not all) real world data allow both duplicate keys and bounding key width. In particular, the largest n data sets have been the least likely to have a prohibition on duplicate keys, and also the most likely to have small w (with a few exceptions). The random access pattern of memory writes in the passes of a radix sort may (or may not) be a problem, and constant factors matter, of course (as you also say). So tying w to n has always seemed a very special and somewhat artificial case to me.

I also have never heard of any practical O(n) comparison sort for fixed w before. Can you perhaps give a reference to some such algorithm? Thanks!


At some level, even comparing ‘a<b’ has a complexity related to the log of the magnitude of a and b. So if you’re doing a comparison sort over n unique values surely you’re going to run into a similar log n factor that the gp threw at the radix sort?


It depends how you count. There can be O(n log(n)^2) as you say, but I believe there are some comparison-based algos where you only have to compare "fewer bits" to decide each comparison. So bit-wise cmps might still be O(n log n). However, you do have to move whole keys, which are w bits wide. So, in full cost (cmp + space moves), unless you can do tricks to avoid the whole move, the comparison sorts do scale worse than radix-like digital sorts. And on general purpose CPUs people certainly compare batches of 8/16/32/64 bits at a time in HW. Personally, I think radix really does scale better, but maybe @Kranar can clarify his claim. (EDIT: there is often a random write pattern to digital sorts that can slow them down at massive scales, but that is definitely an {important} constant factor/different kind of "real machine" analysis.)


Maybe it has more to do with the actual distribution/form of the data? From my understanding radix sort can be incredibly fast when the numbers have fixed digits and evenly distributed in a predefined range (like phone numbers) but will be ineffective for floating point numbers or uneven distributions with extreme outliers.


That is why I specified "General Purpose". The moment you know something more about the data there might exist an opportunity to exploit that information to improve the algorithm.

Which is an interesting topic in itself. Many times I have met people who said with absolute authority "no, it is not possible to implement this" only to be confronted with a counterexample. And that is because real life is rarely about spherical cows in vacuum.

I once implemented a transactional, log-based database for a credit card terminal (20 MHz ARM, 2 MB total unified memory).

The read (get value for key) operation was O(n^3) which the reviewers decided is "ABSOLUTELY FUCKING UNACCEPTABLE" (their words).

The fun started when I asked them to suggest better implementation. They could not come up with anything even remotely as fast.

And the basic reason was that n was guaranteed to be small, as there wasn't even much space on the device for any more data. Other than that, it exploited the fantastic read-ahead capability of the device, plus it was so simple (basically a couple of nested loops) that the CPU cache could be used very effectively.

I remember their O(n log(n)) version was never faster, and was actually about 50 times slower in most operations.


I'm consistently amazed at how willing people are to make assumptions about optimizations with problems that don't fit their assumptions.

Nice job.


I think the main reason is our flawed education. The tasks you are given at school are usually constructed so that they fit the idealized knowledge you just learned.

What it teaches people is to shortcut the critical path of reasoning, which is to first make sure you actually understand the problem and that the problem actually fits the assumptions. People are not taught that even a small divergence from the assumptions can make a dramatic difference to the actual solution, or to the applicability of what they think is a solution. They are also not taught what the actual assumptions are or how to spot them. They talk about CS algorithms as if they ran on idealized computers, but then run them on real ones.

At school this is not taught, because when it seems the problem fits the material, it almost always does. People are actually rewarded for jumping to conclusions, as this helps them get through the material faster.

If you don't learn anything more about the world, you bring your school learning with you and try to apply it to the real world, and that's how a lot of developers operate.

Especially in tech, you can usually look up a solution to any problem easily. But for some reason this is the part everybody emphasizes, rather than recognizing that knowing when to apply the solution is far more valuable.

I also think this is part of the wisdom behind "premature optimization is the root of all evil".


What is wrong with Leetcode and similar, in a nutshell.


I once watched someone implement a heap in PHP in order to sort 4 domain names in order of priority and be able to resort them. The idea was I guess that at some point there may be 5 or even 6 of these and then we would really need the heap…


Amazing story, thanks for sharing!


Actually, for any specific size of N, every algorithm is O(1)...


I can’t believe someone implemented that in jq: https://rosettacode.org/wiki/Sorting_algorithms/Sleep_sort#j...

Pretty interesting implementation not actually sleeping tho.



From the paper:

> There is nothing good about this algorithm. It is slow – the algorithm obviously runs in Θ(n²) time, whether worst-case, average-case or best-case. It unnecessarily compares all pairs of positions, twice (but see Section 3). There seems to be no intuition behind it, and its correctness is not entirely obvious. You certainly do not want to use it as a first example to introduce students to sorting algorithms. It is not stable, does not work well for external sorting, cannot sort inputs arriving online, and does not benefit from partially sorted inputs. Its only appeal may be its simplicity, in terms of lines of code and the “symmetry” of the two loops.


Well, I teach Algorithms, and I think there is an obvious intuition as to why this works; I use this algorithm to introduce sorting.


Do you really use this algorithm? Look more closely: it compares every element against every other element, and the comparison appears to be in the wrong order.

This is not the Bubble Sort[0] as defined in Wikipedia. For one thing, it always has n^2 comparisons, is not always comparing adjacent elements, and does not terminate early.

[0] There are some who disagree and claim that it is.


Yes. I can read.

It finds the largest number and places it at `i`. In the next iteration the largest number from 0 to `i` is placed at `i+1`, right up until `j` reaches `i`, leaving a trail of orderliness.

As the value of `i` increases we approach and eventually equal ascending order.

What’s non-intuitive or what makes the comparison in the wrong order?

It’s much easier to explain if we just stop `j` at `i` each time.


> I use this algorithm to introduce sorting.

Do you have any (publicly-available) course materials that mention this algorithm? The OP mentions they were "unable to find any references to it", and I've been collecting references I find in this comment: https://news.ycombinator.com/item?id=28760986#28762275

> What’s non-intuitive or what makes the comparison in the wrong order?

In a sibling comment, 'ColinWright compares this algorithm to bubble sort, which has the comparison in the other direction. Another similar algorithm is this selection sort which differs by one character:

  for i = 1 to n do
    for j = i to n do
      if A[i] < A[j] then
        swap A[i] and A[j]
This also sorts the array, but in the opposite order. It's true that seen in the correct light, both of these algorithms can be seen to work, but it definitely confuses people. For example, these three answers on Math Stack Exchange and Stack Overflow are incorrect about the sorting order: https://math.stackexchange.com/a/876452 https://stackoverflow.com/a/61970459 https://stackoverflow.com/a/57118286


> What’s non-intuitive

Well, for me what's non-intuitive is that if we started with a fairly-well ordered array:

[ 1, 2, 3, 5, 8, 4, 9 ]

The very first thing the algorithm would do would be to swap 1 and 2, even though they are already in the right order. The next thing it would do would be to swap 2 and 3, making it even worse. And so on, all the way down the line. It takes six swaps to put 1 finally back to where it belongs, and the rest of the array is hopelessly out of order.

It does work, yes, and you can eventually grok it, but I feel like you're not being honest to call this a completely intuitive algorithm to new CS students.


> Yes. I can read.

I apologise.

Multiple comments elsewhere have been made by people who initially thought it was a Bubble Sort, and every other course and tutorial I know of starts with the Bubble Sort. So I admit to being surprised that you use this to teach algorithms. Personally, I find the algorithm given here completely unobvious, but perhaps that's just because I derived the Bubble Sort for myself back in the mid 70s, then read about O(n.log n) algorithms, and was hooked.

For me, this algorithm is very misleading, but if you find it a useful teaching tool then I've learned something, and I find it intriguing. I'd be interested to know if it's used in any other courses. Do you know of any?

snip description

> What’s non-intuitive or what makes the comparison in the wrong order?

It works, so obviously the comparison is right. But compare it with a classical Bubble Sort:

  procedure bubbleSort(
      A : list of sortable items
      )
    n := length(A)
    repeat
      swapped := false
      for i := 1 to n-1 inclusive do
        /* if these are out of order */
        if A[i-1] > A[i] then
            /* swap them and remember
               something changed */
            swap(A[i-1], A[i])
            swapped := true
        end if
      end for
    until not swapped
  end procedure
It's looking to see if A[i-1] > A[i], because then they are the wrong way round. Compare that with the code in this submission:

  if A[i] < A[j] then
  ...
It's the other direction, which pulls me up short.

And it's comparing every place with every other place, whereas Bubble Sort only ever compares adjacent items.

Being as familiar as I am with sorting algorithms in general, this, to me, feels unintuitive.

I guess my intuition doesn't match yours.


I see now that people seem to be equating intuition with making steady, obvious progress.

If you think about what the point of all the adjacent comparisons in the inner loop of bubble sort is: it is finding the largest number and moving it to the last place in the array. It keeps doing that in a loop; the next time, the second largest is found and placed at the second-to-last place, etc.

This algorithm doesn’t particularly do anything different; it just finds the largest and places it at `i` instead of the last place in the array. You’ll find that as it does its thing, the array from 0 to `i` is always in ascending order.

It’s very easy to teach this algorithm, then the next variation that stops the inner loop at `i` instead of `n`, and then the next variation that doesn’t do unnecessary swaps, which ends up being insertion sort, I think? I would have to check.


> This algorithm ... finds the largest and places it at `i` instead of the last place in the array.

This is immediately where my intuition for the correctness of this algorithm falls down. When `i` is 1 (in a 1-based array) this algorithm starts by putting the largest element in the 1st location.

Then it proceeds by putting the largest element in the 2nd location, then the 3rd, then the 4th, and so on, so I can see that the largest element ends up in the last place. What is harder to see is that in all the swapping about, it's guaranteed that the smallest ends up in the 1st location.

> * You’ll find that as it does it’s thing the array from 0 to `i` is always in ascending order.*

This requires (a) noticing that it's the invariant you want, and (b) showing that it's true. It feels like a lot more work than proving Bubble Sort works, and to me, it's a lot less "obvious" (for want of a better term). But in particular, you refer to "making steady, obvious progress." It's not obvious to me that in the mid-phases we are making progress. The largest element continues trudging upwards, yes, but it's not at all clear to me that what it leaves behind is ordered, and not chaotic.

I recognise that I'm not you, and I'm not in the same position as your students, but I still find this algorithm requiring a lot more work to understand than the Bubble Sort.

Is there any particular reason you use this algorithm as your starting point for teaching sorting?


It’s a lot more work to massage bubble sort into the other sorting algorithms they need to learn without scrapping the code and starting from scratch.


The algorithm is essentially just inserting the (i+1)-th element into the sorted subarray from 0 to i, and it inserts it in the correct place. I’ll be teaching this next week or the one after; I’ll try to record a video and share it if I remember. It’s harder to explain with just words.


I'd be fascinated to see this ... if you can do that I'd be grateful ... thank you.

(Copying from your sibling comment)

> It’s a lot more work to massage bubble sort into the other sorting algorithms they need to learn without scraping the code and starting from scratch.

An interesting observation. When I teach equivalent things I tend to present them separately and individually, so the students recognise that they are different in important ways, and then cross-connect them with transformations or other bridging techniques. Then they can see that they are different, but related.

I'll think about your approach next time I do something like this.

Meanwhile, if you would be willing to share a video of teaching this topic I'd be pleased to see it ... contact details are in my profile if you'd prefer direct contact, although I'm sure others would also be interested.


I'd like to watch that video as well.


Your original comment does not show that you are one of the few able to just read and understand a simple algorithm. Look around at the many comments here where people misread this simple algorithm and readily jump to premature conclusions. In that light I don't think you can say "there is an obvious intuition".


Who said it was bubble sort? It’s clearly not.


There are others in this discussion who are claiming that it is Bubble Sort. Some have recanted, others are sticking to their guns.

I am in email conversation with someone who is claiming that it is just a more efficient version of the original Bubble Sort, and I'm trying to get the time to track down their references.

So:

I don't think it is Bubble Sort, I think it's clearly different, I have said so elsewhere, not everyone agrees.


Hmm, come to think about it, it does have a lot going for it as an illustrative algorithm!

- It’s really simple

- It’s a bit counterintuitive, so makes a cute puzzle

- It’s a good exercise for a first correctness proof

- And it’s obviously a pretty pessimal algorithm, so students shouldn’t be tempted to use it in real code (hopefully...) Compared to bubble sort, which seems somewhat plausible, so people end up actually using it for real even though it’s a terrible algorithm.


Right. Isn't the intuition that the comparison and swap rule is applied to every possible combination? Put another way: How can you rearrange everything based on order and expect the net result to not be ordered at the end?


Note that the comparison is backwards, so by that line of thinking it would sort in reverse (descending) order …but it doesn't


I've seen now from other comments that I'm not alone in finding this algorithm unintuitive.

Can you share with us your intuition as to why it works?


This is fun but I do wish papers would stop using "obvious" or "not difficult" so frequently. This paper has five uses of "obvious," 3 uses of "clear" or "clearly," a "should not be difficult," a "what can possibly be simpler?," and a "It is difficult to imagine that this algorithm was not discovered before."

Is this a particularly complex algorithm or proof? No. But it reads like the author was terrified that someone would be able to understand their paper simply by reading it once and worried that would make the author look dumb. That's backwards, though. Easy to understand concepts are a good thing.


Of the five "obvious" words, two of them are used to say that something is not obvious and your last two examples are something entirely different. The uses here seem quite reasonable. But these words should be helpful, not insulting! If it's not clear, as the author claims, then you've likely misunderstood something. No point pondering it for twenty minutes, better go back and reread the statement first.

But of course they do get misused and there's not much more demoralizing than getting stuck on something 'obvious.'


You're going about it backwards. It's not that those phrases need to be removed from this paper, it's that this paper should have been a blog post.

The algorithm is not novel or in need of academic scrutiny. It's just a fun little sorting algorithm.


Plus I completely dislike the title. These kinds of papers treat arXiv as if it were a discussion group. It is not. It is a paper repository for things that have been written with a lot of care and thought.


Why does the paper have the title of a clickbaitey BuzzFeed article?


This is getting more and more common in science, as a researcher's reputation and funding is more and more connected to their ability to get media/internet attention.


This comment made me weep internally. It's soul-crushing how relentless self-promotion is now as necessary as good science for a "successful" academic career. (We can define that as getting paid anything at all by a university.)

I never really had the intellectual talent or salesmanship for that; my gripe is that academia has this unsettling undercurrent of boasting and bluffing and backstabbing, and it makes it just an unpleasant community in which to forge a livelihood.

Surely a healthy measure of these things has always existed, but the incentives have become perverse, and it really drives away people with a genuine aptitude and interest from areas where they could make genuine discoveries about nature and contribute to the sum of human knowledge.

And it drove me away too.


Oh yeah. I'm miserable. I have a 'dream career' but the reality of it is so unsettling. I work in a laboratory where collaboration only happens if it directly benefits someone's career in an "I'm famous" way... not an "I contributed to an important work" way. This is not unique. It's systemic.

My job largely revolves around seeming important, not doing good work.


>> My job largely revolves around seeming important, not doing good work.

That happens in companies too. It's really strange. OTOH we also know that any objective measures will be gamed, which I guess is the same thing. Perceptions are more important than reality sometimes.


It always was this way. Source - PhD in physics, 1988.


Because the whole paper is just a fun little note.


Yeah, this doesn't have anything to do with academic "self-promoting", it's just a fun little paper on Arxiv. (If the author really wanted self-promotion, they should have used Twitter instead of that boring red site...) Don't really understand why people have issues with this one.


So that those familiar with Betteridge's Law can just browse right on by


Having read the paper, Betteridge may actually not be true here :D


Is this really a new algorithm? I'm fairly sure my "Algorithms" professor* taught us this sorting algorithm twenty years ago. To be fair I don't think it was taught as a serious contender, not something to be implemented in production in the future, but instead as the baseline to measure the other algorithms, and also as teaching tool to introduce the fact that computers can't "instinctively look over" the whole set (like a human sorting a small set would do) but instead have to purposefully compare everything to everything.

*: (or maybe it was my "Data Structures" professor, I don't remember which of those basic courses covered the basics of sorting algorithms)


I obviously wasn't in your class, but what you're describing sounds more like the role 'bubble sort' plays in comp sci education. And while this looks similar to bubble sort it is very different in behavior.


I distinctly remember that bubble sort was presented later. We started with this naive double loop (and, as an alternative, the naive simple sort into a separate copy) and incrementally added optimisations to reach the various named algorithms.


Pretty much everything you, I, or someone else is likely to come up with, someone else will have thought of before. That's why we usually attribute inventions and discoveries to the first people to bring to market or publish.

Given the simplicity of this, obviously other people have found this algorithm before. But this might be the first publication mentioning it.


The 'that's why..' part seems like a non sequitur to me.


The article does not claim it is new. The introduction says:

> It is difficult to imagine that this algorithm was not discovered before, but we are unable to find any references to it.

The author would almost certainly agree with your other points, and has some other interesting things to say in the article.


Slightly OT, but it always slightly bugs me that sorting algorithms are taught *only* in the context of computers, which obey different physics than physically sorting things does. Has anyone done work on sorting algorithms under different cost functions than just "minimise comparisons"? I'm thinking something like:

1. Allocation of new space is free, at least up to 2x the input size.

2. Moving a contiguous set of elements is as cheap as moving one element.

3. Moving an element, or set of elements, has a cost proportional to the distance moved.


Not exactly what you are asking for but ...

To sort ~500 student tests by name, my favorite algorithm is

* Each person grabs a bunch of tests. Split the tests into 4 piles, using the first letter: A-F, G-L, M-Q, R-Z. The same 4 piles for everyone. (I don't remember the exact borders. It may depend on your language. Perhaps 5 piles is better.)

* Each person grabs a pile. Split each pile into subpiles by the first letter again. If some letter is too popular, split it using the second letter.

* Sort each small subpile, which has only 10-20 tests, probably using insertion sort.

* Make a giant pile.

This can be classified as a combination of radix sort and insertion sort. It's easy to parallelize across 5-10 people.

If you later realize there are ~50 more tests, just sort them using your favorite method and then merge them in, like in mergesort (or timsort?).
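
(Sketched in Python for fun, ignoring the multi-person parallelism and the second-letter split; the pile boundaries are the made-up ones from above:)

    def insertion_sort(pile):
        for k in range(1, len(pile)):
            item, j = pile[k], k - 1
            while j >= 0 and pile[j] > item:
                pile[j + 1] = pile[j]
                j -= 1
            pile[j + 1] = item
        return pile

    def sort_tests(names):
        # Radix step: distribute into piles by first letter (A-F, G-L, M-Q, R-Z).
        boundaries = ["G", "M", "R"]
        piles = [[] for _ in range(len(boundaries) + 1)]
        for name in names:
            piles[sum(name[0].upper() >= b for b in boundaries)].append(name)
        # Insertion-sort each small pile, then stack the piles back together.
        return [name for pile in piles for name in insertion_sort(pile)]

    print(sort_tests(["Mori", "Adams", "Garcia", "Zhou", "Brown"]))
    # ['Adams', 'Brown', 'Garcia', 'Mori', 'Zhou']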


What's the optimal sorting algorithm for your Bridge hand? Sorting and holding 13 cards is tricky enough to make me wonder.


Yeah it’s a little different because the human mind isn’t procedural — you can perceive multiple cards at once, at a fuzzy resolution. You look at a hand and see “okay an ace, 3 face cards, some other cards” then scan to resolve. So an optimal algorithm needs to take that cognition into account.


That's the sort of thing I had in mind, yes. Should probably write down what I think I'm doing.

Though in that specific situation, the sufficiently smart and motivated could probably infer hand information.




This weirdly/coincidentally [delete as appropriate] showed up as a StackOverflow "Hot" question today

https://stackoverflow.com/questions/69437526/what-is-this-od...


I'd say both weird and coincidental :-). I'm the author of that StackOverflow question and have nothing to do with that paper. As far as I understand arXiv (I'm not familiar with it), the paper was submitted to arXiv before the StackOverflow answer that inspired my question, and published by arXiv after my StackOverflow question: https://arxiv.org/list/cs.DS/recent


Yes, and the paper makes no mention of it, hm. Sorry to be a grumpy gatekeeper, but this really belongs as a blog post, not an arxiv paper, unless it were to cite more sources at least.


It now serves as a reference for this algorithm, something that was missing before and not well served by a blog post. It does not claim to have invented the algorithm.


The graphic in the first answer gives a nice demonstration.


Back in the 80's and 90's, working on consoles in very tight space constraints, there were times we’d go with a horribly inefficient implementation of something like sort, because we only needed to sort 8 things and the savings in bytes of code was more important than the savings in time from a “better” algorithm.

Those abominations could be tweaked throughout the development of the game, and along the way we'd sometimes "rediscover" a more common (sane) algorithm. The refinement in this paper reminded me a lot of that.


The amount of people in this thread that can't read an intro to a paper is too damn high.


I think I saw something similar many years ago in a side competition of the national Math Olympiad where you could use computers. [spoiler alert] IIRC one student used a weird sorting algorithm, and after looking at it for a long time and testing it, we called it a very inefficient insertion sort, and IIRC it was in the "wrong" direction. So my guess is that the student used this algorithm or something very similar.


Likewise, I also think this algorithm has been "discovered" many times but those who did were probably not intending to.



I turned in a similar algorithm for a college homework assignment, as a practical joke. (I'd been programming for 10 years already before I was able to go to college, so I was extremely bored in my first year Computer Science classes.)

I'm pretty sure I used the double-loop technique to drive it, but mine wasn't quite as simple because I layered the joke by also using multiple XOR, instead of swap, to propagate the values through the array.


For the people who don't want to open the PDF, here's the algorithm from the paper:

  for i = 1 to n do
    for j = 1 to n do
      if A[i] < A[j] then
        swap A[i] and A[j]


Well, many people who tried to implement bubble sort for the first time have "invented" this algo; it's just an unoptimized bubble sort rather than an insertion sort. Is it worth an article?


If you read elsewhere in this thread you'll see that there really is enough discussion to suggest that yes, this is worth an article.

And it really, really isn't Bubble Sort. One of the defining features of Bubble Sort is that it only ever compares elements in adjacent locations. Others are claiming that if you optimise this then it becomes Bubble Sort, but someone else observes that if you "optimise" differently you end up with Insertion Sort.

That suggests that this is not Bubble Sort. It's something different that can be mutated into Bubble Sort, but it is, of itself, not Bubble Sort.


I recall some kind of sort that requires the construction of a linked list for storing index numbers of the original array.

Only the final linked list (of index numbers) was in sorted order, and you then write it out summarily into the final array.

That would have consumed some n*(row-size) + n*(link-ptr-size + value-size) memory outside of its original array.

It probably did O(n^1.5) for the read operation, which is WAY better than the O(n^2) of bubble sort.

Best-in-its-class algorithm for swaps at O(2), worst case?

Delving into the swap breakdown, its write operation might be O(3n+1): writing a value, then updating a linked-list pointer for each entry, and writing its linked values into that final array; +1 for the NULL pointer at the end.

It was arguably the lowest O (way back then) but at an extremely high cost in memory. I never did learn which sort class that variant falls into.

EDITED: insertion class, memory-hog-style.
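
If I'm reading the description right, it sounds like a linked-list insertion sort over indices. A rough Python sketch of that idea (my own guess at it, not the original):

  def linked_index_sort(a):
    # next_idx[i] is the index of the element that follows a[i] in sorted
    # order; head is the index of the smallest element, None terminates.
    head = None
    next_idx = [None] * len(a)
    for i in range(len(a)):
      # Walk the list to find where a[i] belongs, then splice it in.
      prev, cur = None, head
      while cur is not None and a[cur] <= a[i]:
        prev, cur = cur, next_idx[cur]
      next_idx[i] = cur
      if prev is None:
        head = i
      else:
        next_idx[prev] = i
    # Finally, write the values out in linked-list order.
    out, cur = [], head
    while cur is not None:
      out.append(a[cur])
      cur = next_idx[cur]
    return out

  print(linked_index_sort([5, 4, 3, 2, 1, 6]))  # [1, 2, 3, 4, 5, 6]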


Now I remember: it was a college attempt to improve the read operations of an insertion-class sort algorithm.

One guy knocked the running time down considerably by doing a tiny pre-bubble-sort in blocks before entering the link-insertion phase, thereby making fewer passes over its million-entry linked lists: but only because he leveraged fitting within the CPU's L2/L3 cache. The professor said that the big-O order got worse, though not by a huge logarithmic factor, despite the speed-up in completion time.

Dang, the O order just got complicated with multi-level caches.


Is this a discovery!? I'm 100% sure I wrote this when I first started programming and obsessively tried to get the shortest solutions on codewars.


Did someone refer to it as a discovery? In fact, from the introduction of the article:

> It is difficult to imagine that this algorithm was not discovered before, but we are unable to find any references to it.


I expect it's "commonly rediscovered because it's so simple; rarely published because it's so obviously bad".


http://wiki.c2.com/?QuantumBogoSort

QuantumBogoSort is a quantum sorting algorithm which can sort any list in O(1), using the "many worlds" interpretation of quantum mechanics. It works as follows:

1. Quantumly randomise the list, such that there is no way of knowing what order the list is in until it is observed. This will divide the universe into O(n!) universes; however, the division has no cost, as it happens constantly anyway.

2. If the list is not sorted, destroy the universe. (This operation is left as an exercise to the reader.)

3. All remaining universes contain lists which are sorted.


The next generation will laugh at us for our first implementations of various algorithms in quantum computers. It will look like what you describe.

In fact, they’ll laugh at us for our first implementations of AI, which will look so stupid that they’ll call them BogoAI.


It has a perverse beauty and will stick in my mind for a while :)


cosmic ray sort

while (!IsSorted(array)) { }


"ICan’tBelieveItCanSort(A[1..n])" what an incredibly cheeky academic paper.. I love it.


Watch out, whiteboarding interviews.


Holy shit. I independently invented... well, not quite "invented", more like "stumbled into" this algorithm when I was in elementary school learning Applesoft BASIC, and wanted to sort an array of numbers. I had no clue how to do it and just intuited there had to be two loops, one inside the other. I got the comparison wrong, saw that it sorted backwards, and just changed > to < without asking myself why it worked.


Reminds me of the remarkably little-known AlgoRythmics sorting dances [0].

[0] https://www.youtube.com/user/AlgoRythmics/videos


Lenin Sort: guaranteed linear time, if element is not sorted, get rid of it! :-D


This is bubble sort

https://en.m.wikipedia.org/wiki/Bubble_sort

This should be in the standard college level CS classes.

Edit:

As mentioned elsewhere, this is not exactly the same as the textbook bubble sort.

When I wrote this, I meant it in the sense that each outer iteration "bubbles" an element up to its correct place by swapping.

On second thought, this also appears to just be a selection sort, as each iteration swaps the smallest element into place.

But overall, the algorithm is not surprising at all to me. It does seem unusual, though.


I suggest you read elsewhere in this discussion, where many people have claimed it is a Bubble Sort, and most have then changed their minds.

Consider, a Bubble Sort (as documented in TAoCP, for one) only ever compares elements in adjacent locations, whereas this routine compares every location to every other location.


Then think thrice. Selection sort gets its name from doing work to find, i.e. select, the overall smallest element. This algorithm just takes the element at position i and inserts it into its place in the sorted subarray to the left of i. It's an inefficient (and somewhat obscure) variant of insertion sort.


But overall it's not surprising to me...


I had a similar experience in college. The assignment was to print out a tree structure with any given initialization values. The professor and everyone else in the class printed the tree top down, like tree diagrams in textbooks. This required a bunch of fiddly spacing considerations. I realized that printing it out left to right required far simpler spacing. I think the final code was less than 20 lines with the core logic being 7 lines. The professor was very surprised it worked.
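
Something along these lines, roughly (a Python sketch of the sideways layout, not the original assignment code):

  class Node:
    def __init__(self, value, left=None, right=None):
      self.value, self.left, self.right = value, left, right

  def print_sideways(node, depth=0):
    # Right subtree on top, root indented by its depth, left subtree below.
    # Indentation alone encodes the structure, so no column math is needed.
    if node is None:
      return
    print_sideways(node.right, depth + 1)
    print("    " * depth + str(node.value))
    print_sideways(node.left, depth + 1)

  tree = Node(4, Node(2, Node(1), Node(3)), Node(6, Node(5), Node(7)))
  print_sideways(tree)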


Selection sort with extra swaps


Insertion sort with unnecessary comparisons.


For me, this is a lesson in the power of semantics. If one re-writes the if statement and switches the order,

   if A[j] > A[i] then
I find the correctness of the algorithm to be significantly more obvious.

It is true that i >= j, and sorted ascending values will have the property A[i] >= A[j].

It follows that swapping happens when the values are out of order, when

   A[j] > A[i]



"It is difficult to imagine that this algorithm was not discovered before, but we are unable to find any references to it." -- I've always maintained that for academics, if somebody hasn't written a paper for it, it doesn't exist. I can say I've seen this in actual code, in the wild, and I'm sure many others have.


It's more like, if you can't find it, it doesn't exist. Seeing something published is probably the most common way for academics to "find" something, but if this algorithm was present in a lot of different code bases, made its way into the Linux kernel, etc., that would have been good enough, too.


I've been using the same program for years (since 2015) for interviews, intentionally adding some issues and asking what the code does, how to fix the bug(s), what to name the function, what are some properties that are problematic, etc.

Kind of a fun interview, but it actually explores people's knowledge.


This is a great example of why state is so hard. The algorithm is so simple it could easily come out of a program designed to spit out all toy Algol programs, and yet what it does is very unclear.


It's only unclear if you are unable to disregard your preconceptions and simply read and understand this simple algorithm.


Not entirely unrelated, learning how merge sort worked and why it was more efficient than the naive first guess of insertion sort was the light bulb moment in my understanding of computer science.


The algorithm from the paper is indeed an astonishingly simple sorting algorithm, and it is indeed surprising that it works:

    void
    cantbelievesort(int *p, size_t n)
    {
      int tmp;
      for (size_t i = 0; i < n; i++) {
        for (size_t j = 0; j < n; j++) {
          if (p[i] < p[j]) tmp = p[i], p[i] = p[j], p[j] = tmp;
        }
      }
    }
That's 10 lines of code according to David A. Wheeler's 'SLOCCount,' though arguably 9 would be fairer. Its cyclomatic complexity is 3, with a deepest statement nesting level of 4, and it has 5 variables: three locals plus two arguments. It compiles to these 17 amd64 instructions, occupying 44 bytes of code, although I could probably shave off a third of that if I wrote it by hand:

      4007cc:       31 c0                   xor    %eax,%eax
      4007ce:       eb 22                   jmp    4007f2 <cantbelievesort+0x26>
      4007d0:       8b 0c 87                mov    (%rdi,%rax,4),%ecx
      4007d3:       44 8b 04 97             mov    (%rdi,%rdx,4),%r8d
      4007d7:       44 39 c1                cmp    %r8d,%ecx
      4007da:       7d 07                   jge    4007e3 <cantbelievesort+0x17>
      4007dc:       44 89 04 87             mov    %r8d,(%rdi,%rax,4)
      4007e0:       89 0c 97                mov    %ecx,(%rdi,%rdx,4)
      4007e3:       48 ff c2                inc    %rdx
      4007e6:       eb 02                   jmp    4007ea <cantbelievesort+0x1e>
      4007e8:       31 d2                   xor    %edx,%edx
      4007ea:       48 39 f2                cmp    %rsi,%rdx
      4007ed:       75 e1                   jne    4007d0 <cantbelievesort+0x4>
      4007ef:       48 ff c0                inc    %rax
      4007f2:       48 39 f0                cmp    %rsi,%rax
      4007f5:       75 f1                   jne    4007e8 <cantbelievesort+0x1c>
      4007f7:       c3                      retq   
However, arguably, this sort routine is even simpler. It may or may not be surprising that it works:

    void
    dumbsort(int *p, size_t n)
    {
      int tmp;
      for (size_t i = 1; i < n; i++) {
        if (p[i] < p[i-1]) tmp = p[i], p[i] = p[i-1], p[i-1] = tmp, i = 0;
      }
    }
By the same metric, that's only 8 lines of code (also only 8 by my "fairer" metric), its cyclomatic complexity is only 2, its deepest statement nesting level is only 3, and it has only 4 variables (the same two arguments, but two locals instead of three). It compiles to only 15 amd64 instructions, occupying only 43 bytes:

      4007f8:       b8 01 00 00 00          mov    $0x1,%eax
      4007fd:       eb 1e                   jmp    40081d <dumbsort+0x25>
      4007ff:       4c 8d 04 87             lea    (%rdi,%rax,4),%r8
      400803:       48 8d 54 87 fc          lea    -0x4(%rdi,%rax,4),%rdx
      400808:       41 8b 08                mov    (%r8),%ecx
      40080b:       44 8b 0a                mov    (%rdx),%r9d
      40080e:       44 39 c9                cmp    %r9d,%ecx
      400811:       7d 07                   jge    40081a <dumbsort+0x22>
      400813:       45 89 08                mov    %r9d,(%r8)
      400816:       31 c0                   xor    %eax,%eax
      400818:       89 0a                   mov    %ecx,(%rdx)
      40081a:       48 ff c0                inc    %rax
      40081d:       48 39 f0                cmp    %rsi,%rax
      400820:       72 dd                   jb     4007ff <dumbsort+0x7>
      400822:       c3                      retq   
On every complexity metric that was ready to hand, dumbsort is simpler than cantbelievesort. Except computational complexity: dumbsort is O(N³) while cantbelievesort is only O(N²).

Dumbsort is similar to gnome sort, but arguably somewhat simpler: whenever it finds an out-of-order pair, after reversing it, it throws up its hands and starts over (i = 0, xor %eax, %eax).

I don't think I invented it, but I can't remember who did. I'd be grateful for pointers.


Fun algorithm! But I think translating to C and counting lines of code ends up overestimating the complexity of looping from 1 to n. And using a temporary to swap is definitely overestimating the complexity, though that doesn't affect the comparison.

If we treat looping up to n as one line of code, then both algorithms have the same count. And I think it's pretty fair to do that. Fortran, BASIC, APL, Perl, and so many others have syntax that's basically "for i = x to y". Let alone mathy languages that can loop over an array with a single character. Honestly, C is kind of weird for not having syntax for that.

For cyclomatic complexity, I don't know, setting i to 0 seems like cheating.

You definitely win on variable use.

Separate from the analysis, I have to wonder if there's a clever way to work in a saturating i-=2 instead of i=0. Then you can have N² efficiency without really changing the complexity.


Any measurement of "code complexity" is relative to some particular computational basis; which one do we choose?

In APL sorting the array P in ascending order is just

    P[⍋P]
which result you can optionally assign back to P by prepending "P←". So, by the APL measure, none of these algorithms is "as simple as" invoking the builtin sort operator. (Which is probably also faster.) APL doesn't actually have any 'syntax that's basically "for i = x to y".'

If we take Perl golf as our measure of "code complexity" you can't use a for-loop for dumbsort, because zeroing the ostensible loop counter doesn't reset the loop; there's a hidden loop counter you can't alter. You have to use an explicit while loop, which makes the cantbelievesort much simpler. Smalltalk and Python are the same way. But of course the shortest way to sort in all three languages is to invoke the built-in sort function: sort @p, p asSortedCollection, p.sort().

(You're right about FORTRAN and BASIC, though. You can futz with loop counters in those languages, just like in C.)

That's why I chose C and machine code as my metric. Smalltalk and Python maintain this hidden loop counter in the implementation, and also do garbage collection behind the scenes (which you might think was irrelevant, but the write barrier can significantly slow down heavily mutating algorithms like this one) and bounds checking. Numpy and APL additionally do implicit array broadcasting. All of these languages (except FORTRAN and BASIC) have built-in sort routines that are simpler to invoke than writing your own sort function. So I was looking for the language that would keep me as honest as possible.

But of course in a different machine language the answer might be different. For example, I haven't seen an instruction set with dedicated looping instructions that support nested loops, but make it inconvenient to clobber the loop counter in the middle of the loop, but it would be easy to design one, and there might be some advantage to doing so. (Code density, maybe, or unrolling iterations of the loop into a pipeline.) Then dumbsort would lose, because as in Python, it would have to be written as a regular while loop, which usually costs four instructions: an increment, a compare, a conditional jump, and an unconditional jump.

I think the i-=2 tweak you suggest converts dumbsort into gnomesort, which is just a sort of pessimized insertion sort. As your "saturating" comment suggests, gnomesort needs an additional operation to avoid running off the beginning of the array; some instruction sets do provide saturating arithmetic (it's often useful for DSP and graphics, even if it's not as often useful for indexing arrays), and many others provide a binary "max" operation (SSE's PMAXSW and friends, as well as, say, APL ⌈) which could convert an ordinary subtraction into a saturating subtraction.


I don't have a rigorous method here, but I'd say a builtin sort is cheating a complexity count while "for each" is fine.

> If we take Perl golf as our measure of "code complexity" you can't use a for-loop for dumbsort, because zeroing the ostensible loop counter doesn't reset the loop; there's a hidden loop counter you can't alter. You have to use an explicit while loop, which makes the cantbelievesort much simpler. Smalltalk and Python are the same way.

I think it can be reasonable to say that manipulating a loop counter is more complex than a strict loop over elements. Though you can get around that issue by using tail recursion for dumbsort instead of setting i to 0.

> But of course in a different machine language the answer might be different. For example, I haven't seen an instruction set with dedicated looping instructions that support nested loops, but make it inconvenient to clobber the loop counter in the middle of the loop, but it would be easy to design one, and there might be some advantage to doing so.

Maybe not nestable ones, but that's just a quirk of trying to have an especially compact instruction encoding. If the common case can be a single opcode like LOOP, or only a couple opcodes in a RISC architecture, I think it's reasonable to call it "one line of code".

> I think the i-=2 tweak you suggest converts dumbsort into gnomesort, which is just a sort of pessimized insertion sort. As your "saturating" comment suggests, gnomesort needs an additional operation to avoid running off the beginning of the array; some instruction sets do provide saturating arithmetic (it's often useful for DSP and graphics, even if it's not as often useful for indexing arrays), and many others provide a binary "max" operation (SSE's PMAXSW and friends, as well as, say, APL ⌈) which could convert an ordinary subtraction into a saturating subtraction.

Is there a way to write gnomesort that only has two simple conditionals? You're right that my suggestion brings it a lot closer to gnomesort, but I'm definitely not suggesting anything that checks if you're at zero.


I hadn't thought of writing a tail-recursive dumbsort, though I'm not sure this is really simpler (untested):

    int dumbsort2(int *p, size_t n, size_t i);  /* forward declaration */

    void
    dumbsort(int *p, size_t n)
    {
      dumbsort2(p, n, 1);
    }

    int
    dumbsort2(int *p, size_t n, size_t i)
    {
      int tmp;
      return i == n ? 0
                : p[i] < p[i-1] ? (tmp = p[i],
                                   p[i] = p[i-1],
                                   p[i-1] = tmp,
                                   dumbsort2(p, n, 1))
                : dumbsort2(p, n, i+1);
    }
> If the common case can be a single opcode like LOOP

The thing about the 8086 LOOP instruction is that it does expose the loop counter conveniently; it's just CX (or ECX on i386, RCX on amd64). So restarting it is just a MOV (not an XOR, because it counts down to 0).

Forth offers an instructive example of a set of operations (usually implemented in software rather than hardware) that support nested loops. Forth DO pushes both the loop counter and the upper bound on the "return stack" as what the ANSI standard calls a "do-sys"; there are words I, J, and K which get the innermost, second-innermost, and third-innermost loop counters, but cannot be used to set them. You can even save the contents of the return stack and restore it, but ANSI doesn't give you a way to restart the loop counter (which is certainly bad practice).

But that's just a matter of portability; any concrete Forth necessarily has to have a reliable way to set the loop counter so that LOOP can increment it. For CPUs, that circuitry wouldn't have to be accessible as an instruction, but general-purpose CPUs have a different concern: they need to have some way to save and restore all their architectural registers for preemptive context switching, which would include that loop counter — even if it wasn't exposed as a GPR like the i386 ECX. You could imagine providing that ability only via something like i386 PUSHAD, which would be a pretty inconvenient and slow way to restart the loop counter. But I'm not familiar with any instruction sets that work that way.

> Is there a way to write gnomesort that only has two simple conditionals? You're right that my suggestion brings it a lot closer to gnomesort, but I'm definitely not suggesting anything that checks if you're at zero.

I think that if you have a signed max() instruction that doesn't count as a conditional, this produces gnomesort with only two simple conditionals (untested):

    void
    browniesort(int *p, size_t n)
    {
      int tmp;
      for (size_t i = 1; i < n; i++) {
        if (p[i] < p[i-1]) tmp = p[i], p[i] = p[i-1], p[i-1] = tmp, i = max(i-2, 0);
      }
    }
I was thinking that maybe the "standard" gnome sort avoids redundantly comparing the first two entries again after swapping them, instead immediately stepping forward, but in fact both Dick Grune's C at https://dickgrune.com/Programs/gnomesort.html and the pseudocode in https://en.wikipedia.org/wiki/Gnome_sort do the same redundant comparison of the two items they just swapped. (Hamid Sarbazi-Azad's original "stupid sort" does not do this, but Grune considers it "the same algorithm".)

I think not counting max() as a conditional may be reasonable because, as I said, a number of CPU instruction sets provide it as a fundamental operation. At the hardware level it's no more conditional than the multiplexers used to read an operand from a source register in the register file.


> I hadn't thought of writing a tail-recursive dumbsort, though I'm not sure this is really simpler (untested):

I meant something more like this:

    void
    dumbsort(int *p, size_t n)
    {
      int tmp;
      for (size_t i = 1; i < n; i++) {
        if (p[i] < p[i-1]) {
          tmp = p[i];
          p[i] = p[i-1];
          p[i-1] = tmp;
          dumbsort(p, n);  /* tail call restarts the scan */
          return;
        }
      }
    }
The same as the original except it doesn't need to manipulate the loop variable.

In the assembly version, you replace wiping eax with a jump.


Oh, of course you're right. In FORTH you'd need an UNLOOP there if you were using a DO LOOP.

It's kind of bogus that the 8086 loop-counter register CX isn't one of the index registers, isn't it?


I am not clear why the proof is so long. By induction, the first i elements are already smaller than A[i], so it's essentially an unoptimised version of

for i: 0..n-1 A[i] = min(A[i:n])


After the first iteration of the loop (and every other one too), A[i] is the maximum of the array, not the minimum.


Oh crap, now I see it. Thanks!


It would make for a simple hardware implementation. More general than fixed-size sorting networks. If it all happens in parallel you trade gate area for speed.


Interesting, but I'm amused by the writing style. I'm assuming it won't be submitted to a journal because the style reads like a blogpost.


It is simply an inefficient bubble sort. A bubble sort makes an improvement by not comparing values that are already sorted. Eons ago this example was a common way to introduce making algorithms more efficient. Naive Implementation -> Better Implementation.

    // A function to implement bubble sort
    void bubbleSort(int arr[], int n)
    {
      int i, j;
      for (i = 0; i < n-1; i++)
        // Last i elements are already in place
        for (j = 0; j < n-i-1; j++)
          if (arr[j] > arr[j+1])
            swap(&arr[j], &arr[j+1]);
    }


If you read elsewhere in this discussion you will see many people giving reasons why this is not simply an inefficient Bubble Sort. You may disagree with them, as others do, but the situation seems rather more complicated than just saying "Oh, it's just a Bubble Sort."


No. Look again. The compare is opposite of bubble sort’s.


I’m pretty sure I’ve used this algorithm in situations where I needed to quickly sort for small values of n and was too lazy to find a better way.


As an algorithm, this is simple (4 very basic lines of code).

As an approach to sorting, this is fairly complex - many simpler to understand solutions exist.


The only thing surprising about this is how bad many people's education is. If you don't learn a slow way to do something, how can you know whether a fast way is fast? If you don't try to solve basic problems for yourself, how can you expect to solve hard problems when you get to them? Entirely self-taught people will likely have learned this. Children before the age of computers "invented" this algorithm, and many others, while sorting sticks on the beach.


This is not selection sort. It is not insertion sort. And it is not bubble sort. Those three algorithms are sort of intuitive. This is not.


No one repping the old drop sort. Just returns the first item of the list, which happens to be an ordered list. Simples.


I think bubble sort is the simplest one to understand.

I still use it in a pinch, the delay is barely noticeable on today's hardware.


If it contains an "if" statement then GPUs won't like it, and I wouldn't call it "simple".


Let me re-interpret what the i-th iteration of the algorithm does:

1. for j = 1..i-1, we're trying to "insert" a[i] into a[1..i-1]. Try simulating it yourself. I believe you will find out it's insertion sort.

2. for j = i..n, we're replacing a[i] with the smallest element among a[i..n].

Why it works: either (1) or (2) alone CAN DO THE SORTING. They do not conflict with each other.

Therefore, we can "optimize" this algorithm in at least two ways.

BTW, hope future powerful compilers can do this :)


I think you misread the code, the comparison is the opposite of what you might expect, so a[i] is always the largest element, and it is not inserted into a[1..i-1]


No, GP is correct. At the beginning of an outer loop iteration, a[1..i-1] is sorted with a[i-1] the maximum element. The inner loop inserts a[i] into a[1..i-1] so that at the end of the iteration, a[1..i] is sorted with a[i] the max.


No, GGP is incorrect. To be clear, they wrote:

> Letme re-interpret what the i-th iteration of the algorithm does:

> 1. for j = 1..i-1, we're trying to "insert" a[i] into a[1..i-1]. Try simulating it yourself. I believe you will find out it's insertion sort.

> 2. for j = i..n, we're replacing a[i] with the smallest element among a[i..n].

That is, they're saying in (2) that after an iteration of the outer loop, the ith value will be the smallest of all the values that succeed it in the sequence. This is demonstrably false. It is always the maximum value of the sequence itself, regardless of its initial position relative to i.

The following Python code is used to print out the result of running the inner loop on an arbitrary sequence but without altering the original value (so it won't actually be sorted at the end). The point is to demonstrate that the maximum value always ends in the ith position:

  def inner(a, i):
    for j in range(len(a)):
      if a[i] < a[j]:
        a[i], a[j] = a[j], a[i]
    print(a)

  def outer(a):
    for i in range(len(a)):
      inner(a.copy(), i)
  a = [5,4,3,2,1,6]
  outer(a)

  [6, 4, 3, 2, 1, 5]
  [4, 6, 3, 2, 1, 5]
  [3, 4, 6, 2, 1, 5]
  [2, 4, 3, 6, 1, 5]
  [1, 4, 3, 2, 6, 5]
  [5, 4, 3, 2, 1, 6]
Note that 6 ends up in the ith position after finishing the inner loop. If we let it sort (so remove .copy()), after each iteration the maximum value is, again, always at the ith position:

  [6, 4, 3, 2, 1, 5]
  [4, 6, 3, 2, 1, 5]
  [3, 4, 6, 2, 1, 5]
  [2, 3, 4, 6, 1, 5]
  [1, 2, 3, 4, 6, 5]
  [1, 2, 3, 4, 5, 6]
GGP is correct that it's basically insertion sort otherwise, (1). To illustrate, the behavior if you don't compare every pair is the same as insertion sort:

  [5, 4, 3, 2, 1, 6]
  [4, 5, 3, 2, 1, 6]
  [3, 4, 5, 2, 1, 6]
  [2, 3, 4, 5, 1, 6]
  [1, 2, 3, 4, 5, 6]
  [1, 2, 3, 4, 5, 6]
This is done by stopping the inner loop once j reaches i.


Yeah, you're right, (2) is wrong. (1) is correct.


Hmm, I was taught this is the “lazy” variation of bubble sort with a lot of unnecessary comparisons.


Are you sure? With two separately incrementing indexes?

Lazy bubble sort would still have a double loop counting to n, but it should be comparing A[j] and A[j+1]. 'i' would not be used inside the loop.
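
For contrast, a rough sketch of that "lazy" bubble sort in Python (illustrative only; the outer counter never appears inside the inner loop):

  def lazy_bubble_sort(a):
    n = len(a)
    for _ in range(n):          # outer loop just repeats n times
      for j in range(n - 1):    # always compares adjacent elements
        if a[j] > a[j + 1]:
          a[j], a[j + 1] = a[j + 1], a[j]
    return a

  print(lazy_bubble_sort([5, 4, 3, 2, 1, 6]))  # [1, 2, 3, 4, 5, 6]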


Isn't this basically selection sort with extra (useless) iterations in the inner loop?


No, it's basically insertion sort. With unnecessary comparisons yes.


tl;dr the algorithm (the array is 1-based):

  for i = 1 to n do
    for j = 1 to n do
      if A[i] < A[j] then
        swap A[i] and A[j]


This is what it looks like: https://xosh.org/VisualizingSorts/sorting.html#IYZwngdgxgBAZ...

It behaves more like insertion sort.


Break the inner loop into three parts, j < i, j == i, j > i, and you can see it's insertion sort with extra steps.

  for i = 1 to n do
    - Inserts A[i] into A[1 to i-1], using the A[i]
    - spot as a temp variable.
    for j = 1 to i-1 do
      if A[i] < A[j] then
        swap A[i] and A[j]

    - Does nothing.
    for j = i do
      if A[i] < A[j] then
        swap A[i] and A[j]

    - Permutes A[i to n], placing the maximum into
    - A[i]. Since A[i] already has the max after the
    - first iteration, it won't change, and we don't
    - care how A[i+1 to n] is permuted since it gets
    - sorted anyway.
    for j = i+1 to n do
      if A[i] < A[j] then
        swap A[i] and A[j]


And if you start the inner loop at "j=i" and flip the comparison order, it turns into something very similar to a very slow selection sort.
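
A rough Python sketch of that variant (illustrative, not from the paper): starting the inner loop at j = i and flipping the comparison drags the minimum of a[i:] into position i, with extra intermediate swaps along the way.

  def slow_selection_sort(a):
    n = len(a)
    for i in range(n):
      for j in range(i, n):     # inner loop starts at j = i
        if a[j] < a[i]:         # comparison flipped relative to the paper
          a[i], a[j] = a[j], a[i]
    return a

  print(slow_selection_sort([5, 4, 3, 2, 1, 6]))  # [1, 2, 3, 4, 5, 6]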


Wow this page is very cool! I recommend clicking these four boxes:

    - quick sort
    - merge sort
    - custom sort (this naive algorithm)
    - bubble sort
Then click “start”.

And then you can clearly see the difference side by side, including the speed of the sort!

Quick sort is quick! Merge sort is better in the worst case but slower on this random data.


What you are seeing isn't speed but iterations. If you look at the source, I am drawing at a certain location in the loop: sometimes when the swap happens, sometimes afterwards. More than speed, you are seeing the steps.


I think that is what I mean -- when there are fewer steps in the sort, then the visualization completes faster.

So if you visualize two sorts side by side, it's obvious which ones completes faster, and that corresponds to the algorithm's speed, right?


Each draw step is one draw call. For example in some algos where there isn't a clear swap operation I draw the whole array in one step, which may or may not reflect all operations that happened before that draw call.

https://xosh.org/VisualizingSorts/sorting.html#IYZwngdgxgBAZ...

You can change the detail level slider while it's running and see how it changes the animation. At level 1 (the higher level) this and insertion sort look the same.


This looks interesting, but I can't get it to do anything. I see a purple gradient square, I see some widgets on the upper right. I see a box of code on the left. None of it seems to visualize a sort or even respond to input. What do I do?


It's a little confusing, but playing around with it I was able to understand it.

First of all, there's a Start button at the bottom of the right menu. Hit that and you will see that big color gradient sort itself.

However, that massive square is useless for seeing what's going on. So instead change the size to its lowest (8) and the delay to something like 400.

Now what you've got is 8 randomly-sorted columns. When you hit Start that algorithm will start sorting them each in parallel. You can just look at a single column and see the colors getting swapped. The darkest color moves to the top very quickly. This is a feature of how this particular algorithm works.

What's the point of the big square? Well, for one thing it's an interesting way to compare algorithms. Go back to the settings for a large number, a very low delay, and check off two other sorting algorithms, say Comb Sort and Heap sort. Hit Start and you'll see three very different approaches to sorting the columns.


Click Start at the bottom of control panel on right.


Is the "Parallel Merge Sort" working? It doesn't seem to be doing anything ...


Fixed


Super! Thank you.


Viewing this as an "algorithm" may be why people find it to be surprising*. I don't find it terribly surprising, because I've played with a lot of sorting networks. Viewed as a sorting network, it's really clear how and why the algorithm works (and where it wastes half of its time).

* though, edflsafoiewq's analysis is nice and tidy

edit: https://imgur.com/a/gXYE2Ru


At first it's entirely un-intuitive and obviously slow, but I kind of like the elegance, and the use of "<" makes sense once you realize that the final sweep will be with an i at the end of the array so that j will be ahead of it.


For almost two decades I thought this was the bubble sort algorithm.


Bubble sort only swaps adjacent pairs (thus "bubble": visualized, the values "bubble up" to their correct spot) and is often shortened so that the inner loop runs over shorter segments (because the smallest or largest value, depending on how you write it, has already been placed, so there's no need to check that section again). This algorithm will swap non-adjacent pairs, making it more like insertion sort.


what does 1-based mean?


The first element of the array is A[1], rather than A[0] like it would be in nearly all programming languages.


Nearly all meaning the non-scientific computing ones, and particularly the ones with syntax inspired by C. Fortran, APL, Matlab, Wolfram, Mathematica, R, Julia, Cobol, Smalltalk and Lua are all 1-based, at least by default.


Or maybe not so much inspired by C, but inspired by similar arguments as Dijkstra makes in his "Why numbering should start at zero": https://www.cs.utexas.edu/users/EWD/transcriptions/EWD08xx/E....


There is of course the compromise position of 0.5.



Huh. I thought it was obvious: e.g., the top-left corner of the screen is (0, 0), the bottom-right corner is (640, 480), and pixels are 1x1 — we take the coordinate of their top-left corner to be their coordinate, so the top-left pixel has coordinates (0, 0), and the bottom-right one has (639, 479), and obviously the coordinates of their centres are (0.5, 0.5) and (639.5, 479.5).

But apparently some people treated pixels as geometrical points without size?


It's normal in signal processing to treat samples (whether audio or image) as points. A lot of things stop making sense if you replace them with line segments and rectangles.


I prefer 9. It makes no less sense than 1, but it is conveniently much closer to the + and - keys.


My 0 key is twice as big as the rest of the digits.


Why compromise? Just have a helpful variable like Perl's $[ to set the first element index.


Which is rounded down to 0 or up to 1 at random.


The array indexes start at 1, not 0.


Its entries are A[1], A[2],...,A[n] instead of A[0], A[1],...,A[n-1].


It means you probably shouldn't be on Hackernews


It means they should definitely be on HN where they're very likely to learn new things.


Welcoming open, thoughtful discussion and having people much smarter and with more experience than myself available to answer questions in which I'm not an expert is one of the main reasons I frequent HN.


> too young to know FORTRAN.

If you don't know, you don't know.


aka bubble sort


It is definitely not bubble sort, which only uses a single index comparing items next to each other in the array.


Check the direction of the comparison, and check the bounds of the loops. It's not really bubble sort.


And check whether the compared elements are adjacent.


I feel like bubble sort is simpler.

  repeat n times:
    for i = 1 to n-1:
      if a[i] > a[i+1]:
        swap(a[i], a[i+1])
It's true you have to add and subtract 1, but you only have one variable, and of course the rationale for why it works is far simpler.


Is this the simplest sorting algorithm ever?

Quite possibly, but the paper is seven pages long!


A simpler sorting algorithm might be:

until sorted(A) {assign a random order to the elements of A}

There are n! ways to assign indices to the n items, so the expected time for the algorithm to complete grows at least as fast as n!, i.e. not ideal.
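
A minimal Python sketch of that idea, for the curious:

  import random

  def randomise_until_sorted(a):
    # Keep shuffling until the list happens to be in order.
    # Expected running time grows roughly like n * n!, so keep n tiny.
    while any(a[i] > a[i + 1] for i in range(len(a) - 1)):
      random.shuffle(a)
    return a

  print(randomise_until_sorted([3, 1, 2]))  # [1, 2, 3]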


This is commonly referred to as "bogosort", the archetype of the perversely bad algorithm.


Is that really simpler? Both “until sorted” and “assign a random order” look significantly more complex than this entire algorithm.


isn't that bubble sort?


No, bubble sort uses a single index to compare items that are next to each other, and also has the condition reversed (> vs <), which is part of the surprise that this sort still has the correct increasing order.


Not really. Bubble sort compares adjacent elements, while this compares every element with every other.


Yes.

Bubble sort is the comparison of every pair.

If you compare only adjacent pairs, you eliminate an enormous number of redundant comparisons.

This is bubble sort without the redundancy elimination. It's just that the elimination is so common that many people don't know about it anymore, and think it's a defining part of the algorithm.


> Bubble sort is the comparison of every pair.

I'm not sure what you mean here, but it seems like you're saying that bubble sort has every pair compared with every other pair, and that's not the case. A defining feature of Bubble Sort is that it only ever compares adjacent elements.

The sort presented here is definitely not Bubble Sort.


I wish you would read the rest of the comment. I clearly explain this.


I did. Really, I did. I just happen to disagree with you.

Now, it might be that you have more information than I have, and that would be interesting, but based on what you've said, I still think the algorithm presented here is genuinely not Bubble Sort.

You said:

> the elimination is so common that many people don't know about it anymore, and think it's a defining part of the algorithm.

Perhaps I should have emphasised the "is", and said:

>> A defining feature of Bubble Sort is that it only ever compares adjacent elements.

That might have served to emphasise that I had read your comment and was explicitly disagreeing with you. You claim otherwise ... perhaps I should have asked for an explicit reference from you to support your position, and provided for you the Wikipedia code to support mine.

In particular, on Wikipedia it says:

    procedure bubbleSort(A : list of sortable items)
      n := length(A)
      repeat
        swapped := false
        for i := 1 to n-1 inclusive do
          /* if this pair is out of order */
          if A[i-1] > A[i] then
            /* swap them and remember
               something changed */
            swap(A[i-1], A[i])
            swapped := true
          end if
        end for
      until not swapped
    end procedure
So taking this as the definition, it explicitly only ever compares adjacent elements of the array. And that's what I said. It might be that it has derived from a version that does more comparisons, and I'd be interested to see evidence for that, but this seems to be the definition of what we call Bubble Sort.

More, the algorithm in the linked paper appears at first glance to have the comparison the wrong way round. That's really different from the Bubble Sort that I know. It also explicitly performs n^2 comparisons, and Bubble Sort performs n-1 on each pass, and when it runs the full length every time, it performs (n-1)^2 comparisons at most. It really feels very different.


My original comment covers all of this.

.

> So taking this as the definition

Random wikipedia code isn't a definition.

Have a nice day


> Random wikipedia code isn't a definition.

Let's look at a few other sites:

"Bubble sort is a sorting algorithm that works by repeatedly stepping through lists that need to be sorted, comparing each pair of adjacent items and swapping them if they are in the wrong order." -- https://www.techopedia.com/definition/3757/bubble-sort

"Bubble ... compares number X to the adjacent number Y." -- https://airfocus.com/glossary/what-is-bubble-sort/

"Bubble Sort ... works by repeatedly swapping the adjacent elements ..." -- https://www.geeksforgeeks.org/bubble-sort/

"... bubble sort works by comparing adjacent pairs of objects in the array." -- http://pkirs.utep.edu/cis3355/Tutorials/chapter9/tutorial9A/...

"Bubble sort ... each pair of adjacent elements is compared and the elements are swapped if they are not in order." -- https://www.tutorialspoint.com/data_structures_algorithms/bu...

"Bubble sort ... Compare the current number with the next number." -- https://www.bbc.co.uk/bitesize/guides/zm77xfr/revision/5

"Bubble sort ... compares two adjacent elements and swaps them until they are not in the intended order." -- https://www.programiz.com/dsa/bubble-sort

========

Added in edit:

Scans from TAoCP, Volume 3:

https://www.solipsys.co.uk/images/BubbleSort_a.png

https://www.solipsys.co.uk/images/BubbleSort_b.png

========

I'm really baffled by your claim. I've never seen anything other than the definition or implementation saying that it's adjacent elements being compared. Everywhere I've ever seen it's been a defining feature of the algorithm definition ... yet you claim otherwise.

Can you provide any explicit reference where the Bubble Sort is not explicitly comparing adjacent elements?


This is boring. I'm sorry you're unable to read my first comment, and keep requesting references you've already received.

Notably, you're even providing scans from the TAoCP page I gave you, which have the relevant text carefully trimmed away, and are showing individual paragraphs whose text does not attempt to give a definition, despite the fact that a definition is present on those pages.

Who are you trying to convince, Frank? Nobody's reading this thread this late. Did you think I just didn't know what else was on the pages I gave you?

When an anti-vaxxer wants to make a point, they have to turn to hilariously inappropriate sources to pretend they're defended.

Techopedia, tutorials point, BBC news, a 2008 tutorial written by a student, and a random set of tutorials on a page full of broken links and misspellings. And very, very carefully edited Knuth.

Amusingly, one of them even explicitly spells out the same thing I did.

I see you're turning to the best in the business. And ignoring the coiner, again.

Good luck to you.


I read your comment ... you are calling me a liar, and I don't appreciate that.

I'm providing scans of exactly the part where Knuth says the Bubble Sort only compares adjacent locations, just as I said.

You said:

> Bubble sort is the comparison of every pair.

This is contradicted by the scans of TAoCP that I've provided.

You said:

> If you compare only adjacent pairs, you eliminate an enormous number of redundant comparisons.

That's true, but you are eliminating redundant comparisons from something that is not a Bubble Sort.

> This is bubble sort without the redundancy elimination.

This is contradicted by the scanned excerpts from TAoCP.

> It's just that the elimination is so common that many people don't know about it anymore, and think it's a defining part of the algorithm.

The comparisons between locations that are not adjacent was never part of the Bubble Sort. You continue to claim otherwise, I have checked your references, as far as I can see they don't support your assertion. Please provide scans to support your claim.

Otherwise I bid you good day, and good health.


Bubble sort will not make any changes to a sorted array. It's a similar number of comparisons. They're just different ones.


It's a radically different number of comparisons

Bubble sort scans the full array repeatedly until finished, but only along the pairwise edge, and usually finishes way early

So in an array of 20 elements you have a maximum of 380 comparisons

This searches Euler's Little Triangle the whole way every time, so in an array of 20 elements, you have a maximum of 380 comparisons again

Seems similar, right?

Except doing pairwise you're likely to exit without doing all 19 scans - at the logarithm if it's well distributed - so it probably does either 4 or 5 scans, saving ~80% of the work

The issue isn't the ceiling; it's the common floor

Doing it the seemingly-normal way radically changes the common floor


Thank you for making my point.

So not only does this do different comparisons, it "radically" changes the number of "common floor" work.

And by the way, there's something wrong with your analysis of the number of scans needed for the 20 element bubble sort. I don't understand your argument, but I did some experiments with a randomly distributed array. I never saw it sort in fewer than 12 scans, after a few dozen trials.
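
Something along these lines (a rough Python sketch of that kind of experiment, not the exact code I ran):

  import random

  def bubble_sort_scans(a):
    # Count full passes of an early-exit bubble sort (including the final
    # pass that makes no swaps and confirms the array is sorted).
    scans = 0
    swapped = True
    while swapped:
      swapped = False
      scans += 1
      for i in range(len(a) - 1):
        if a[i] > a[i + 1]:
          a[i], a[i + 1] = a[i + 1], a[i]
          swapped = True
    return scans

  trials = [bubble_sort_scans(random.sample(range(1000), 20)) for _ in range(50)]
  print(min(trials), max(trials))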


So academic papers have also gone the route of clickbait titles :)



It’s not. The inner loop operates only from i+1 to n in exchange sort and the resulting swaps are very different.


Looks just like a slightly smarter insertion sorter, which is a slightly smarter exchange sorter, which is a slightly smarter bubble sorter. I don't see what's surprising about the approach.


> There is nothing good about this algorithm.

Seems a bit harsh.


I think that they should name it Beautiful Sort


Ok, it is not bubble sort. It is worse.


A+ click bait paper title.


Since when have academic/research paper titles become click-baity? :)


so now academic papers are titled like clickbait? xD


That is Bubble Sort, not surprising.


If you read elsewhere in the discussion you will see many people explaining why this is not a Bubble Sort. One of the defining characteristics of Bubble Sort, as defined in TAoCP and many, many other places, is that it's only ever the elements in adjacent locations that are compared.


Yes, but the way to traverse the array is the same as Bubble Sort.

With such specific definitions, I could just take any other sorting algorithm and change the way it compares elements to create "new" sorting algorithms.


I disagree.

The second loop runs from 1 to n, whereas in Bubble Sort it runs from 1 to n-1.

The comparison here is between A[i] and A[j], whereas in Bubble Sort it's always only between A[i] and A[i+1].

This one always performs n^2 comparisons, whereas Bubble Sort never performs n^2 comparisons.

This one compares elements to themselves, whereas Bubble Sort never compares an element with itself.

If you sort the array [1,2,3], the first thing this routine does is swap the 1 and 2, giving [2,1,3], whereas Bubble Sort runs along the array, makes no swaps, and exits.

This is not Bubble Sort.

It's true that if you take an existing sorting algorithm and change things around you can get other sorting algorithms. The question is then one of just how different it is.

If you choose to believe this is "just Bubble Sort" then that's up to you, but I disagree.


Does it bother anyone else that the paper has a single author and yet throughout they refer to themselves as “we”?


I've started using "the royal we" in my company's internal wiki so that hopefully readers won't think too hard about who the author was. In my case, I'm worried about new employees thinking, "Oh this is Jelly's page, I can't edit it."


Exactly, it’s less of a royal we and more of an inclusive / humbling we: I might be the author but I’m writing on behalf of and under the aegis of the group / organisation, so it’s our content not mine alone.


It's intentional and normal. Academic papers traditionally write in plural form (regardless of co-authors) as it feels more inclusive towards the reader.


This is common in some fields. My understanding of it is that 'we' means 'the author(s) and the reader'. You'll sometimes see this voice changing when the author(s) give their own opinion. e.g.

> From this we can see that XYZ. It is the author's opinion that ABC.


Just speaking for ourselves, it doesn’t bother us.


The author neglected to mention F. D. C. Willard as co-author, obviously.


Or Galadriel Mirkwood.


They likely had at least one research assistant, proofreaders, etc. Maybe the author is attempting to distribute credit and lead the reader to infer that more than one person was involved.


No. Many scientific papers are written in the first person plural. "We" actually means "me or us, the author(s), and you, the reader who is following along and checking these details."


Or, yes. I worked as a research assistant gathering data for a professor at University who wrote 100s of papers in the area of graphics and computer vision.



