Solving Instagram's Unshredder with Mechanical Turk and $0.50

cellis · on Nov 27, 2011

The next step: turn it into a game and get it done for free. Even better, put a skinnerian ux around it and charge for the pleasure of solving picture puzzles ;)

mikeknoop · on Nov 27, 2011

Another possible way to do this is present the Turker with two vertical pieces already placed together. Ask them a yes/no question: do these two images match?

Iterate intelligently over your favorite sorting algorithm until you've placed all the images!

Note: requires more Turkers but each answer could be worth much less, maybe around $0.01

aantix · on Nov 27, 2011

Slightly offtopic, but I am the author of the Rails/Mechanical Turk integration gem, Turkee ( http://www.github.com/aantix/turkee ).

The lower the cost of the HIT (and the easier the question) the more you want to make sure you're screening for bots.

Make sure you have a gold standard question. For every HIT ask two questions; ask the legitimate question that you need answered and then on top of that, ask a simple "gold-standard" question that will determine whether the answer submitted is a bot or not.

For the gold standard, I like to choose two numbers, and store the sum of their values. Along with your HIT question, be sure to ask the Turker the value of their sum.

E.g. For a survey you ask the Turker "What is your favorite color?" But then along with that, you ask them "What is four plus fve?"

Be sure to misspell one value (e.g. remove a letter, etc). If the Turker doesn't respond correctly to the "gold standard" question, programmatically reject their response.

pama · on Nov 27, 2011

Not sure if this matters for a "gold standard" question, but google answers it correctly:

http://www.google.com/search?q=What+is+four+plus+fve%3F

jethroalias97 · on Nov 27, 2011

But then I think you are giving up the main advantage of using mechanical turk over just roughly matching pixels. It is possible matching pixels will get the wrong answer where a human, with the context of being human, will know what the image is supposed to be and get the write answer, captcha style.

Example: an iron fence with a scenic view behind and where the slices are too small for someone to match only two.

jarin · on Nov 27, 2011

Speaking of CAPTCHAs, a smaller version of this would make an excellent one.

Er, well maybe not, given the ease of programmatic solutions.

thesis · on Nov 27, 2011

This would be beat with a very high success rate if it were a captcha.

jarin · on Nov 27, 2011

What about doing the same thing, but fuzzing a few pixels from each edge to avoid pixel distance matching? You could even make the edges just plain black for a few pixels from each edge. The human eye would be able to complete the image (it would just look like it has black bars), but I'm guessing it would be enough to prevent a simple algorithmic solution.

Edit: This is essentially the same idea as the iron fence posted above.

notatoad · on Nov 27, 2011

At that point you could probably just do some simple edge similarity tests instead of sending it off to the Turks.

on Nov 27, 2011

[deleted]

mikeknoop · on Nov 27, 2011

You're right, you'd have to get creative with some of the sorting algorithms, but something like Merge sort would be perfectly suited.

jakobe · on Nov 27, 2011

You can't use merge sort or any other standard sorting algorithm in this case. All these algorithms assume you can compare any two elements. To use a sorting algorithm, you would need to be able to say for any two images which is further left.

rhizome · on Nov 27, 2011

Am I the only person who doesn't see this as Instagram's unshredding project as much as a DARPA one?

http://www.popsci.com/technology/article/2011-10/darpa-wants...

Omnipresent · on Nov 27, 2011

This is a really neat hack but How does the script know whether or not the solution was correct? Have they shared a real solution that solves this problem completley algorithmically, without human involvement? I could not find it.

seanp2k2 · on Nov 27, 2011

This is probably the first time I've ever cares about "crowdsourcing" or his ugly brother-in-law, "cloudsourcing". However, I can't help but imagine why they want a solution for this. Seems like Cold Wat-era spook stuff.

code_duck · on Nov 27, 2011

It was a hiring contest/game to get people thinking about Instagram and find some talent, rather then a search for better system like Netflix's contest.

darkstar999 · on Nov 27, 2011

Would any of this help with producing a panorama? That's all I could think of.

Groxx · on Nov 27, 2011

Probably not, as then you have to play around with blending two images and (un?)distorting them to match each other's shape. That's the hard part - people tend to take pictures in order rather than randomly.

jarin · on Nov 27, 2011

I can imagine it now, this is the first step in human evolutionary divergence between the Eloi (programmers) and the Morlocks (Mechanical Turkers).

colanderman · on Nov 27, 2011

Eh. I don't like the analogy. The Eloi didn't do anything; they were just food.

shinratdr · on Nov 27, 2011

That was strikingly similar to a comment I read just recently...

jarin · on Nov 27, 2011

If you're trying to imply that I "stole" the comment, I posted it in the article comments after posting it here (note the usernames). Figured it might spark some discussion for people not coming from HN.

shinratdr · on Nov 27, 2011

I know that, I saw that it was you. I also don't have a problem with it, it just caused me to do a double take and go check the site to see if my deja vu was real. I thought I would mention that because I doubt I was the only one who had that experience.

lachyg · on Nov 27, 2011

The jQuery sorter thingy is incredibly buggy / slow. I would have been done in 1/3rd of the time if it wasn't!

burgerbrain · on Nov 27, 2011

If more turk tasks were like this one, I'd probably consider putting some free time into it. That was actually sort of fun.

arkitaip · on Nov 27, 2011

This is just brilliantly simple and effective. The fact that you created a fully functional prototype is just gravy.

nitrogen · on Nov 28, 2011

OT: For some reason the Recollect bar that slides in from the top when I scroll down the page is very distracting. It seems to violate the expectation that, when scrolling downward, page content should only be moving upward. It also feels like it's robbing me of readable screen area. I suspect I'd be much less bothered by a position: fixed header that was always present.

jeffbarr · on Nov 27, 2011

This is a very cool way to use the Amazon Mechanical Turk!

I am 99% sure that the image (at least the one that I saw) on the page is of Tokyo's Shibuya station. And the tall white and black building on the right 1/3 of the picture? That's the Cross Tower building, housing Amazon's Tokyo office.

This photo had to have been taken from one of the upper floors of the Cerulean Tower hotel.

jpadilla_ · on Nov 27, 2011

Very clever sir! I've used the Mechanical Turk for a couple of interesting tasks, lowering prices to $0.01 and having Turkers complete those tasks in less than 2 minutes which is pretty awesome! It's very interesting how we've started to create some kind of relationship with Turkers, since we've sent over 1M HITs since we started using it.

iag · on Nov 27, 2011

Love it. This is an excellent use of mechanical turk solving real world problems.

natch · on Nov 27, 2011

Very cool. I'd love to see the jQuery you used for this. Is that posted anywhere?

neilparikh · on Nov 27, 2011

Here's a link to the code that I found in the blog post (I'm assuming it has the jQuery code, but I'm not sure): https://github.com/bertrandom/unshred

jc4p · on Nov 27, 2011

Just do a view-source on the unsolved version of this page:

http://unshred.recollect.com

It seems fairly simple for something that gives so much utility.

code_duck · on Nov 27, 2011

This is because the complexity is found in a library, jQuery UI, which provides sortable().

deepkut · on Nov 27, 2011

Wow. That is truly amazing. Creative work my friend.