So, after over a decade, I finally found a use case where I had the clout and the need to use mechanical turk. I wanted to write about my experiences.
What I used it for: We were looking for some data on businesses. We had business name, city and state, and wanted full contact information. We paid a dime for each listing, and asked for email address and physical address. We asked about each listing twice so that we’d have some kind of double check.
How effective was it? This varied. If you were using the master workers, it was very effective, but slower. If you open it up to all workers, you have to review their work more closely. The few times I rejected someone’s task, they wrote back and asked why and tried to make it right, which was a testament to the power of the system (it records rejections). Make sure you break the work into a couple of smaller groups so you can iterate on your instruction set (when workers asked questions on the first set, the answers went into the instructions for the second set). We still had to review all the listings and double check any that didn’t match between both task answers, but that was a lot quicker than googling for each business and doing the research ourselves.
How much did it cost? On the order of a couple hundred bucks to process around fifteen hundred listings.
What kind of time savings did we see? Assume we had 1500 business names, and it took us 90 seconds to google the business name and find the information. That is 1500 listings * 1.5 minutes == 37.5 hours, and this is on the low end. Instead, it took about 2-3 hours of setup, and then 36 hours of calendar time (when I was able to do other things like sleep and work on other problems), and we were done. Then I would say it was about 7-10 hours of review. So you are trading a couple hundred bucks for at least 20 hours of saved time.
Would I do it again? I think mturk is perfect if your problem has the following three attributes: more money than time, a task that is extremely simple, and time to review the finished product.
Other tips? You have to build it some kind of sampling for correctness. I have no idea what the quality is if you pay more than a dime per task. Make sure you think about edge cases. Provide tips to your workers (“check whois records as well as google”).