{"id":108,"date":"2004-03-09T15:50:56","date_gmt":"2004-03-09T21:50:56","guid":{"rendered":"http:\/\/www.mooreds.com\/wordpress\/?p=108"},"modified":"2004-03-09T15:50:56","modified_gmt":"2004-03-09T21:50:56","slug":"with-enough-eyeballs","status":"publish","type":"post","link":"https:\/\/www.mooreds.com\/wordpress\/archives\/108","title":{"rendered":"With enough eyeballs&#8230;"},"content":{"rendered":"<p>I referred to <a href='http:\/\/www.gutenberg.net'>Project Gutenberg<\/a> obliquely <a href='http:\/\/www.mooreds.com\/weblog\/archives\/000105.html'>here<\/a>, but browsing their site I found that they&#8217;ve implemented distributed proofreading.  This is a very good thing.  I did one book, <a href='ftp:\/\/indian.cse.msu.edu\/pub\/mirrors\/Gutenberg\/etext99\/hrmyf10.txt'>Hiram, the Young Farmer<\/a>, for PG a few years ago, when I was in college and time wasn&#8217;t so precious.  The OCR went quickly, but the proofreading was slow going and error prone; the story wasn&#8217;t exactly riveting, but it was in the public domain.  (In fact, I just took a look at Hiram and found at least two mistakes.  Doh!)<\/p>\n<p>But <a href='http:\/\/www.pgdp.net\/c\/default.php'>Distributed Proofreaders<\/a> solves the proofreading problem by making both the scanned image and the OCRed text available to me in a web browser.  Now I can proofread one page at a time, easily take a break, and even switch between books if I&#8217;d like.  Also, they&#8217;ve implemented a two phase review, much like Mozilla&#8217;s <a href='http:\/\/www.mozilla.org\/hacking\/reviewers.html'>review and super review<\/a> process.  Hopefully this will prevent mistakes from being made, since these are going to be the authoritative electronic versions of these documents for some time.  <a href='http:\/\/www.catb.org\/~esr\/writings\/cathedral-bazaar\/cathedral-bazaar\/ar01s04.html'>Linus&#8217; law<\/a> probably holds for text conversion even more than for software development.<\/p>\n<p>Now, it wasn&#8217;t apparent to me from the website, but I certainly hope the creators of this project have licensed it out to businesses&#8211;I can see this application being a huge help for medical transcriptions (work from home!) and any other kind of paper to electronic form conversion.<\/p>\n<p><b>Update:<\/b><br \/>\nIt looks like there is a bit of a <a href='http:\/\/www.distributed.net\/'>distributed.net<\/a> type <a href='http:\/\/www.pgdp.net\/c\/stats\/teams\/tlist.php'>competition<\/a> among the PGDP proofreaders.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I referred to Project Gutenberg obliquely here, but browsing their site I found that they&#8217;ve implemented distributed proofreading. This is a very good thing. I did one book, Hiram, the [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3,2],"tags":[],"class_list":["post-108","post","type-post","status-publish","format-standard","hentry","category-books","category-technology-and-society"],"_links":{"self":[{"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/posts\/108","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/comments?post=108"}],"version-history":[{"count":0,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/posts\/108\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/media?parent=108"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/categories?post=108"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/tags?post=108"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}