Just One Page a Day
Charles Franks writes "Two years ago I started building an online proofreading system as a way to help Project Gutenberg (PG) get more books online: Distributed Proofreaders (DP). The concept is simple, we scan books and load the image and OCR output for each page into the online system. Next, proofreaders compare the OCR text to the image making any corrections as necessary, each page gets looked at twice. Finally the output from the site is massaged into a PG e-text and submitted to PG for posting to the archive. Now, nearly 600 books and a lot of PHP code later, we have snuggled into our new home which is graciously provided by the Internet Archive and Project Gutenberg. Now that we have 'real' resources available to us (the original site ran on a Pentium 200 over my 128kbps upstream cablemodem) I would like to invite the online community at large to help us put even more books online. To this end I would like to ask everyone to do 'Just One Page a Day'. Thank you, Charles Franks"
Seriously? Just make a distributed system, put in PHP code, and make it all open source and free?
What's the criteria?
Good quote, too many chars. Seriously, the slashdot 120 char limit sucks!
... which is renowned for it's spelling prowess? ;)
A)bort, R)etry or S)elf-destruct?
Sounds like Gary Condit's plan for extramarital affairs.
A. Rightmann
I'm shure that buy askin teh Salshdot crowd (esp. the editturs) to help, yule improove jamatically teh kwality off you're output.
:-)
Try NetBSD... safe,straightforward,useful.
Imagine the kids 200 years from now reading |-|uc||_3b3rry F1|\||\|.
(That hurts my brain just trying to type it in...)
--- I wish I could hear the soundtrack to my life. That way I'd know when to duck.
"The Road Ahead" will not be included, at least in this round of distributed OCRing.
Read Errant Story.
Instead of proofreading the books, I think this guy is asking for his new server setup to be tested!
I'll do it for cheesy poofs.
welcum 2 mi phoez leest
Nuff said.
Don't you mean run a compare tool in the background using CPU idle time right?
You don't actually want us to read a
page of literature do you?
The new congress might extend copyright protection to Shakespeare's great great great great great great great great great great great great great grandson's nephew's out of wedlock kid's son whose paternity is in question.
---
When you come to a fork in the road, take it! --Yogi Berra--
You mean a more communal approach than an oligarchy of "editros" that can't spot day-old duplicates? Great idea!
Not to mention malapropisms!! :-)
http://www.dictionary.com/search?q=duplicity&d b=*
I like the first definition better!
Alison
"It is a miracle that curiosity survives formal education." - Albert Einstein
I think he was just watching all his volunteers working on one page a day and thought:
"Imagine a beowulf cluster of these!"
Sure, it starts as just one a day. But, before you know it, you're doing two, then five, then ten.
You stop going out with friends or even returning their calls, personal hygiene takes a back seat and even Counter Strike and Warcraft III become unappealling. And, finally, after countless chapters and hundreds of pages you realise that you're friends were right: you're an addict.
Just one page a day, huh? Yeah, right.
Opium. Pot. Cocaine. Now pages.
It might not be your older brother's drug, or your Daddy's or your grandfathers, but, trust me, this stuff can be dangerous.
Do what I do. Just say no.
"Accept that some days you are the pigeon, and some days you are the statue." - David Brent, Wernham Hogg
OK, here's mine:
#include stdio.h
next...
My beliefs do not require that you agree with them.
MY GOD! A story where nitpicking grammar and spelling is *ON* topic.
This'll be a fun one to read through.
Do you Gentoo!?
I'll help out.
One question - is Playboy public domain yet?
OK, I'll start at the other end and work my way toward you:
}
YM,
#include <stdio.h>
One line, one bug. Yikes!
If they're looking for proofreaders here, the project is in deep trouble...
This page accidentally left blank
Wait a minute! Isn't PHP like evil or something?
Programming languages may come and go, but good old fashion machine code will last as long as literature, very much like good old fasion ASCII text and good old fashion zip files with no meaningfull names.