Internet Archive Puts 1.6M E-Books On OLPC Laptops
waderoush writes "Brewster Kahle of the San Francisco-based Internet Archive announced today that all 1.6 million books scanned and digitized by the Archive will be available for reading on XO laptops built by the Cambridge, MA-based One Laptop Per Child Foundation. The announcement came during a session on electronic books and electronic publishing at the Boston Book Festival. Kahle said the Archive has been collaborating with OLPC for a year to format the e-books for display on the XO laptops, some 750,000 of which are in use by children in developing countries."
Both versions of the XO laptop (1.5 as well as 1) have dual mode screens (the backlight can be turned off to enable reflective mode).
Although there's not much that can be done about it due to copyright laws, the fact that they're restricted to public-domain books likely skews it even more: there's a lot of 20th-century and 21st-century African literature, for example, but much less from pre-1923.
10 PRINT CHR$(205.5+RND(1)); : GOTO 10
Many are, though a good deal aren't. I don't see a way to browse their texts archive by language (am I missing something?), but you can search by specific language in the advanced search. I can't get them to add up to anything near 1.6 million, so presumably many aren't language-tagged.
But some rough figures:
400 - Swahili
Definitely a skewed distribution, but e.g. 17,000 texts in Spanish is quite a few, certainly more than most children can read!
10 PRINT CHR$(205.5+RND(1)); : GOTO 10
In fact, the proofread is done by the Distributed Proofreaders: http://www.pgdp.net/c/
BTW, I'd like to know what is done from all the human OCR from the Recaptcha project: http://recaptcha.net/
Any link to the digitized books ?