Slashdot Mirror


Squeezing a Wikipedia Snapshot Onto an 8GB iPhone

blackbearnh writes with this excerpt from O'Reilly Radar "Think about Wikipedia, what some consider the most complete general survey of human knowledge we have at the moment. Now imagine squeezing it down to fit comfortably on an 8GB iPhone. Sound daunting? Well, that's just what Patrick Collison's Encyclopedia iPhone application does. App Store purchasers of Collison's open source application can browse and search the full text of Wikipedia when stuck in a plane, or trapped in the middle of nowhere (or, as defined by AT&T coverage...)"

15 of 169 comments (clear)

  1. Re:Survey of Human Knowledge? by L4t3r4lu5 · · Score: 5, Funny

    "Wikipedia, what some consider the most complete general summary of human knowledge[citation needed] we have at the moment."

    There. Fixed that for you.

    There. Fixed that for you.

    --
    Finally had enough. Come see us over at https://soylentnews.org/
  2. Not a Problem by Blue+Stone · · Score: 4, Funny

    This is easily doable.

    Once you trim the earth reference down to "Mostly harmless".

    --
    Corporation, n. An ingenious device for obtaining individual profit without individual responsibility. - Ambrose Bierce
  3. What a total geek.... by ColdWetDog · · Score: 4, Insightful

    1. Goes to foreign country - one that he has never visited before
    2. Doesn't have wireless access.
    3. Instead of wandering about the country he spends most of his time programming ("Then basically, I spent a significant fraction of my time there in Japan, again, in 2007 writing those applications") an application so he can look up stuff about the country he isn't spending much time actually visiting.

    I bow before you sir. Awesome.

    --
    Faster! Faster! Faster would be better!
    1. Re:What a total geek.... by pzs · · Score: 4, Interesting

      You're right that this guy has flown the geek flag pretty high here; however, at least it's to some useful purpose. There are all kinds of facts about a country that are quite hard to discover just wandering about in it, and Wikipedia would be the ideal candidate to answer them.

      Last time I went on holiday (to Australia) I came back with a dozen questions I wanted answering, just because I didn't have internet access while I was out there; Wikipedia access would answer many of these questions. Examples:

      • I heard that Beds Are Burning was about the Australian aborigines - I never knew this before and wanted to look up more details on it.
      • As a result of that, I wanted to know far more information about how well aborigines were integrated in Australia at the moment. Answer: badly, but again hard to find out just by wandering around in Australia and difficult to raise with a random Aussie.
      • Australia is experiencing a lot of drought at the moment, but while we in Sydney, it rained quite a few times. I wanted to know more about the drought and what parts of the country it was affecting.
      • ...

      I could answer these questions by going into an internet cafe, but this isn't always possible. A portable Wikipedia sounds like a great idea.

  4. Oblig by TinBromide · · Score: 4, Funny

    xkcd comic reference

    Yeah, pretty much you're turning your iphone into a hitch hiker's guide to earth, or at least america and europe if you can manage to squeeze wiki-travel onto it.

    --
    Is it sad that I am more likely to recognize you and your posts by your sig than your name or UID?
  5. Re:iPhone apps for computers by Filip22012005 · · Score: 5, Funny

    That problem has recently been solved. With the recent addition of sms-sharing, you could use any iPhone remotely.

    --
    When the policeman of the tie, rule you violate, hello punishment of the kitty?
  6. Nothing new by Hrshgn · · Score: 4, Informative

    This is nothing new. Wikipedia has been available for several years now in MDict format: http://www.octopus-studio.com/product.en.htm

  7. Re:iPhone apps for computers by FlyingBishop · · Score: 4, Interesting

    FTFA:

    But I released the code to this application; it was open source from the very start. So it was pretty easy for them to take it and to port it to the OLPC.

    Already done.

    However, I'm not sure that I want precisely what this iPhone app is. It strips out references, and from the sound of things also the discussion pages. I'd say about 1/2 of articles I check the discussion pages to see what's really going on. Also he says he strips a lot of the metadata, and obviously images, none of which are things I"d want to give up (some of the metadata might be superfluous, but if I'm copying Wikipedia onto my computer, I want to copy Wikipedia onto my computer.)

    I understand there are licensing issues with images, but even so, the SVG ought to be safe. And that wouldn't add as much of a disk space hit as the gifs, etc.

    One of the other issues is the timing of Wikipedia dumps. They only do text-only dumps, and according to the article they only happen once every few months. It would be nice to implement an image review policy, and figure out a way to allow for mirrors (or just some increased bandwidth at Wikipedia HQ) so that we can actually have the entire English Wikipedia, regularly snapshotted and compressed, available for download. And really, for that kind of thing a 3-month or even yearly turnaround would be well worth the wait.

  8. Wikipedia has an entry on the Kama Sutra ... by Daniel+Dvorkin · · Score: 4, Funny

    ... so clearly this app will never make it through Apple's review process.

    --
    The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
  9. Re:Another step closer by TheRaven64 · · Score: 4, Insightful

    No. The Kindle supports online access to Wikipedia, but this requires a network connection. The iPhone supports the same. A while ago someone created a cut-down version of Wikipedia which you could browse completely offline on the iLiad. It sounds like someone has ported this to the iPhone, and because it's now on the iPhone it's news.

    Putting Wikipedia snapshots on portable devices is interesting. I don't really see why you'd do it with an iPhone; the iLiad takes CF cards, so you can just keep a 16GB CF card for Wikipedia and not fill up space you'd otherwise use for something else, but the iPhone's storage isn't expandable so it's a strange thing to want to do. The text of Wikipedia is not that big. A complete (uncompressed) copy is 200GB, but that includes all revision history and user pages. The current version of the English Wikipedia is around 4GB of text. This leaves another 4GB for filling up with images.

    --
    I am TheRaven on Soylent News
  10. Re:Survey of Human Knowledge? by mikael_j · · Score: 4, Interesting

    In all seriousness, I'm starting to get extremely annoyed by what is IMHO flagrant abuse of the [citation needed] tag on Wikipedia, I don't know how many times I've seen it used in situations where it just wasn't needed. And I don't mean in "But anyone who spends all day working on FOO knows that BAR!" situations but more along the lines of "The earth orbits the sun[citation needed]." or even better "Sir NameOfArticle was in his day frequently regarded as a national hero in $COUNTRY.[citation needed]. <Six paragraphs that detail, with plenty of sources, exactly how famous Sir NameOfArticle was.>".

    I've actually begun wondering if maybe there are certain individuals who are deliberately trolling Wikipedia by adding [citation needed] in places where it just doesn't belong and then sit around giggling as they read the discussion pages of various articles they've messed with.

    /Mikael

    --
    Greylisting is to SMTP as NAT is to IPv4
  11. XML Compression by firefarter · · Score: 4, Interesting

    So, I'm reading here that they convert the XML into proprietary metadata and compress that.

    Why not use EXI (Efficent XML Interchange) http://www.w3.org/XML/EXI/ which has been tested as more efficient that gzip and requires less memory to parse? Especially since the XML processing can remain the same, since the nodeset is the same.

  12. Re:Complete human knowledge? by Daniel+Dvorkin · · Score: 4, Insightful

    [citation needed]

    I'm not really kidding. Your anti-Wikipedia rant is entertaining, but it doesn't provide any substance. Speaking for myself, when I go to Wikipedia for a refresher on something I already know about, I'm generally pleased with the quality of the results, which makes me think that the articles on subjects I don't know much about are likely to be pretty good too.

    Your line about "political correctness and facts washed out of existence by human insecurities" provides a clue as to what really bothers you about Wikipedia: reality's well-known liberal bias. Unless you can provide specific examples, with citations, it's reasonable to assume that the Wikipedia groupmind knows more about the way things really work than some random dude on /.

    --
    The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
  13. Warning: 3 majors problem with this app. by Anonymous Coward · · Score: 5, Informative

    I bought this application 6 months ago and there are 3 majors problems with it:
    1) The search function is broken because you need to type the exact word (prefix)
    2) This is plain text: no pictures and no tables so most articles with "list" are useless
    3) No update mechanism so the dump used will be outdated soon.

  14. Just use the mobile-formatted version by daemonenwind · · Score: 4, Informative

    try this link from your mobile phone:
    http://wapedia.mobi/en/

    That way you get the whole thing, up-to-date, and with no trouble or major memory usage.