Slashdot Mirror


Wikipedia Planning a DVD Version

daria42 writes "The Wikipedia Foundation hopes to sell an English version of Wikipedia on CD-ROM and DVD before the end of the year. A boxed set of the German language version of Wikipedia has been available since last year. An updated version of the German Wikipedia was launched on Amazon.de this week, and the e-commerce site has received 8,000 pre-orders, according to Wikipedia Foundation president Jimmy Wales. Wales said it was easier to put the German version of Wikipedia onto CD as there are significantly less pages than there are for the English language version. He said that English Wikipedia would 'barely fit on 2 DVDs.'"

23 of 310 comments (clear)

  1. Re:Whaaa? by Servants · · Score: 5, Informative
    I followed your link:
    Last dump made: 2005-03-09 (30 days ago)
    Total size 50503MB (1460MB for just current revisions)

    These are SQL dumps of the current and old article revision databases for each wiki. They can be read into a local database and directly used with the MediaWiki software (MySQL, PHP, Apache required).

    These dumps are not suitable for viewing in a web browser or text editor unless you do a little preprocessing on them first.
  2. There are... by NumbThumb · · Score: 3, Informative

    ...no images in the dump. Just text. And not reader software.

    Also, the current dump is about 800 MG, gzipped. enjoy.

    --
    I have discovered a truly remarkable sig which this 120 chars is too small to contain.
  3. wiki is going to get sued for this by Anonymous Coward · · Score: 3, Informative

    A lot of vandals copy/paste text from copyrighted websites onto Wikipedia, usually they get found and deleted but some are missed. If they sell copies of Wikipedia then they are going to get tons of copyright infringement lawsuits.

  4. Re:Why? by sinclair44 · · Score: 3, Informative

    Wikipedia's servers are often overloaded. My net connection can go offline somtimes. It's 100% positivly available for a research paper, and will 100% be around to back you up. You can run complex searches on an offline version much better/nicer/faster than an online version (if you can run it online at all). You can show it off to friends. Or a multitude of other reasons.

    --
    Omnes stulti sunt.
  5. Re:Is this legal? by teslatug · · Score: 5, Informative

    Have a read.

  6. Re:Whaaa? by Rylz · · Score: 2, Informative

    Those figures are for all language versions. For just English, these are the figures (further down on the same page):

    en.wikipedia.org
    cur - 585MB- md5 ec31110459e3d9680074bfaeddb8bfc9
    old - 26269MB- md5 94c2a48a0f68e17d0fd38c5b2e4c4ec3
    --
    Sometimes you've gotta roll the hard six.
  7. Re:Wikipedia Magazine... I'd pay for it! by Raelus · · Score: 3, Informative

    Want to learn random stuff for free?
    http://en.wikipedia.org/wiki/Special:Randompage

    --
    "It is the stillest words which bring the storm. Thoughts that come with doves' footsteps guide the world."
  8. Re:Is this legal? by remahl · · Score: 5, Informative

    Text content contributed to Wikipedia must be GFDL, so the foundation can sell it as long as they respect the authors' copyright and the terms of the license. Although the Wikimedia Foundation is not-for-profit, even commercial distribution would have been acceptable under the terms of the GFDL. But the content copyrights still belong to those who created it.

    On the other hand, it happens that people contribute material copyrighted by other people, without their consent. According to U.S. law, Wikipedia cannot be held responsible for that, as long as they act quickly to remove infringing material. When physical media is distributed, that protection is no longer valid.

  9. The fine print by Bifurcati · · Score: 3, Informative
    Just so we're clear, the article says that the majority of the price is going towards production costs and paying amazon. But if you're cheap, and really want a DVD set, then you can just download the images off "various websites", presumably to burn at your leisure.

    It's hard to get a more friendly distribution method than that!

  10. Re:Hmm by pluke · · Score: 1, Informative

    from the amazon site http://www.amazon.de/exec/obidos/ASIN/3898530205/q id=1112923136/sr=8-1/ref=sr_8_xs_ap_i1_xgl/302-436 0157-7192803 it's 9,90 euro which is about $6-7.

    --
    "all through my house i set up traps, it seems like the rats have a map, so now i feed the rats crack" - Donald D
  11. Re:Another reason the German version fits on CD by nbert · · Score: 4, Informative

    well, we just hate redundant information :D

    There is no way to turn AndbecauseGermansdonotwastebytesonspaces into one single German word btw.

    It is true that the German language allows combinations of nouns of arbitrary length, but in the English language constructs like this exist as well (e.g. railway consists of two nouns). The only difference is that those speaking English are not free to make up new ones.

    And as a general rule of thumb most combinations in English are limited to two words. While it wouldn't make sense to combine more than 5 words, because it would get to hard to read and understand the term, there are rare examples in German which consist of 3 or even 4 words.

  12. A bit of history on this by Raul654 · · Score: 5, Informative

    I first heard about this back in July of 2004. The people at Mandrake had already approached some of our people, and told us they wanted to put Wikipedia on DVD. The stumbling block was, of course, copyright issues. We launched a copyright tagging project in August - basically, they did an sql dump of the list of all uploaded files that had no copyright tag and tagged them. In January, Angela sent them an email, telling them it was done, and that's when the DVD project actually started.

    --


    To make laws that man cannot, and will not obey, serves to bring all law into contempt.
    --E.C. Stanton
  13. Re:Another reason the German version fits on CD by arodland · · Score: 5, Informative

    You could if you studied it for a minute. The rules are actually relatively simples. For example, let's look at everyone's favorite word: "Aktiengesellschaft", meaning a public corporation, and usually abbreviated "AG".

    Semantically, it can be broken into Aktien / Gesell / -schaft; taken at this decomposed level it means something a little bit like "stock fellowship".

    Orthographically, well, you're in luck; as usual (always?), it breaks down along the same lines, according to the rules. What are the rules? I couldn't tell you exactly, but they're simple, and they're similar to Latin's. Anyway, it breaks down to Ak/tien/ge/sell/schaft. Breaks occur between consonants that don't form clusters, between vowels that don't form diphthongs, and otherwise before consonants.

  14. Re:Why? by ikkonoishi · · Score: 4, Informative

    And for those who care...

    An MLA/APA auto formatter for references.

    Every teacher at my school has recommended it to me. (Although I myself have not yet gotten a chance to try it.)

  15. Are you all really *that* connected? by Just+Some+Guy · · Score: 2, Informative
    I think you're all missing the point. I'd love to kick back with Wikipedia during a long plane trip and depth-first traverse myself into oblivion. I have friends who live in rural areas with bad phone lines who might still like to read a few articles. A lot of restaurants in my town still don't have wifi, so I can't browse during lunch.

    I can think of a million and one reasons why having a fixed version that is instantly available would be a very handy thing indeed. I have all the Internet connectivity I could want (short of a neural interface), but I'd still probably shell out a few bucks for a copy.

    --
    Dewey, what part of this looks like authorities should be involved?
  16. How to help out by Anonymous Coward · · Score: 1, Informative

    The main problem facing Wikipedia at the moment is lack of development resource. At FOSDEM last month Wales urged the assembled audience of open source developers to get involved with the work of the foundation.

    The article doesn't mention how you can help out. Maybe might want to check this out:
    http://meta.wikimedia.org/wiki/Help:Contents%23For _MediaWiki_hackers

  17. Good, Bad, it is what it is. by cbreaker · · Score: 4, Informative

    Althought I think actually USING the DVD set for normal use when you have broadband kinda defeats the purpose, I can think of a few reasons why it could be a good thing.

    A) Archival. Average users will be able to get a working, usable snapshot of Wikipedia, with media.

    B) Preservation. If Wikipedia were to shut down, you'd have a copy of it.

    C) Faster access. If you have a slow connection, you can still access Wikipedia at fast speeds. This benefit dwindles over time as articles are updated.

    D) Offline access. If you're on the road with no net connection, you can still access Wikipedia. This benefit also dwindles over time as articles are updated.

    E) Although backed by Google now which helps with the financials, if it brings in some cash to help support itself it's likely to stay around for much longer.

    --
    - It's not the Macs I hate. It's Digg users. -
  18. Re:Where is the Great Publishing House of Ursa Min by hacker · · Score: 2, Informative
    You mean something exactly like this? Or perhaps Wikiquote in Plucker format? Or maybe the full Wiktionary in Plucker format?

    As you can see, I've been busy ;)

  19. Wiki* in Plucker handheld formats by hacker · · Score: 4, Informative
    I've been working on the Wikipedia, Wikiquote, Wiktionary and other similar works to convert them to Palm handheld formats (primarily Plucker format, but now iSilo for those users as well, with less functionality in iSilo, of course). I did a lot of work to the core Mediawiki software that drives it, to make it more usable on handheld devices.

    You can see my work so far at the following links:

    Wikipedia in Plucker format
    Wikiquote in Plucker format
    Wikitionary in Plucker format

    ..and of course, my beautiful anti-alias fonts for Plucker, made with PalmFontConv by Alexander Pruss.

    I've also converted the Creating XPCOM Components book by Doug Turner and Ian Oeschger to Plucker format as well as the FreeBSD Handbook.

    I have literally hundreds of similar-quality works I'll be releasing over the next few months to the community on an ongoing basis.

    If there's something you'd like to see, just let me know

  20. Japanese... by Cryptnotic · · Score: 2, Informative

    The only Japanese texts that are written all in phoenetics (hiragana or katakana) are children's books. Those will have spaces in between words. Text that is written for literate adult readers is written with kanji (alphabet from China). Most nouns, verbs, adjectives, and adverbs are written using one, two, or three kanji. Sometimes nouns are strung together to make longer nouns (sort of like compound nouns, e.g., "All-Japan Kendo Federation" is "zen nihon kendou renmei" and is written using 7 kanji strung together). Hiragana is used for particles (which mark parts of speech and join words together) and also for adjective and verb conjugations. There are also some words which are written in hiragana, however they are mostly common and easily understandable, so there is no problem. Foreign words are usually transliterated using katakana (another purely phoenetic script). Because they're written in katakana, they are automatically differentiated from the surrounding kanji and hiragana. Differentiating words isn't all that hard. The problem is really learning all the vocabulary and kanji.

    --
    My other first post is car post.
  21. 9,90 EUR = 12,44 USD by Anonymous Coward · · Score: 1, Informative

    Actuallym 9,90 euro is around $12

  22. Re:What's their point? by slavemowgli · · Score: 2, Informative

    1. The code is the same size, but it doesn't matter since the space the software would take up on a CD/DVD/... is dwarfed by the size of the data.

    2. The English Wikipedia has (roughly) about 2,5 times as many articles as the German Wikipedia. Mean article size (in bytes) is roughly equivalent.

    3. The English Wikipedia uses many more images etc. than the German Wikipedia.

    --
    quidquid latine dictum sit altum videtur.
  23. Wikimedia Foundation by dolmen.fr · · Score: 3, Informative

    This is about the Wikimedia Foundation , not Wikipedia Foundation which doesn't exist.
    Both the article and the /. post are wrong.