Wikipedia Planning a DVD Version
daria42 writes "The Wikipedia Foundation hopes to sell an English version of Wikipedia on CD-ROM and DVD before the end of the year. A boxed set of the German language version of Wikipedia has been available since last year. An updated version of the German Wikipedia was launched on Amazon.de this week, and the e-commerce site has received 8,000 pre-orders, according to Wikipedia Foundation president Jimmy Wales. Wales said it was easier to put the German version of Wikipedia onto CD as there are significantly less pages than there are for the English language version. He said that English Wikipedia would 'barely fit on 2 DVDs.'"
Frequent mentions of David Hassellhoff compress really well.
I love it when /. posts an article only for me to see 'Nothing to see here.' for a minute.
On the Other Hand, Only TWO DVDS?
I better get to Wiki and start writing pages all about the writing Fan Fiction!
Edward@Tomato - /home/Edward/ man woman
man: no entry for woman in the manual.
"Qua!?"
Last time I checked, the current version of the English wikipedia dump, is around 585MB. It should comfortably fit on one CD. Where did this figure of two DVDs come from?
Not that I hope multitudes don't buy this, because any support of such and effort is good, but why would anyone buy it when you can just look it up and possibly get better updated results online?
Why pay for what you can get free?
"It is the stillest words which bring the storm. Thoughts that come with doves' footsteps guide the world."
How will the trolls deface a read-only version of it?
"The object of war is not to die for your country, but to make the other bastard die for his." - Patton
Because you gotta love it when people get paid while continuing to give you their stuff for free. Everybody wins.
Surely text compresses well. Not to mention that most of the linking between different articles could be handled in such a way as to reduce wiki-markup overhead.
Then again, there are a heck of a lot of articles in the Wikipedia...
I think it's a good idea to have wikipedia available in other formats than just online, but isn't the whole point of it that anyone can come and edit the articles to make them more correct? You couldn't do that with a DVD version. And unless someone is going to go through every article before putting it on a disc, you'd run the risk of buying an encyclopedia with some things blatantly wrong. I could envision pranksters trying to sneak in false information just before the DVD release...
Real_men_don't_need_spacebars.
I thought the whole idea behind wikipedia was that it is constantly changing. Will updated dvds be sold? And if so, will previous buyers get a discount?
The footprint of the english version fits on a floppy!
You know how controversial subjects in the Wikipedia get fights over entries. Back and forth it goes, with one person putting their "truth" and then the opposite side removing or replacing it with their version of the "truth." Now, just picture it: The deadline for the gold master version to be put on disc is announced, and like people pouncing on an EBay auction at the last second, the warring factions will rapidly replace each other's versions of an article, hoping that their version is the one to be immortalized on disc.
I wonder... does this 2-DVD set include all articles from Wikipedia? (As opposed to some just selected somehow...) Also, I wonder if the DVD version will include all the version changes to the articles. If not, then perhaps the best version was picked out somehow?
Hmmm... This is what I think needs to happen: Wait a few more years for Wikipedia to gain even more information, and then put some kind of button on pages that allows users to "vote" for that page to be included in a dead-tree encyclopedia version of Wikipedia. The idea is to put only those articles that have the highest votes into a traditional-style encyclopedia that can rival the likes of commercially made ones. Of course, there would need to be ways to cite sources, to make the encyclopedia worthy of academic research and the like, and preferably there should also be a way for people who want to do other stuff than write articles to submit photographs or whatever kind of artwork, of their own creation and released under the free license of Wikipedia, for inclusion in the articles. For the print version, people might be able to vote for the "best" photographs and artwork for inclusion. At that point, it should be a matter of running some perl script or something to typeset the whole darn thing. This might find its way into libraries and into peoples' homes. Imagine that!
I like the fact that Wikipedia is always current, so I don't know why I'd ever want this on DVD. For example, they had a great article on the pope the other day which was current right up through his death. Since I can just look this up online, why would I want stale information stored on my computer? I have a set of World Books on the shelf, and we keep them around for when you want to do research when "otherwise occupied" (i.e. sitting on the can). Of course now that I have a Zaurus with wireless networking then I don't need the hardcopy any more since I can surf from any room in the house.
If you don't want crime to pay, let the government run it.
How often do existing pages change? Maybe in a case where people catch errors.
I have a spare 20GB lying around that I would install this on, if there was some way to sync it with the current state and have it download new pages and update current ones.
WWJD -- What Would Jimi Do?
(Smash amp, burn guitar, take home the groupies)
...no images in the dump. Just text. And not reader software.
Also, the current dump is about 800 MG, gzipped. enjoy.
I have discovered a truly remarkable sig which this 120 chars is too small to contain.
This would be great for schools. They could buy the DVD set and set up a local "mirror" of Wikipedia to increase access speed and decrease Wikipedia bandwidth usage.
...when I say, "two single-layer DVDs, or dual-layer?"
Say what you will about Wikipedia's well know flaws (revert wars, submitters with thinly veiled political agendas and other various sub-vices) but part of the appeal of Wikipedia is simply the form of its current incarnation.
Right now, if I wanted background information on something, I just load up the (usually speedy) bookmark, enter the search and within seconds I have my answer. Why should it then become a more laborious to use resource? 2 DVDs? No only would constantly inserting and removing discs and launching extra applications provide more hassle (not too much more, but enough to make its value as an ultra speedy information center reduced) how would the information be sorted? Imagine half the links on any page requiring you to switch discs? For me, one of the great ways to use Wikipedia is to wander from article to article following the various text links. A multi-disc setup like this would only discourage that method.
Finally, I don't really think Wikipedia is ready to be put onto physical media for distribution. I certainly wouldn't trust it for more than satiating my curiosity, for instance I'd never cite it in an essay as a source. Articles with heavy disputes in their comments abound and many have no easy answers with how to solve an article's problems. So putting it on DVDs is a two fold problem, you don't get the advantage of having your data updated constantly by other users, but you'll also be working with a lot of flawed data that will be flawed forever on the disc. Perhaps I'm exagerating the problems a little, but really, who needs this? It's not like it's a cheapr printed encyclopedia alternativ to Brittanica, and since it'll be in electronic form anyway, why NOT use the web for it, you don't have to store any of it yourself.
The only audiance I can think of that would need this are Wikipedia addicts who spend a lot of time without internet access.
Sorry for being overly negative, I really love using Wikipedia, but I think this move kind of messes up the point...
Yup...
In order to publish and SELL this information on CD/DVD, does the Wikipedia Foundation have to get the permission of all the article writers, or is there, perhaps, a clause on the website that says something like 'we own all the stuff put on here'. What would happen if Slashdot sold versions of article comments on DVD?
Rhymes that keep their secrets will unfold behind the clouds.There upon the rainbow is the answer to a neverending story
A lot of vandals copy/paste text from copyrighted websites onto Wikipedia, usually they get found and deleted but some are missed. If they sell copies of Wikipedia then they are going to get tons of copyright infringement lawsuits.
On each Wikipedia article, there should be a button where users can vote an article as being "worthy" for academic research and the like. Articles that receive high votes would actually get published in a monthly (or even by-weekly) magazine... So, for example, each month, subscribers would receive the magazine in the mail, and it would contain, in addition to paid advertising like any other magazine, something like ten or fifteen articles randomly chosen from Wikipedia. These would cover a broad range of topics. One month, you might receive a magazine with articles about Argentina, transaxles, grep, electromagnetism, George Washington, the Berlin wall, Apollo 9, goldfish, ballpoint pens, and cow manure. Some subscribers will already be familiar with some of the topics; others might not be interested in some of the topics; but chances are that if you pick up this magazine and read it, even for a few minutes a month, you'll learn some interesting new facts here and there, usually about topics that you'd never consider reading about in any serious manner, but which you're reading because the Wikipedia Magazine happens to be there.
Links at the bottom of articles would direct the reader to the article online. This would serve an additional purpose: People who find something missing or something that could be improved in an article would perhaps be more likely to find out about it and then go online and fix it, thereby improving the quality of the entire Wikipedia.
Money from subscriptions; money from advertisers in all fields (not just technical, and perhaps based on the content of that month's magazine) would finance the magazine and help finance Wikipedia. I see this as an opportunity to make quite a profit on something that is free, while mainly benefiting the community by doing so.
Honestly, I'm wondering how the CD will be like. Will they include an option for you to have the ability to update the content at your setting? Or will it be so that you have to buy a new version everytime? I'm thinking that they will include an option to update but you can also buy newer versions so that you don't have to spend time updating (I hope)
- Teja
German people dont necessarily speak english and vice versa. And two DVD's is a lot of space. And 4 cd's isnt even a DVD. And just because other people use multiple disks doesnt mean its a good idea. I remember playing riven and having to switch disks way way to often. And for a Encyclopedia there would be a 50% chance that you would have to switch disks everytime you looked something up. That would be rather annoying when trying to do any research.
How much will it cost? If it is more then ten bucks then I'll probably just warez it.
You know, Wikipedia is ripe for a Hitchhiker's Guide to the Galaxy treatment.
Put it in a little handheld, stick an Ipod hard drive in it, give it a usb port so it can grab updates, and presto.
As for Wiki itself, "At least where it is inaccurate, it is definitively inaccurate." -Douglas Adams
Better idea: let the Wikipedia admins pick a version of the page they like, and don't mention the gold master date to anyone. If the deadline isn't known, the factions -- and let's be honest, this might be a problem on ten or fifteen major pages at most -- can't go edit-warring right up to the deadline.
p
In Korea, long hair is for old people!
It's hard to get a more friendly distribution method than that!
Physicist, consultant, science communicator
I would like to ammend: reset the counter (or lower it or something) for each rewrite. I'd expect the pages in "edit wars" would also get a lot of votes.
Can you be Even More Awesome?!
And instead of a "DONT PANIC" sticker, they'll put on a "DONT EDIT" one.
"It is the stillest words which bring the storm. Thoughts that come with doves' footsteps guide the world."
I've always considered Wikipedia the closest thing Earth has to a Hitchhiker's Guide. If I need to know about anything quickly, that's the first place I look. I'd be a bit interested in something like what the parent mentioned.
Maybe the whole encyclopedia can be installed onto the hard drive, so the user can just choose to update topics as they view them, and also updates when they search for the topic. And such a system can be used online and offline. Oh, and they can also throw in local editing and uploading your updates.
Anyone know if they have any way of stopping this?
Physicist, consultant, science communicator
I should hope that they would check each page for trolling before going gold. Even if this is an enormous number.
Simple. Don't publish the exact snapshot date.
... maybe the zealots who use Wikipedia as their ideological battleground (e.g. this, this, or this) can host their own wikipediae, with their own versions of The Truth, and thus the revision wars on the original Wikipedia will stop.
Or not.
If you mod me down, I shall become more powerful than you can possibly imagine.
The problem with voting for the best articles is the same problem that other elections can face--ballot stuffing. What is to stop people with some sort of agenda from voting many times to promote certain articles or coordinating groups of people to vote?
This would probably be a factor only with ideologically divisive topics. The problem is that the votes engendered by the controversial articles might end up dwarfing the numbers for interesting but obscure subjects. I for one wouldn't want a hypothetical Wikipedia magazine reduced to a compendium of hot-button articles.
I know that this is a pessimistic view of how article voting might work. Don't get me wrong, I like Wikipedia and I enjoy it a lot. I just don't quite trust it.
Anakin Simpson: If you're not with me, then you're my enemy--ooh, donuts!
Well you could have a seperate mp3 on an iPod for every entry, but try getting Peter Jones (or Stephen Fry) to read every page on wikipedia. I don't think they'd be up to it, nor would there be room on any current iPod, methinks (not sure).
"I would say that 99 per cent of what my father has written about his own life is false." - L. Ron Hubbard Jr.
- Dolby Digital Surround Sound
- 1337 5u8717135
- EBG13 Fhogvgyrf
- TourettesMOTHERFUCKINGBITCHCOCK subSHITMONGERtitles
- A short music video in which Jimbo breaks into his famous freestyle diss of Brittannica
- Re-enactments of some of the best NPOV disputes... with puppets!
- And BJAODN, of course
And expect a 25-disc Collector's Edition in about 6 months, with the addition of:They should publish this in the form of a mobile device with the cover saying "DONT PANIC". It would be as cool as hell, and quite useful on certain ocasions when a laptop is too large, or you dont want to risk it being stolen. Anyone thought of making their iPod (mabey its too small) into The Guide?
I'm *still* waiting for the encyclopedia guy to come around to my house.... he hasnt been around for a good 15 years!
Is German smaller in code than English, it not, then what is causing the extra space???
The English version is more comprehensive with more articles, largely because it has more people working on it.
I first heard about this back in July of 2004. The people at Mandrake had already approached some of our people, and told us they wanted to put Wikipedia on DVD. The stumbling block was, of course, copyright issues. We launched a copyright tagging project in August - basically, they did an sql dump of the list of all uploaded files that had no copyright tag and tagged them. In January, Angela sent them an email, telling them it was done, and that's when the DVD project actually started.
To make laws that man cannot, and will not obey, serves to bring all law into contempt.
--E.C. Stanton
Indeed; this is what I mentioned up above. It can be done, and it can be done reasonably well in a cross-platform app.
Mediawiki 1.5 (which should be out soon - a couple of months, probably) will include a versioning system. Someone can mark a version as 'good', which in theory means no vandalism and no POV (point of view) issues. (who this is has not yet been determined - I suspect it will probably an admin)
To make laws that man cannot, and will not obey, serves to bring all law into contempt.
--E.C. Stanton
What's the point? Wikipedia is an inherently online medium. The articles change daily, new ones are created, etc. This cannot be reasonably placed on a static medium.
Slashdot: Playing Favorites Since 1997
Now there's a new buzzword: war-editing
I've so many wikipedia word-wars, where different factions couldn't agree on the correct meaning.
Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
There are already mobile versions that you can get for Palms and Pocket PCs. The database files are over 500MB, so you need an expansion card with that much room. Visit http://en.wikipedia.org/wiki/Wikipedia:TomeRaider_ database
Personally, I'm hanging out for someone to find a way to only retrieve the top 50% of articles by relevance/importance/popularity and put that on a 256MB or even 128MB database, so that mere mortals can also have Wikipedia on our Palms.
Meanwhile, h2g2, the official Guide project, was contemplating a mobile version, but broadcasting regulations meant that the BBC couldn't follow through on it.
NO! How many people do you know with Blue Ray drives?
Dewey, what part of this looks like authorities should be involved?
self upgrading... and of course, based on GNU/Linux
13-4=54/6
Hey, you implicitly agreed to this when you decided to contribute to Wikipedia, fully agreeing that your work was licensed under the GNU FDL.
Le français vous intéresse?
Aw, man! You had me goin' there for a minute. I was really hoping to read an informative essay on the topic of cow manure, but you just pulled that out of ... your... ass... figuratively speaking.
I can think of a million and one reasons why having a fixed version that is instantly available would be a very handy thing indeed. I have all the Internet connectivity I could want (short of a neural interface), but I'd still probably shell out a few bucks for a copy.
Dewey, what part of this looks like authorities should be involved?
Really? WHY? It's not factual, often more farcicle. Wiki is based on consensus and not facts, and reality (nor the facts) change because a large number or people believe something is so.
Going to Wiki is a waste of time, I honestly can't understand why people do it.
The main problem facing Wikipedia at the moment is lack of development resource. At FOSDEM last month Wales urged the assembled audience of open source developers to get involved with the work of the foundation.
r _MediaWiki_hackers
The article doesn't mention how you can help out. Maybe might want to check this out:
http://meta.wikimedia.org/wiki/Help:Contents%23Fo
Althought I think actually USING the DVD set for normal use when you have broadband kinda defeats the purpose, I can think of a few reasons why it could be a good thing.
A) Archival. Average users will be able to get a working, usable snapshot of Wikipedia, with media.
B) Preservation. If Wikipedia were to shut down, you'd have a copy of it.
C) Faster access. If you have a slow connection, you can still access Wikipedia at fast speeds. This benefit dwindles over time as articles are updated.
D) Offline access. If you're on the road with no net connection, you can still access Wikipedia. This benefit also dwindles over time as articles are updated.
E) Although backed by Google now which helps with the financials, if it brings in some cash to help support itself it's likely to stay around for much longer.
- It's not the Macs I hate. It's Digg users. -
How are they going to get a Snapshot of Wikipedia in which there is no vandalism in any of the articles? It's ok to let people download Wikipedia when a few articles have been been vandalized, but to sell it in stores like that? They might have to work out a system where they copy the whole thing, put it on another page, and allow access only to admins, who will revert any vandalism they see, before having the whole thing ready for sale.
"When the atomic bomb goes off there's devastation...but when the atomic bong goes off there's celebraaaaation!"
As you can see, I've been busy ;)
A Treo 600 or 650 with iPedia works nicely for this and has a well-designed interface. It's shareware, but IMHO well worth it -- I find myself using it quite regularly.
You can see my work so far at the following links:
..and of course, my beautiful anti-alias fonts for Plucker, made with PalmFontConv by Alexander Pruss.
I've also converted the Creating XPCOM Components book by Doug Turner and Ian Oeschger to Plucker format as well as the FreeBSD Handbook.
I have literally hundreds of similar-quality works I'll be releasing over the next few months to the community on an ongoing basis.
If there's something you'd like to see, just let me know
I hope the take the history of Mathworld as a warning as what can happen in the publishing world.
Now you can deface Google as well!
Get your Unix fortune now!
I could see it beeing useful for schools just the way current encyclopedias are. I wouldn't cite the wikipedia in a paper, but I would use it as a starting point for research. It has been understood for papers that I have written since 8th grade, that you get more advanced sources than an encyclopedia.
"brxref
Recursive wget's are soooo much easier.
Systemd: the PulseAudio of init systems
I think wikipedia is a lot more thrustworthy then most other sources on the internet. At least you know that many people are looking at wikipedia and making corrections.
And as other people have noted. Wikipedia can at least be a good source to start further research from.
Greetings,
Project Manager of Crystal Space (http://www.crystalspace3d.org). Support CS at http://tinyurl.com/cb3x4
The only Japanese texts that are written all in phoenetics (hiragana or katakana) are children's books. Those will have spaces in between words. Text that is written for literate adult readers is written with kanji (alphabet from China). Most nouns, verbs, adjectives, and adverbs are written using one, two, or three kanji. Sometimes nouns are strung together to make longer nouns (sort of like compound nouns, e.g., "All-Japan Kendo Federation" is "zen nihon kendou renmei" and is written using 7 kanji strung together). Hiragana is used for particles (which mark parts of speech and join words together) and also for adjective and verb conjugations. There are also some words which are written in hiragana, however they are mostly common and easily understandable, so there is no problem. Foreign words are usually transliterated using katakana (another purely phoenetic script). Because they're written in katakana, they are automatically differentiated from the surrounding kanji and hiragana. Differentiating words isn't all that hard. The problem is really learning all the vocabulary and kanji.
My other first post is car post.
...I'm from Norway, where we typically also create long words. Railroad conductor's assistant should be translated as "jernbanekonduktørassistent" (jernbane-konduktør-assistent).
Because we're very English oriented (lots of US series on TV etc.) we've gotten this word splitting syndrome. "Lamb thighs" = "Lammelår" , not "Lamme lår" = "Lame thighs". "Hospital" = "Sykehus", mot "Syke hus" = "Sick houses".
Most of the time it can't be misunderstood though, but it is still plain wrong. But ah well, the biggest rampant destruction now is irc/im/sms language leaking into everything, even school work. I can speak TLAs as well as anyone, but I also know how to write proper Norwegian (and English, and German...).
Kjella
Kjella
Live today, because you never know what tomorrow brings
Actuallym 9,90 euro is around $12
1. The code is the same size, but it doesn't matter since the space the software would take up on a CD/DVD/... is dwarfed by the size of the data.
2. The English Wikipedia has (roughly) about 2,5 times as many articles as the German Wikipedia. Mean article size (in bytes) is roughly equivalent.
3. The English Wikipedia uses many more images etc. than the German Wikipedia.
quidquid latine dictum sit altum videtur.
If the content would only fit on 2 DVDs (mostly due to media other than text I suppose) I wonder how will they solve the problem of minimizing the number of times users have to change DVDs? Do algorithms exist that will solve this, in reasonable time on reasonable hardware, for something the size of en.wikipedia?
I know I for one would find it incredibly annoying if it turns out I would need to play DJ a lot to go from "Gautama" to "India" to "Bhopal", say, which is precisely the kind of loosely coupled chain I find myself browsing for the fun of it. The links go everywhere, and I would argue the topology is at least as valuable as the content itself..
I expect you could isolate some "clouds" of articles mostly referring to each other, but anyone want to guess as to the percentage of "cross-DVD" links?
Gosh, thanks. That must be why the other ships call me Meatfucker -- GCU Grey Area (Eccentric)
Make that 26269 MB or 26 Gigabytes, 585 mb is just the changes since the last dump.
If voting were effective, it would be illegal by now.
Even if he is not online 24/7
The real blasphemy is that he has any computers at all without ethernet cards.
I mean they want to be online even if he doesn't
A computer without the internet is a football bat.
Might as well hide with a pad of paper and pencil in the closet.
If voting were effective, it would be illegal by now.
I've setup a PC for a youth group I help out at and as no telephone lines are available and no wifi is around - It would be really nice to put wikipeida on it.
To Slashdot or not to Slashdot. That is the question (that will cause me to fail an interview)
trust me, it is very good, that there is no such entry. bavarian poetry is nearly always worse than vogon poetry and in the written form you can even understand it.
in the spoken form the things are different, though, because nobody understands bavarians.
Conservatism: The fear that somewhere, somehow, someone you think is your inferior is being treated as your equal.
Stuff that, I want the eBook of Oolon Colluphid's trio of philosophical blockbusters.
already done, check out tomeraider wikipedia editions on a gig sd card and pocketpc.
More and more I caught myself finding the answer I need in Wikipedia. I think that all human knowledge can be finally encoded in the medium called 'internet', in Wikipedia form...thus allowing unlimited access from schools and other institutions. It would really be a very helpful tool for teaching, along with the classic ways.
What the hell's wrong with you?
What the hell's wrong with your telecommunications provider. Prime offender: Telenor. In 2000 I was going to get ADSL. I got it in 2003. In 2004 I moved, again to an area where they claim you can get ADSL, but I'm still waiting. Oh yeah, the former place didn't have cable (too many people on the street with satellite), the latter hasn't upgraded the cable network to do Internet. WOHOO!!!!
Kjella
Live today, because you never know what tomorrow brings
The more devious (and therefore preferable) solution is to announce that the gold date has already passed and let the WikiPedia continue to ooze forward as it does. Then take the master version whenever they bloody well want to.
Well, Peter Jones certainly wouldn't be up to it, considering he died in 2000...
Never mind Spamassassin. When's Spammerassassin coming out?
This is about the Wikimedia Foundation , not Wikipedia Foundation which doesn't exist. /. post are wrong.
Both the article and the
I think a possible solution would be for Wikimedia to maintain secrecy regarding the "gold master" deadline. They would only be able to announce an actual release.
---