Yahoo! Orders Wikipedia Hardware
Edit This Page writes "Jimmy Wales announced today that Yahoo! has ordered 23 HP servers for the Wikimedia Foundation. The three database servers are model DL 385, and will come with dual Athlons, 8GB of RAM, and 6x 146GB 15K RPM drives each. They will also provide rackspace and bandwidth. The announcement comes four months after Google's announcement of support, and two months after Yahoo's own. Google has not yet made their intentions clear. You can read more about the specifications of what will soon be a 100+ server cluster at the Wikimedia Servers wiki article."
As I write this, our developers are switiching the entire site over to Mediawiki 1.5 (from 1.4), and most of the changes will make it run faster. So we're lowering the per-transaction cost of the software and increasing the server capacity -- this is a good thing.
To make laws that man cannot, and will not obey, serves to bring all law into contempt.
--E.C. Stanton
Wikipedia Hardware?! I didn't know they make hardware. Does anyone have the Wikipedia link for this? ;)
Does wikipedia seriously need all that? I thought the data they were serving up was mostly just text and wasn't really a huge problem. As in, weren't their current servers enough? Or am I missing something?
So it seems now that Wikipedia has more street cred than either Yahoo OR Google, since they're both clammering to be seen as being in support.
And with Google at aproximately 211 street cred units as of the last survey, Wikipedia is definitely doing well.
xkcd.com - a webcomic of mathematics, love, and language.
The trouble of course with wiki-hardware is that the system adminstration is left to the community.
All Google has done is hand-waving so far.
On the other hand, Yahoo has been one of the earliest Wikipedia supporters according to TFA.
(Please browse at -1 to read this comment.)
Not Athlon
Only on Slashdot would Yahoo's donation be compared unfavorably to Google, when Yahoo has actually provided something, and Google has merely mentioned it.
concrete5: a cms made for marketing, but strong enough for geeks.
Does anyone know why they are being set up in South Korea?
Theres nothing wrong with hand-waving.
Obi-wan did ok by it.
liqbase
The server hardware spec link said the "athlons" in fact are opterons. *sigh*
I have a really elegant proof for Fermat's last theorem. If this sig was only a bit longer...
This is sort of like those school yard spats over a girl.
Wiki is the girl. Google and Yahoo are the two guys.
My mother's advice surely applies to this situation(that I got many years back):
"Stay away from that little trollop! Anyone that causes a fight is not worth it."
Of course, I did hang round that girl. Pretty wee thing. It was all fruitles of course.
Bitch! You whore Wiki!
*begins to cry*
In post Patriot Act America, the library books scan you.
If Yahoo is a "me too" move, Google was a "look how good we are" move.
Regardless, it's good for not only the administrators, but obviously for their large user base too.
Beware: In C++, your friends can see your privates!
Seems to be a war to be the best "opensource" helper. See Google wants to help wikipedia, Yahoo helps wikipedia, Google makes Google summer code ...
;-) ?
What's next
Bonjour !
Up until recently when they moved to a new co-lo this data was out there, but it is unfortunately no longer available. I can say as a fact though that they are currently pushing out about 17 terabytes per month and growing strong. There's a bandwidth graph and instructions to read it on this page of my site.
python -c "x='python -c %sx=%s; print x%%(chr(34),repr(x),chr(34))%s'; print x%(chr(34),repr(x),chr(34))"
True, but think about it, what is the truth for non technical things?
Before wiki and the 'net in general made content become alive, and coming from whatever source, all such discussions were lost. The winner of the argument, or more likely, the one with the arguments that were more pleasing for the ones in charge, would win and get published and later become part of what is taught in schools.
With wikipedia the argument is part of the content and being critic of what you read is a good exercise for the mind.
---- MISSING MISCELLANEOUS DATA SEGMENT --- [sigdash] trolololol
RTFA - it is the Wikipedia guys who are holding up Google's donation, not Google:
"Wikimedia's planned facilities in Amsterdam (The Netherlands), Belgium, and Asia are not online yet, so it would be premature at this juncture to ask Google for something specific when we don't yet have good technical knowledge of what we will need in the coming months following the introduction of these new facilities. Google are eager to help us, and Wikimedia are eager to accept their help, but the Board want to be good stewards of donor money, and this requires them to move carefully"
I'm wondering about setting up a network of boxes running the Coral software. Those have built in fault tolerance so it wouldn't take lots of admin work and would allow accepting many small bandwidth offers, in countries with comparatively low traffic. Makes most content even closer to the end users and spreads the bandwidth load around. Nothing actually happening on this front yet, though.
A very large number of places witih full database servers and page builders, like this Yahoo announcement, would have too much admin overhead - 3-6 of those places is about right.
P2P is a security problem. People can always modify P2P programs to add nasty content and Wikipedia has already seen people trying to upload that and has filters in place to catch and block some things.
Both Yahoo and Google deserve approximately equal kudos for being helpful to the projects. Thanks!
This is a classic case of considering the hardware to be the problem rather than the software. The software has serious issues when it comes to performance and the developers are very slow to address it. Hell, Tim Starling, a lead developer, even stated that one of the design goals of the MediaWiki software was to spend as little time as possible developing it. I kid you not, that's paraphrasing something (with NO exaggeration) that was said in a presentation document which I can find if anyone doesn't believe me.
I've heard some whining from some of the developers because they didn't have a ready made solution for certain things, meaning they would have to put actual *effort* into making their own. The idea of writing glue code (to C code) to make up for a feature lacking in existing php libraries was considered an abhorrent thing.
Their best response to me pointing out flaws in their "development philosophy" was to them retort with the oh-so-clever "well why don't you write something better yourself?" Of course, that phrase is just a code word for "we know it sucks and we're just not willing to put all the extra effort into rewriting major portions of it." Really, it's sad when you have to define your software in terms of someone else (your opponent specifically) not writing something better.
This isn't just unfounded complaints either. The developers have often complained that the existing implementation (and especially the choice to write the original code in PHP) needs to be rid of. They've said it has "everything and the kitchen sink" and that it degrades performance, but aren't trying that hard to get rid of it. They know this as a matter of fact through testing--Mediawiki has a massive overhead in setup time compared to other wiki software.
Not just that, but the Wikipedia admins are all volunteers and aren't exactly the cream of the crop. They took them as volunteers since they were the best ones to devote that much time to it and unfortunately that means they're mediocre and they REALLY are not experienced for such a high traffic website.
If they actually had a paid full time admin who had considerable background in sites like this, you'd suddenly see a massive drop in down time and other problems.
Hello? Wikipedia keeps all versions of an article. Just refer to the specific version in that case.
No, you're thinking of the DL585, which is quad-Opteron (up to 8-cores). The DL385 is dual-processor (though you can install dual-core Opterons to get 4 total cores).
Here is their servers list.
Since I presumably have moderation to burn, I'll say frankly that I'm appalled. Wikipedia is enormously valuable as a resource in objective domains such as hard science and mathematics, but its articles in politically and culturally sensitive areas are abyssmal reflections of popular delusion and political correctness that do an enormous disservice to us all. The cockles of my heart not not warmed.
-I like my women like I like my tea: green-
That's a Wikipedia server admin that's speaking.
XML is like violence. If it doesn't solve the problem, use more.