Slashdot Mirror


Dvorak on Google and Wikipedia

cryptoluddite writes "PC Magazine has an article by John C. Dvorak expanding on the community discussion of Google's offer for free web hosting of Wikipedia. Those against the deal point out that Google may be planning to co-opt the encyclopedia as Googlepedia (by restricting access to the complete database). In a revealing speech given by the Google founders, Larry Page says he would 'like to see a model where you can buy into the world's content. Let's say you pay $20 per month.' Should public domain information be free?" It's a pretty scary scenario painted, but one can hardly take a speech from 2001 as serious evidence these days. Update: 02/16 20:16 GMT by T : This story links inadvertently to the second page of the column; here's a link to the first page.

92 of 449 comments (clear)

  1. Harsh on Google by Cracell · · Score: 3, Insightful

    Google wouldn't be like msn and only show certain articles, plus that wouldn't work with wikipeida since it's user made/edited

    --
    Signatures are so 90s
    1. Re:Harsh on Google by goldspider · · Score: 3, Insightful
      "Google wouldn't be like msn and only show certain articles"

      Oh really? And how do you know that? Just because you know that Google isn't an EVIL company like Microsoft?

      --
      "Ask not what your country can do for you." --John F. Kennedy
    2. Re:Harsh on Google by jarich · · Score: 3, Insightful
      plus that wouldn't work with wikipeida since it's user made/edited

      You mean like Google did with Usenet Newsgroups?

      Don't get me wrong, I like Google, but don't assume that they can't own the only database containing the 'free' information and provide access as they see fit. After all, they are paying to maintain it, right?

    3. Re:Harsh on Google by jarich · · Score: 2, Informative
      You could probably download the entirety of google's usenet archives and set up your own service.

      There's a leap of logic... well, I guess if it's on the web, I can just download it? E-Donkey doesn't host everything for free. ;)

      Here's a little more substantial info: http://www.google.com/terms_of_service.html

      The Google Services are made available for your personal, non-commercial use only. You may not use the Google Services to sell a product or service, or to increase traffic to your Web site for commercial reasons, such as advertising sales. You may not take the results from a Google search and reformat and display them, or mirror the Google home page or results pages on your Web site. You may not "meta-search" Google. If you want to make commercial use of the Google Services, you must enter into an agreement with Google to do so in advance.

  2. I take issue with the submitter by Biff98 · · Score: 3, Insightful

    Wow --

    "It's a pretty scary scenario painted, but one can hardly take a speech from 2001 as serious evidence these days."

    That's horrible.

    1. Re:I take issue with the submitter by andywebz · · Score: 2, Insightful

      However, when you are dealing with technology, old speeches hold less credence on the business practices of today. Time is much harsher to ideas on technology than it is to social issues.

      Would you assume that a company had no internet strategy, and that 640k Ram should always be enough, based on some 1984 speeches? No, because they are outdated, and many things have changed. Just as things have changed alot for google since 2001.

      Feel free to continue blowing comments out of proportion while i'm away.

      --
      Saying "I'll probably get modded down for this", is a magnet for my -1 mod token. I hate to disappoint.
    2. Re:I take issue with the submitter by KillerDeathRobot · · Score: 2, Interesting

      I agree. Just think - Martin Luther King's "I Have a Dream" speech, Pericles' funeral oration to fallen Greek warriors in the Peloponnesian War, and JFK's "We Choose To Go To The Moon" speech - all consigned to the dustbin of history by one arbitrary pronouncement on the part of one self-important pundit. Way to go, dude.

      Does that sentence mean that we should disregard all speeches spoken before, say, June 14 2004? What is PC Magazine's cutoff date here?


      Don't be ridiculous. He's talking about a speech given by a head honcho at a tech company several years ago. The tech industry, as we all know, changes very rapidly. Saying that such a speech might not be relevent any longer has absolutely nothing to do with whether actual important historical speeches are still relevant (which they may or may not be).

      And besides, it was CowboyNeal who said that, not Dvorak (your "self-important pundit").

      --
      Thinkin' Lincoln - a web comic of presidential proportions
  3. Contract? by Poromenos1 · · Score: 4, Insightful

    Wouldn't Wikipedia take measures to ensure nothing bad happened? I mean, that's what contracts are for...

    --
    Send email from the afterlife! Write your e-will at Dead Man's Switch.
  4. Google trying to strip away MS's dominance? by Anonymous Coward · · Score: 3, Interesting

    And this is another thing they can leverage in their war against MS... Next up, a total web-based OS (Firefox/Linux backend?)... Would be interesting to see where this is going; someone needs to stand up to the behemoth that is Microsoft, for the sake of all mankind!

  5. Hookay! There goes my good favour... by aendeuryu · · Score: 2, Insightful

    In a revealing speech given by the Google founders, Larry Page says he would 'like to see a model where you can buy into the world's content. Let's say you pay $20 per month.'

    For a company that claims they are endevouring to never be evil, this strikes me as a pretty evil bait-and-switch type scheme to me.

    I think I'm going to start checking out Yahoo's search engine. Not because I think I'll ever prefer it, but because I think I'd better start getting used to it, just in case.

    1. Re:Hookay! There goes my good favour... by Mr+Guy · · Score: 3, Interesting

      In 2001, that was still a cutting edge idea. People knew there was a way to make everything accessable, but weren't entirely sure what revenue model could support that.

      $20 a month was (and is) a small price to pay for everything, if "everything" is correct and up to date.

      I'd certianly pay a subscription for Google now, because their service is of value to me.

    2. Re:Hookay! There goes my good favour... by Mr+Guy · · Score: 4, Interesting

      In /. case, no, it's definitely not worth any money to me. I use /. to kill time while my project is building at work. Occasionally, there are articles that interest me. My contribution is not putting /. in adblock.

      Google is entirely different. It provides access to information in a format that is much more agreeable to me than other searches I've used. Unlike what others have claim, I regularly click on the ad links because they are often relevant to the information I'm looking for. I personally feel Google maps kicks the crap out of other tools. If they found a way to make their service significantly more usable, it would certainly be worth it to me.

      Hints (2 Things that'd move me closer to being willing to pay):

      Integrate Google maps with movie showtimes, as in IMDB's theater database. If possible, read my local paper and correlate showtimes from there, since not all my local theaters keep their times up to date online.

      Correlate restaraunt searches in google maps with reviews. I'd like a review aggregate for a total star rating of nearby messages when I get directions via SMS. I'd like to be able to filter places that google believes suck, based on their their review data.

    3. Re:Hookay! There goes my good favour... by nadadogg · · Score: 2, Insightful

      Just keep in mind that nothing in this world is free. Someone will have to pay for it in one way or another, be it in money, equipment, time, etc.

      --
      i use linux and windows oh god how can i have an opinion
    4. Re:Hookay! There goes my good favour... by Queer+Boy · · Score: 3, Insightful
      $20 a month was (and is) a small price to pay for everything, if "everything" is correct and up to date.

      Not much to pay at all, I call it my Internet bill and that's why I have the internet.

      The internet was not created to provide a revenue model. Countless companies learned this in the dot.bomb. It's not like cable or satellite where my choices are limited and if I don't pay I don't get content. Wikipedia came about for a reason. If it goes subscription it immediately loses value because now articles are only maintained/created by subscribers.

      If it goes subscription another free/open online encyclopaedia will take its place, the same way that FreeDB came about after CDDB required buying a license to use in applications.

      --
      Not since Marie-Antoinette played milkmaid has looking simple and honest been so fake and complicated.
  6. World Domination by ph34r_Hk · · Score: 2, Funny

    Google's one step closer to taking over the world now...

    1. Re:World Domination by bigtangringo · · Score: 2, Funny

      I, for one, welcome our new 99 zero overlords.

      --
      Yes, I am a smart ass; it's better than the alternative.
  7. Google alrealy has a working profit model. by dj_tsd · · Score: 5, Insightful

    Just hosting Wikipedia would work with google's already profitable model. Why would they bother creating a fee based model for a community product?

    1. Re:Google alrealy has a working profit model. by pilgrim23 · · Score: 2, Insightful

      Dvorak is correct about Usenet Deja-News and the terrible job Google has done to it. If the Wikipedia suffers a similar fate it would be just as useless. No matter how many ads are on a page no one visits, the hit counts will tell the tale..

      --
      - Minutus cantorum, minutus balorum, minutus carborata descendum pantorum.
    2. Re:Google alrealy has a working profit model. by pilkul · · Score: 2, Insightful

      Wikipedia is under the control of the Wikimedia foundation; Google's programmers wouldn't be making any changes to the way it works. This is just an offer to provide servers and bandwidth.

  8. Oh great. by bigtallmofo · · Score: 5, Funny

    What do we do if Google turns evil?

    --
    I'm a big tall mofo.
    1. Re:Oh great. by grazzy · · Score: 4, Funny

      When my friend. When if not already.

    2. Re:Oh great. by CdBee · · Score: 3, Funny

      Bomb them, of course!

      Sorry, I've been watching too much C-Span.

      --
      I have been a user for about 10 years. This ends Feb 2014. The site's been ruined. I'm off. Dice, FU
  9. There could be by Exter-C · · Score: 5, Insightful

    This is something that will be very interesting. The information in wikipedia should be available to everyone for free. There could be an interesting situation where people could subscribe to a service to have no advertising. That way it would pay for the wikipedia services to continue running, while still providing the benefit to the community. I know I use online services reguarly and its something that I would pay a nominal fee for without complaining to much.
    However it must have both free and subscription based services for it to be a viable system.

    1. Re:There could be by PornMaster · · Score: 5, Interesting

      Is IMDB the model for Wikipedia going forward?

      Free not-necessarily-accurate data for everyone, fact-checked and extended commercial data for some?

  10. Wikipedia needs hosting help, but... by guitaristx · · Score: 3, Insightful

    Lots of people know that Wikipedia hasn't had the server power to keep up, but a pay-for-service model isn't the answer. A free web-based encyclopedia is what makes wikipedia so great.

    --
    I pity the foo that isn't metasyntactic
  11. Licensing? by Omicron32 · · Score: 5, Insightful

    I thought the content on Wikipedia was licensed under a free, open license? How can Google "revoke" that to do this?

    1. Re:Licensing? by maztuhblastah · · Score: 5, Insightful

      They can't...but has that ever stopped Dvorak from one of his "predictions", i.e. Wild Ass Guesses (TM), before? Seriously, this guy is just a pundit. He makes his living by spouting off stupid, controversial crap...that's the only reason that he's published: controversy == readers/sales.

      Bottom line: again, Dvorak's talking out of his ass, just like when he claimed that there were almost no linux applications that could run on the PS2, he's making an uninformed guess based on something he heard somewhere.

    2. Re:Licensing? by pohl · · Score: 4, Informative

      Not only that, but the open content license also allows Google to profit from providing premium access (read: low-latency) to their own instance of the content. This sort of scenario was anticipated from the beginning when the content license was discussed, and it was considered to be an indicator of success.

      --

      The "cue the foo posts in 3, 2, 1..." posts will commence with no subsequent foo posts in 3, 2, 1...

    3. Re:Licensing? by Raul654 · · Score: 4, Informative

      They cannot. This article is nonsensical FUD from someone who doesn't know what he is talking about. (--A wikipedia admin)

      --


      To make laws that man cannot, and will not obey, serves to bring all law into contempt.
      --E.C. Stanton
    4. Re:Licensing? by Raul654 · · Score: 2, Informative

      It means that if you copy it and modify it, you are required to license the new version under the GFDL and acknowledge Wikipedia (and that a hyperlink satisfies our acknoledgement requirement). Is that supposed to be scandalous?

      --


      To make laws that man cannot, and will not obey, serves to bring all law into contempt.
      --E.C. Stanton
    5. Re:Licensing? by Quantum+Jim · · Score: 2, Insightful

      Not only that, but the open content license also allows Google to profit from providing premium access (read: low-latency) to their own instance of the content.

      Not only that, but from Google's point-of-view Wikipedia provides the benifits of Yahoo's original function and the Open Directory Project. That is, the community's openness actually seems to provide insentive to edit and add to the the content, while the collective wiki gardening also removes wrong, out-of-date, or very low quality articles and spam from the system. Thus Wikis - and Wikipedia in particular - generally provide high quality links that search engines can use to target and/or refine their search bots. This helps them with searching, targeting adwords, anti-spam filtering in Gmail, among others. That's why spammers try vandalizing them so much. Given Google's lead among their peers with document analysis algorithms, they have a high incentive to support wikis like Wikipedia.

      --
      It is impossible to enjoy idling thoroughly unless one has plenty of work to do.
      - Jerome Klapka Jerome
    6. Re:Licensing? by Jamie+Lokier · · Score: 2, Insightful

      They cannot restrict copying of the content, but they can limit access to it via Google's servers. The GFDL does not prevent this.

      That's why it's important that there are always a few people maintaining mirrors of the entire Wikipedia.

      It's also important that if Google ever stops the ability to make mirrors of the entire Wikipedia including updates and update history, that a big public fuss is made.

      If you think it can't happen due to Wikipedia's license, think again: Usenet is presumably public domain, but Google aren't exactly falling over themselves to let people mirror that archive.

      Some people will say that Google did real work to put together the Usenet archive, and it's within their rights as a business to limit access to it. Fair enough: just remember, that it's also within their rights as a business to limit access to their instance of Wikipedia in future, and if nobody has an up to date mirror, that will be a real limitation.

      I'm not saying worry about it. Those are only possible scenarios. I'm saying: be diligent in keeping it open and fully accessible; don't let it slip like CDDB or the Usenet archive.

      -- Jamie

    7. Re:Licensing? by Raul654 · · Score: 2, Interesting

      "They cannot restrict copying of the content, but they can limit access to it via Google's servers." - Wrong. The GFDL requires them to provide a transparent copy on a nondiscriminatory basis. Wikipedia does this via download.wikipedia.org, and google would be obligated, at the very least, to provide something similiar.

      --


      To make laws that man cannot, and will not obey, serves to bring all law into contempt.
      --E.C. Stanton
  12. even if it became a "premium" service,we'd survive by humuhumunukunukuapu' · · Score: 4, Interesting
    It's not like Wikipedia is the only place to get information on the internet [and don't forget that real world out there].

    if someone ruins it, sure it is a shame, but something else will pop up to replace it. The internet is just a big game of whack-a-mole, no matter if you are the RIAA, the Feds, a kiddie porn fiend, or a information seeker.

    It's kind of the whole point...

    --
    i saw the baby, and the baby looked at me
  13. Is this just alarmist talk from a doomsayer? by StateOfTheUnion · · Score: 5, Informative
    hose against the deal point out that Google may be planning to co-opt the encyclopedia as Googlepedia (by restricting access to the complete database).

    Can they do that? The wikipedia is governed by the GNU Free Documentation License . . .wikipedia details here.

  14. 'Twould be a pity by banana+fiend · · Score: 2, Interesting

    IF (and it IS an if), google do start restricting and charging - it would be a pity.

    This information was collected for free, and would be disseminated at a cost. While this has been done before (volunteer organisations are not new) - it would probably lead people away from making the effort in the next thing that comes along and is "by the people for the people"

    --
    Johns: Well, how does it look now? Riddick: Looks clear.
  15. This is why Jimbo didn't want the details to leak by Neophytus · · Score: 5, Informative

    Speculation runs rife. I guess security through well... not very obscurity's bound to get someone chatting in the end.

    The deal in the short to medium term with wikipedia is expected to be the provision of about a dozen caching servers. No actual database work would be done by google. There is already a small (3) squid cluster in Paris that does this for users in the UK and France saving on some transatlantic bandwidth.

  16. Re:First rule about public businesses by Anonymous Coward · · Score: 2, Interesting

    What exactly are you throwing a hissy fit about? By sold-out, do you refer to google becoming publically as opposed to privately traded?

  17. Page would be unlikely to charge by Morosoph · · Score: 3, Insightful
    It would smack of 'evil' in contradiction to his company's motto. More likely, he would use it, like Google News, as a draw to Google, gaining mind-share, and indirectly boosting revenues.

    Whether Wikipedia should accept is another matter. I don't think that they should. It's much easier to appear independent if you have to pay your way, and for an encyclopedia, appearing independent is really pretty important.

  18. An answer to his question by eseiat · · Score: 5, Insightful
    Should public domain information be free?

    Yes, yes it should indeed be free. Information is the essential ingredient to the advancement of society. This is why public libraries, schools, and lectures were created, so that information could be dissemenated to all individuals who actively sought it out for themselves and for their children. Charging $20 a month for access to information is an outrageous idea and is particularly frightening when uttered by an individual whose company holds the key to so much of the electronic information on the web. I think if they continue with his "vision" of the future, Google's usage will plummet quite rapidly.

    Hasn't the Open Source community taught anyone the value of free information exchange??

    1. Re:An answer to his question by Mr+Guy · · Score: 3, Insightful

      Look where the quotes end. HIS vision at that time was that Google would be able to answer any question, at any time, as fast as you asked it. Think of Google even more than Google is now (staggering). Google that can answer questions like Ask Jeeves tried to. Google that can, perhaps, anticipate your next question. Google that not only references what's available, but makes educated guesses at what isn't available (Your result turned up no matches, perhaps you meant... or Your result turned no matches, your local library has a book...) and is able to provide you with what you probably really meant in a nonobtrusive manner (You searched for Chinese restaraunts near you, look at the bottom of the page for reviews of these restaraunts).

      Google has already done amazing things with aggregating data that is useful to the searcher. If they could take it much farther, $20 a month would be a small price to pay.

    2. Re:An answer to his question by SpongeBobLinuxPants · · Score: 2

      First off, I am against charging $20 a month for info. But Google isn't the first to do this. There are websites that will charge you for your credit report, in my state you're allowed to see your credit report free every 6 months. They also charge an extra fee for seeing your credit score. Public court records are also sold on the internet. So, yes, it is bad for google to want to charge for something that is free, but they are definately not the first company to come up with this idea.

    3. Re:An answer to his question by radar2k2 · · Score: 2, Insightful
      Yes, yes it should indeed be free. Information is the essential ingredient to the advancement of society. This is why public libraries, schools, and lectures were created...
      Um.. Who is paying for all these "free" institutions? (Hint: They aren't "free") There has to be a revenue model somewhere. It could be use fees, it could be a progressive tax system, it could be a regressive tax system, but it is not free. To ignore that fact and claim that charging money to access information is evil is disingenuous. Is it good or evil when my town raises property taxes to pay for the new library that you want to be "free"? How about if my taxes go up to pay for the new metropolitan city-owned Wi-Fi network that some private corporation could probably build and operate at a lower cost per user?
    4. Re:An answer to his question by danbeck · · Score: 5, Insightful

      Sure, the information should be free, but who is going to pay for the webservers to host it? Or the bandwidth to deliver it?

      You may erroneously think that your local library is free, but in fact it's not. You pay taxes that fund the library. The government doesn't have some magic pot of gold that it pays for that stuff you know... it's most certainly *not* free.

  19. Re:Hmm by Leroy_Brown242 · · Score: 2, Insightful

    They are also not being uncompetitive. They are simply providing tons of services and spreading like mad.

  20. Re:First rule about public businesses by Peyna · · Score: 4, Insightful

    As opposed to private businesses which have no interest in maximizing profits?

    --
    What?
  21. Google Groups is still Usenet... by GillBates0 · · Score: 5, Informative
    As I understand it, Google Groups is just one more interface to Usenet, like zillion others offered by ISPs, schools, and other servers. The propogation mechanism of messages is still the same, and they just offered a way for people to access News using a web based interface (lots of other sites offer this) rather than through a regular News reader (rtin, etc).

    I'm fine with Google offering a faster mirror/interface to Wikipedia, because mirroring of information is always good. From the last /. article on the subject, I gathered that Google would offer their faster processing power and ub3r bandwidth to Wikipedia....but that doesn't necessarily mean they get to hijack the content....they'd just provide a faster way to get to information that's mirrored elsewhere.

    --
    An Indian-American Hindu committed to non-violent thought/speech/action alarmed by the global explosion of radical Islam
  22. How to Stop it . . . by Dausha · · Score: 5, Insightful

    First, if the relationship between the Wikipedia and Google can be properly maintained, and boundaries established, I think this is a good thing for the Wikipedia.

    People are fearful that Google will attempt to co-opt the Wikipedia. That's what is apparent in the Dvorak article. However, what Wikipedia needs is a slick lawyer to write a contract between Google and Wikipedia. (IANASL)

    1. Google will host the Wikipedia as a donation.
    2. Google will not restrict access to the Wikipedia except as mutually agreed upon by both parties, and a public page to explain what restrictions and why. At no time will restrictions be based upon subscriptions or charges.
    3. Wikipedia will put a slick Google icon somewhere on the page to say "thanks Google for hosting us."
    4. This agreement may be terminated with fair notice to the other party at any time.

    If Wikipedia is able to maintain its autonomy, and the relationship is clearly labelled a donation of server space, then I think the Wikipedia could be hosted on Google.

    --
    What those who want activist courts fear is rule by the people.
  23. DON'T PANIC by Quinn_Inuit · · Score: 4, Insightful
    First, Wikipedia is licensed in such a way that, if I want a copy of the whole thing to fork it, they have to give it to me. If someone doesn't like where it's going, they can start up their own. The GNU FDL isn't perfect, but it'll work as advertised.

    Second, Google may just want to be in on the ground floor if and when Wikipedia decides to allow Adsense-type ads.

    Third, companies do often do charitable things. It's a tax write-off.

    Given those three things, I recommend that some commenters pay attention to the big, friendly letters in the subject line.

    --

    Stop learning! Only you can prevent esoterrorism.
  24. Usenet... by MrBandersnatch · · Score: 2, Interesting

    Still IS on the front page of google...well 1 click away. And the search is still perfectly usable...what IS Dvorak on about?

    While I will agree with him that DejaNews should NEVER have ended up in the hands of a corporate entity when the oportunity came for it to enter public hands; google havnt done a bad job of maintaining it. Its just a pitty no-one has come up with a service to compete with Google on that level since it COULD be a lot better.

  25. Re:Hmm by bunratty · · Score: 4, Insightful

    What does Google have a monopoly on? There are other search engines I can easily use. Maybe you should look up the definition of monopoly?

    --
    What a fool believes, he sees, no wise man has the power to reason away.
  26. Re:First rule about public businesses by mOoZik · · Score: 3, Insightful

    You are a hippie idiot. Any company is a company to make MONEY, not to serve some general good. Guess what, even Kaiser and Cancer treatment places are there to MAKE MONEY, not for any other purpose. Maybe the people working there do so out of the "kindness of their heart," but that is not the intent of the organization as a whole. Same with Google. Same with Slashdot. And same with anything else.

  27. Dirty tricks 101: quotes out of context by saddino · · Score: 5, Informative

    In a revealing speech given by the Google founders, Larry Page says he would 'like to see a model where you can buy into the world's content. Let's say you pay $20 per month.

    The only thing "revealing" about that article is that Page continues "Somebody else needs to figure out how to reward all the people who create the things that you use. " In other words, what Page would like to see is a system where "users" pay for accessing content and "contributors" are paid for providing it.

    This /. story could have equally read "Does Google Want to Pay Wiki authors?" but of course, that would have derailed cryptoluddite's agenda to smear Google.

    To the editors: when you see the words may be planning, just ignore the submission in the future. TIA.

  28. "should public domain information be free?" by turnstyle · · Score: 5, Insightful
    Regarding: "should public domain information be free?"...

    Public domain information is already free (free as in speech), but that doesn't mean that somebody can't also charge for it.

    It's no different than the GPL -- also free as in speech, but not necessarily free as in beer.

    --
    Here's what I do: Bitty Browser & Andromeda
    1. Re:"should public domain information be free?" by MikTheUser · · Score: 2, Insightful

      Couldn't you then, just as the GPL allows, buyinto the service, copy everything out of it and make it available for free? Of course, it would be easier to have the project forked into the commercial (EVIL) and a free (GOOD) part.

    2. Re:"should public domain information be free?" by l3pYr · · Score: 3, Insightful
      the commercial (EVIL) and a free (GOOD) part

      Statements like this are hurtful to the FOSS movement. Assuming all commercial interests are inherently evil is ignorant, being able to create profitable, Free (as in FOSS) commercial projects is vital to the survival of the whole movement. The majority of skilled programmers will eventually go where the money is, especially once they have a family, simply not having time to freely (as in $$$) contribute to FOSS projects. In a capitalist world such as we live in, money is life blood. Good-will contributions and free press will only last so long. Once the bandwagon's run out of gas who will be left? People who can make a living in FOSS, that's who.

      --
      RTFA and cite your sources or prepare to get pwnd
    3. Re:"should public domain information be free?" by IBeatUpNerds · · Score: 2, Informative

      It's no different than the GPL

      It's quite different from the GPL. Public domain works are free for any use and not bound by restrictions (typically). You can do things with Public Domain software that you can't with GPL. It typically means there is no license attached other than a disclaimer saying "Public domain".

    4. Re:"should public domain information be free?" by Headcase88 · · Score: 2, Funny

      here is the google mirror...

      .siht fo em sdnimer taht

      --
      "When the atomic bomb goes off there's devastation...but when the atomic bong goes off there's celebraaaaation!"
  29. Where to start? by Otter · · Score: 2, Insightful
    I'm getting ADD trying to figure out where to begin to start responding to this -- Dvorak's claim that Google is somehow responsible for the demise of Usenet as a result of their ownership of the DejaNews archive is so moronic that I can't bring myself to move on to the Wikipedia issue.

    It seems obvious enough to me that DejaNews/Google Groups has kept Usenet far more prominent than it would have been otherwise (Dvorak doesn't seem to get that the archive isn't ownership of Usenet itself), but given that he's claiming that Groups isn't linked off the Google front page at all, why bother arguing details.

    Whatever. If dumbasses who have seen Star Wars too many times enjoy droning on about how Google used to be Good and Not Evil, but is now Evil, who am I to argue? At any rate, Wikipedia isn't going anywhere.

  30. Dvorak is stale by BrK · · Score: 5, Insightful

    15 or so years ago Dvorak had some insightful articles, even if they didn't always come 100% true. Nowadays he's another has-been from a past era trying to pimp his FUD and general tech conspiracy theories. IMO, if you steadily bet AGAINST Dvorak you'll come out ahead over the long run.

    In the days of 10Mhz 286's I used to really enjoy John's columns. Now, I don't know if I've just gotten smarter, or he's gotten dumber (heh), but I can't remember the last time he didn't seem like a technology lunatic to me.

    --
    -This sig intentionally left blank
    1. Re:Dvorak is stale by CAIMLAS · · Score: 2, Insightful

      I remember reading PC Magazine when I was 15, and I just gobbled up his blurbs about the new, shiny things that were coming out on the market. Then I stopped reading PC Mag for a couple years.

      At 18 or so, I picked it up again and took a look. The Internet was becoming prevailant, and his stuff was swill. Pure nonsense, really. He was at least 6 months behind what the Internet (largely via slashdot) had already alerted me to what was going on, and going to happen. This was in 1998 or 1999.

      I recognized him for what he was, then: a mis-placed journalist with an interest in technology who managed to catch the coattails of the IT explosion. He was marginally tech savvy in 1985, I imagine, but now it's rediculous how ill-informed he is. He was a mouthpiece for tech companies to large corporations to begin with, but now a person has to wonder where he fits in, what with the "new" corporate tech culture.

      He's like the senior tech guy at the office that's been there for 30 years and knows nothing (maybe he still says things like "AMD processors aren't compatible with Pentiums!" or something similarly circa 1994) - but they won't fire him or get rid of him. They keep him around for laughs and because he's got a name for himself (whether the name is good or bad).

      You're probably correct in both senses: your knowledge and discernment has increased, while he never had discernment and hasn't really increased in knowledge.

      --
      ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
    2. Re:Dvorak is stale by llywrch · · Score: 2, Informative

      > 15 or so years ago Dvorak had some insightful articles, even if they didn't always come 100% true. Nowadays
      > he's another has-been from a past era trying to pimp his FUD and general tech conspiracy theories. IMO, if you
      > steadily bet AGAINST Dvorak you'll come out ahead over the long run.

      You got it in one. Dvorak must have remembered that he had a column due, indulged in his intoxicant of choice, picked some random news items & used them as an excuse to indulge in some superifical reflections.

      Speaking as someone who has contributed for a long time to Wikipedia, there is no simple way that Google could take control of Wikipedia. (There are days when I wonder if *anyone* could control Wikipedia.) Because its content is licensed under a form of the GPL (as well as many parts under the Creative Commons license), if you can read it, you can copy it & fork it. Making a mirror of Wikipedia is not only possible, it has been done: there are countless websites that mirror Wikipedia's content, some more up to date than others, some using the material as a starting place for their own encyclopedias. Further, many of the non-English Wikipedias have their own communities & are establishing their own servers; even if Google somehow got control of the main servers in Florida, it would be trivial for the groups in Europe to immediately fork.

      And community is an important part of this. Were Google to start limiting access to Wikipedia, *many* volunteers would leave -- either to a fork, or stop contributing entirely. In a very short time, what was left of the original Wikipedia site would have minimal value, ravaged by bitrot, out-of-date information, & unchecked vandalism.

      IMHO, the best Google could do here is offer a better interface to Wikipedia than Wikipedia has. The ability to edit Wikipedia will undoubtedly remain on their servers; to attempt to share this ability would result either in a technological mess or a fork.

      Lastly, having exchanged emails with Jimbo Wales, the de facto leader of this project & having read much of what he has written about Wikipedia, I sincerely doubt he has any interest in converting it into a for-profit Internet venture. For one thing, he has been working to push the legal responsibility off of his shoulders & onto an international board of directors. And for another, he seems to be having too much fun travelling around the world on behalf of Wikipedia & its related projects: he clearly gets far more satisfaction from this being open & free to everyone, than he would if he converted this project into a big pile of cash. At present, more people listen & value what Jimbo has to say than what Dvorak writes; that fact alone must stick in this has-been Ziff-Davis' columnist's craw & color anything Dvorak has to say about Wikipedia.

      Geoff

      --
      I think I see a trend here. Maybe for them it really would be easier to muzzle the entire internet than to produce p
  31. Land Grab by Doc+Ruby · · Score: 2, Interesting

    This is just like what happened to the CDDB (Compact Disc DataBase). It was open source, public server, free client. Millions of us entered our CD data, in exchange for access to everyone else's data, for free. Then the founders sold the operation to GraceNote corporation, which took it proprietary, and slapped licensing restrictions on access, protected by secure login - locking out all the "owners" of the shared data we'd entered.

    Some other people cloned the DB server into FreeDB, and jumpstarted it by datamining the CDDB server while it was still publicly accessible. We'll probably need to do that with Wikipedia. How big is it? Since "Content is available under GNU Free Documentation License", we should take a page from the FreeDB folks who saved our data from privateering clutches. How big is Wikipedia, in GB? Sounds like a job for BitTorrent, or perhaps Archive.org, or maybe a more passive archive, which would redistribute it only if access is restricted. Just distributing copies of the valuable data we've all produced would probably preempt Google, or any other "benefactor" from taking Wikipedia private. Let's not repeat the history that stole from us.

    --

    --
    make install -not war

    1. Re:Land Grab by FroBugg · · Score: 2, Insightful

      The CDDB thing comes to mind, obviously, but it's a very different situation.

      All the contributions to CDDB were merely info copied from liner notes and CD cases by fans with some free time. GraceNote was a bastard when they went closed, but it's hard to argue that the information was owned by anyone (except perhaps the original artists).

      With Wikipedia, you've got original works. These are things that are copyrightable, and as far as I know, the original authors of all the articles still retain their copyrights. The Wikipedia license doesn't seem to say anything about surrendering that. So if Google were to try and close this and charge for redistribution, they'd be violating the license under which these thousands of original authors released their work and opening themselves up to a very valid lawsuit.

  32. "serious evidence" by CAIMLAS · · Score: 4, Interesting

    As serious evidence? No. But thanks for the commentary.

    No, what it is telling of though, is the mindset at Google at the time of writing. This little insight is important now because it's quite possible that their end goal is to monopolize information in such a way as to extract their income from it.

    As they've recently made copious amounts of money and gained incredible power, it's quite possible its gone to their heads. Let's not paint them as a humanitarian group just because we like them: they are a company, after all, and have the same potential for evil that Microsoft (or any large company or government) has and does demonstrate.

    --
    ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
  33. Re:Is this just alarmist talk from a doomsayer? by tdvaughan · · Score: 3, Informative

    Since the copyrights are owned by the people who contribute to the articles, Google would have to contact each of them and ask them to relicense their contributions under a less permissive one. It's a bit like when that dude asked if a Linux kernel snapshot could be released under a BSD license for $50,000. Not going to happen.

  34. Dvorak? by Cytlid · · Score: 2, Interesting

    That the same guy in PC magazine? I used to read that eons ago... I found myself in the grocery store and read his latest column (in PC Mag). It was kinda interesting, he was mentioning the Cell processor.

    But he incorrectly stated, not once -- but three times, that the Cell was going to be a 250 "Teraflop" processor.

    Dunno about him, but everywhere I've seen info on this chip, it was a gigaflop processor, not teraflop. Don't believe me? Go pick up the most recent PC Magazine, see for yourself.

    --
    FLR
  35. Re:Hmm by Anonymous Coward · · Score: 2, Informative

    Google seem to be attempting to become the Microsoft of the internet.

    Except Google is providing useful services that people want to use.

    For free.

    We get value out of using their services.
    The advertisers get value out of the exposure they get, which is great, because the advertising still isn't annoying.

    Google isn't squashing competitors with shady business practices, they are simply providing the best, most innovative services for the time being.

  36. Re:Is this just alarmist talk from a doomsayer? by BrettJB · · Score: 2, Informative

    "You may not use technical measures to obstruct or control the reading or further copying of the copies you make or distribute."

    --From the GNU FDL

    So, Google could not legally prevent access, no could they prevent content from being mirrored. Don't like Google? Then help maintain a wiki mirror elsewhere.

    Seems to me Google just wants to co-opt an information resource that has become extremely popular of late. They'd like to be to serve it up as they do now, but with Google ads sprinkeled in the sidebar. Since I currently ignore the Google ads now in their searches and in my Gmail account, this wouldn't be an issue for me personally.

    --
    Smell that? You smell that? Burning karma, son. Nothing in the world smells like that...
  37. Re:Is this just alarmist talk from a doomsayer? by dj_tsd · · Score: 2

    Perhaps the whole article was just flamebait.

  38. Public domain in print publishing by Junior+J.+Junior+III · · Score: 4, Insightful

    Books that are in the public domain still cost money. Anyone has rights to publish them, but publishing them is still a business enterprise and still costs money. If google hosts Wikipedia, they ought to be able to attempt to make money off of it, but NOT by leveraging IP ownership or DRM. As long as the information can still be freely distributed as a public domain resource, mirrored by other interested parties, etc., then I don't see a problem with google hosting and charging for access.

    --
    You see? You see? Your stupid minds! Stupid! Stupid!
  39. Dvorak is a columnist, he's out for a reaction by ianscot · · Score: 4, Insightful
    Anything this particular source suggests comes with a salt shaker full of salt for me.

    Dvorak's doing much the same thing for the tech industry that your paper's sports columnists do for the local teams. His role isn't "provide a balanced picture of such-and-so," it's more like "provoke a reaction by pushing every subject to distorted extremes."

    Every sports section has at least one writer like that. Their job is to generate traffic, or responses, by staking out polemical opinions. Usually the one writer who pulls this duty paints a bleak picture of the local teams' moves, so as to get the loyalists to write in. It helps circulation. The same people work extra shifts on call-in shows, pretty often.

    In this case, our sage has consistently been on the wrong side of basically every technology he's commented on in my book. He's a sort of gadfly to all things Apple, for example. (His reaction to the idea of the mouse was as spectacularly wrong as anything ever written on computers.)

    --
    "Fundamentalism" isn't about divine morality. It's about human authority.
  40. From the speech in question by Infonaut · · Score: 3, Informative
    "One risk of that is that people don't get paid for their content, which is clearly a problem. I'd personally like to see a model where you can buy into the world's content. Let's say you pay $20 per month and get access to the world. Somebody else needs to figure out how to reward all the people who create the things that you use."

    It seems to me that they're talking about copyrighted content here. Rather than concocting a plan to bundle up free content and make people pay Google for access, it looks to me like Page was actually talking about reasonable means of access to copyrighted information.

    --
    Read the EFF's Fair Use FAQ
  41. I'm sorry, by The+Cydonian · · Score: 2, Interesting
    but I think no one here has been paying enough attention to Microsoft lately.

    In particular, ladies and gentlemen, consider MSN Search's fantastic Encarta Integration feature; you ask, for instance, "Who killed Abraham Lincoln? (to take MSN's own sample search string), and it gives you the answer. As much as I hate MSN, I think this makes it MSN 1 - Google 0; nope, Google's answer.com integration just doesn't match.

    I'd consider Google's offer of hosting Wikipedia sites in the light of this feature offering by Microsoft.

  42. They do not own any information in wiki by iamacat · · Score: 2, Interesting

    Just offering a free service, in exchange perhaps to displaying some text ads or offering more relevant results to their searches. If someone is not satisfied, he can always host a copy himself.

    As for subscription or pay-per-view information services, I am all for it, even for $100 per month, if the knowledge/art I get is not further restricted - I can burn a CD and give it to someone who can not afford access.

  43. Noted Windbag Trolls for Page Views. /. Suckered by maggard · · Score: 4, Insightful
    Seriously, who gives a flip what John Dvorak burped up this week? He's a whored out hack whose career has declined to regularly posting outrageous things in a bid to get attention and pretend some degree of relevancy.

    PC Magazine is zombie, it's empire crumbled, aside from it's regular product comparison charts (which are widely blamed for much of today's feature-bloat) nobody would still be aware of it's continued existence. From that sad little bailiwick Dvorak bleats for attention and worse yet the gullible wanna-be defenders rush to dispute him.

    This week he's on a smear against Google & Wikipedia. It could as well been another (willfully) know-nothing Linux FUD article, or another Mac-troll, or whatever. They're all trash and only PHB's struck in the 80's still pay the slightest attention to his "opinions" (quotes because I don't think be means a bit of what he says himself.)

    The folks who run Wikipedia are notably honest. To date the folks at Google have done pretty well by their "No Evil" credo. Everything on Wikipedia is open so if need be it could be quickly reconstituted elsewhere. Thus, whatever the negotiations between Wikipedia & Google there's nothing to fear.

    If the current Wikipedia administration does something heinously stupid the project will route around them. Besides which the best guesses are Google is talking bandwidth & caching, perhaps prioritized ranking, not ownership.

    Dvorak, he's taking an old quote out of context and trying to create a scare. That's not reporting, or even editorializing, that's just baiting, pure & simple. Don't play into his game, he's the SCO of journalism.

    --
    I don't read ACs: If a post isn't worth so much as a nom de plume to its author then I wont bother either.
  44. This is all fud by Raul654 · · Score: 4, Informative

    First, full discloser - I'm a long time wikipedia user and I probably accidentally played a peripheral role in breaking this story. I first heard about the google deal back in July. Google is not the first company to offer to host wikipedia. The typical offer comes from "Mom and Pop ISPs" (Jimbo's words) that really don't have any idea what they're getting themselves into (1,400 hits/sec is a helleva lot to do for free). What I have to say in reply to this story is - it is, IMHO, totally FUD. It's completely hypothetical, and it's unrealistic. You have to remember - all the text on Wikipedia is licensed under the GNU Free Documentation License or in the public domain; all the images and audio are licensed under the GNU Free Documetnation license, or CC-by-SA, or something liberal equivalent. So even if, on the off chance, Google succumbs to the Corporate pressure to be evil, anyone can take the text and reuse it in less evil ways. Furthmore, I trust Jimbo, Angela, and Anthere (the visible members of the board) in dealing with google to make sure the deal is done right by the rest of us contributors. There's a long history on Wikipedia of being against ads of any form - the spanish wikipedia forked several years ago over hypothetical discussion of it.

    --


    To make laws that man cannot, and will not obey, serves to bring all law into contempt.
    --E.C. Stanton
  45. Wrong. by Raul654 · · Score: 5, Informative

    You're wrong. The problem wasn't that we didn't have enough servers, but that the servers we had were misconfigured. The slowness experienced in January was resolved when the configuration bugs were ironed out. The problem is a lack of skilled sysadmins and developers. (And for the record, we just put in an order for 10 more servers)

    --


    To make laws that man cannot, and will not obey, serves to bring all law into contempt.
    --E.C. Stanton
  46. Why volunteer to help a for-profit company by G4from128k · · Score: 3, Interesting

    I fear that authors/editors would withdraw from Wikipedia if it were under the arm (or in the iron-fist) of a for-profit company. If these people felt like Google was profiting on the backs of their freely-contributed content, these content creators would leave and the Wiki would whither for lack of fresh/updated content. Donating time so that other may profit does not seem likely.

    What is interesting is that Amazon makes this work. The company is clearly a for-profit entity. Yet its crown jewels are the volunteer-created book reviews. I'm not sure what makes this work. It might be that friends-of-authors are motivated to post glowing reviews, it might be that people who disliked the book are motivated to post scathing reviews, it might be that some reviewers simply like to publish, or all of the above. Perhaps Wiki/Google-pedia could borrow this model to mix free-labor with for-profit.

    Looking further into the future on an alternate path, I wonder if Googlepedia could become a fully for-profit (or at least self-sufficient) professionally run and staffed encyclopedia. With micro-royalties to authors/editors (and moderation-based revocation of payments for "bad" content), the organization would attract content creators on a for-pay basis. This aligns the motivational underpinnings of the organization with those of the content creators. The current Wikipedia is for-free people creating for-free content. A future Googlepedia could by for-pay people creating for-pay content.

    One overriding lesson from Wikipedia (and Slashdot for that matter) is the ultimate necessity of sources of hard currency for online sites. As long as something is small (and below a certain scale of popularity) it can survive on donated hardware, bandwidth, or the benevolence of a monied patron (someone who pays the hosting bills out-of-pocket). But once it reaches a certain scale, the cost of serious server power, bandwidth, and professional administrators pushes the budget far beyond the hobby scale. Although pleas for donations can help, I suspect large-scale sites must, ultimately, turn to ads, tie-in product sales, and subscriptions.

    What is fascinating, in a long-term trend sense, is that the cost of scale are steadily declining. Cheaper hardware, declining bandwidth costs, and improvements in systems management tools mean that sites can reach ever-larger scales before generating prohibitive burn rates on costs. The number of visitors that a hobbyist/free-site can support continues to rise. Perhaps Wike need only wait for the singularity point when the cost to reach (and serve packets to) the entire world is within the reach of a home-grown, volunteer-run organization.

    --
    Two wrongs don't make a right, but three lefts do.
  47. Information wants to be valuable by Have+Blue · · Score: 2, Interesting

    Information can be free, sure. The process of obtaining that free information is not necessarily free. The free information is hosted on a server that must be bought, and transmitted over a connection that must be bought. If it's a large-scale hosting project, support staff must be present to keep it working, and these staff must be paid. The person accessing the information is paying their ISP for their own connection, and had to buy their own computer. How much would you pay for a wikipedia that responded to requests as fast as Google does to searches?

  48. Let's talk about the "revealing speech" by drinkypoo · · Score: 3, Interesting

    The speech certainly is revealing - it reveals cryptoluddite's agenda, which is anti-google.

    One risk of that is that people don't get paid for their content, which is clearly a problem. I'd personally like to see a model where you can buy into the world's content. Let's say you pay $20 per month and get access to the world. Somebody else needs to figure out how to reward all the people who create the things that you use. This is basically what happens with a lot of systems today. Radio stations pay into a big fund, and then the organization decides which labels and which artists to reimburse, based on what got played on the radio. It's a nice model because it allows access to everyone for everything that exists, but you don't have to think about, "Oh, I'm going to spend five cents to look at this web page" or things like that. That will allow content producers to still get rewarded for what they do.

    If you look at the quote in context, I think it's pretty clear that google is not talking about doing the selling, unless they are the gateway to ALL the content. They will never be that gateway. I do think that there is a market for commercial versions of some of this media, but I think the future is that you will pay only for directed media, and for convenient access to media. For instance a newspaper will have several classes of information, based on what they think they can sell to who; There will be information that is free on the web and also in print, information that is included in the cost of the paper but for which you must pay extra on the web, and so on.

    In the meantime sites like E2 and Wikipedia will probably be freely available for the forseeable future, but I would like to see them have commercial or "pro" versions of the site. For example, the pay site would have full-text searching, and the free side might not (and in both cases, currently does not.) You would also be able to enter RFBs for research papers, and you could accept them based on price and posting history. This model would work better for E2 than for Wikipedia due to Wikipedia's collaborative nature, but it is not inapplicable to Wikipedia.

    Anyway, any comissioned research would become a part of the database at an appropriate time (possibly part of the license agreement) and thus everyone would benefit. At the minimum, the site would make a commission, which would definitely benefit all of the service's users.

    This is precisely the way software is going, and I don't see any reason that all kinds of media won't see the same development. In fact, I see no way that any kind of media can survive without making this transition.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  49. Storage allocation by lynx_user_abroad · · Score: 3, Insightful
    There are two ways to destroy a file; you can overwrite the blocks with zeros, or you can remove the inode.

    Similarly, there are two ways to stop people from reading a library book: you can remove the book from the shelf, or you can just remove it's entry in the card catalog.

    We should all keep in mind that Google is becomming the "card catalog" for much of the on-line world. Many would argue that if it doesn't exist on the from page of a Google search, then for most of the world, it just doesn't exist.

    --

    The thing about things we don't know is we often don't know we don't know them.

  50. I think what Page meant... by CashCarSTAR · · Score: 2, Interesting

    Would be more like 20 bucks for access to ALL the world's content. Lexis-Nexis, academic journals, and other subscription only services. Access to all of 'em.

    Would be nice. I'd pay for it in a heartbeat.

  51. Re:Hmm by DrEasy · · Score: 2, Funny

    What does Google have a monopoly on? There are other search engines I can easily use. Maybe you should look up the definition of monopoly?

    But what if Googlepedia has already changed the definition of monopoly? brrr...

    --
    "In our tactical decisions, we are operating contrary to our strategic interest."
  52. Avoiding timeouts by Sindri · · Score: 2, Funny

    I seriously think Google is offering to host Wikipedia because they are tired of seeing their bot timeout when indexing pages from the site.

  53. paranoid and poorly researched by maveric149 · · Score: 3, Informative

    First any offer of hosting by Google or anybody else for that matter will not make the 40 or so servers that the Wikimedia Founation already owns go away or stop the foundation from paying its own hosting costs for those servers. Nor will it stop donations from coming in so the foundation can buy more hardware and bandwidth. And the foundation is *not* going to just rely on any one hosting partner but will instead seek out and act upon multiple offers (this is in fact necessary due to the exponential growth of traffic to the sites it operates; such as Wikipedia.org).

    The most glaring omission Dvorak makes is the simple fact that due to the license Wikipedia uses, that it would be impossible for any one company to control it. If the 'end' were really near, somebody with better intentions could just download the *whole* Wikipedia and host it. But it would never come to that because the foundation would not allow it ; its very mission is to ensure free access to the projects it runs.

    I'm very disappointed in Dvorak.

  54. We need a Wikipedia category by Englabenny · · Score: 2, Interesting

    Yes, the that's right. We have lots of wikipedia stories and more and more coming with this googlepedia story. It's time to create a Wikipedia category so we can use their logo in the stories!

  55. Google = Microsoft-in-Training by Anonymous Coward · · Score: 2, Interesting

    Lest we forget, Google is not in this game as a benevelent dispenser of information...in whatever form. They are a publicly-traded, profit-driven corporation who is on-track to becoming the next 800 lbs. gorilla that everyone regrets dancing with.

    In the end, Google, like Microsoft, will only get in the way of progress and innovation.

  56. Re:Hmm by bunratty · · Score: 2, Insightful
    So then Microsoft doesn't have a monopoly on desktop OSes, as I can easily use others... I'm confused :-P
    Ok, I'll explain. In order to run my son's educational software, I need to use Windows. In order to see my digital pictures and videos from my camera and camcorder, I need to use Windows. In order to take a practice computerized GRE, I need to use Windows. For my wife to do her job, she needs to use Windows. There is no real choice on the desktop -- in many situations, you are forced to use Windows. Microsoft has a 95% or greater share of the OS market on desktop computers.

    Now explain how anyone is forced to use Google instead of Yahoo! or MSN search. Or point me to stats that say even 90% of searches use the Google engine.

    Too often these days people use the term monopoly when they really mean big company I don't like.

    --
    What a fool believes, he sees, no wise man has the power to reason away.
  57. I totally agree (was Re:Total FUD) by Teancum · · Score: 2, Informative

    While I don't speak for the Wikimedia Foundation Board of Trustees, I am a regular follower and poster of the events on the Wikimedia Foundation mailing list where this proposal has taken on a bit of urgancy.

    The main point that needs to be looked at is the fact that Wikipedia has been experiencing some absolutely explosive growth in demand from people both trying to add articles, as well as people simply accessing it, like numerous cross-links to Wikipedia mentioned in various /. articles as well as references in news media. All of this crushing demand to view content (where Wikipedia could produce a slashdot effect on /. itself) is taking up bandwidth that simply requires money just to be able to serve up the content.

    The current proposed budget for maintaining the servers is on the order of $130,000 and all of that comes from voluntary donations of the community. (BTW, please give some $$$ if you are a regular user of Wikipedia).

    Google has quietly given an offer to not only co-locate some Wikimedia servers at their facilities, but also to pay for the servers themselves as part of the general Google server farm.

    From what I've seen, nothing in the proposal is to have Google "take over" the Google content. Just like Google uses data in the Open Directory Project for their google website directory, they are free to use the content of Wikipedia as long as they comply with the terms of the Gnu Free Documentation License.

    This is not a way to "lock up" the content, but rather a way to browse Wikipedia in a way where you can be assured that the bandwidth is available to view the content. Basically, a mirror of the Wikipedia project. This is not even a new idea.

    I would imagine that the fine points of negotiation right now are that links to add content would be folded back into the main-line Wikipedia database. This is just like the Open Directory Project has been doing for a number of years, so the preceedence is definitely there, even for Google. I don't deny that there is a valid business rationale for Google to host Wikipedia, but don't read more into it than is there: Google offering to host Wikipedia content.

    John Dvorak absolutely does not speak for the Wikimedia Foundation, or even as a member of the community in general, and his comments are just to inflame issues from an otherwise uninterested technology journalist just trying to improve the sales of the publications he works for. Having been through similar publicity flare-ups in the past with other "open source" groups, Mr. Dvorak is not showing behavior consistant with even mediocre journalists that would at least contact members of the community he is reporting about. He is just doing raw speculation and that is it.

    This article is disingenuous and I hope that Dvorak gets taken to task for the comments that he has made. I also hope that people like him don't kill the good-faith proposal that frankly the Wikipedia could really use, nor "poison" the water of other potential offers to help out in relieving the crushing bandwidth needs of the Wikipedia and other related projects. It is articles like this that give journalists an awful name and destroy what is left of credibility to their profession.

  58. What really happened that week by Jamesday · · Score: 3, Informative
    You're both a bit right. Here's a highlight view of some of the things happening during that week:
    • New squid cache servers in Paris. After network bandwidth issues there were resolved they speeded up access in bits of Europe. But they also slowed down all page saves because saves also tell the Squids to remove (flush) related pages from their cache and those more distant servers took longer to flush. Can be tens of thousands of flushes to do. Squid flushing/purging is now much faster and longer term it's being modified to be taken completely out of the save loop. This one really hurt save speed for a while, as the developers sorted out what was happening and improved the way purging was done. Net result: all purging of squids is much more efficient and saves remain faster than they used to be. The way pages are delivered to those who aren't logged in was also improved, so style sheets are served from the Squids for them now - that makes page views for those not logged in less sensitive to Apache web server load.
    • Load balancing pain. Load balancing is what chooses which Apache web server gets the next request. The previous system wasn't very even so many requests were getting sent to the most heavily loaded Apaches when they should have been sent to a less loaded one instead. It's been a long-standing problem for us. During the week or so you're talking about we were testing several different replacement load balancing systems to find those which would give a good result:
      • Pen seemed better than running modified Squids on the Apaches but wasn't good enough.
      • Perlbal from the Livejournal people worked very well and gave a nicely even load balance. Brad and Mark from LJ were very helpful in getting it described and set up. We took two Apache web servers to use for this in case they used too much CPU. As it turned out they used only about 10% on each machine. But that left us two Apaches not building pages...
      • Our Squid expert wrote a replacement ICP client to run on the Apaches. That also produced an even load balance so around the end of the week we switched to using it. Freed up those two Apaches to go back to page building duty. So, until we need its other features, Perlbal isn't in use - not quite the best solution for us today (but may be in the future).

      So, we left that week with much improved load balancing for the Apaches. Much more consistent page load times now.

    • Bugs in MediaWiki 1.4 beta left the Apaches filling their available child slots. That combined with some specific web crawlers could sometimes let the crawlers take the site down by leaving no or very few free children to handle requests. Also increased Apache load a bit. The most important ones have been fixed but there's still an occasional stuck child. To deal with that we have a script restarting one Apache web server every 5 or so minutes, to ensure that it can't rise to a troublesome level.
    • The crawler/Apache child problem and a too high setting for maximum children would let the Apache server on some Memcached machines take so much RAM that the system swapped. Very bad news because that caused Memcached to respond very slowly - far too slowly, so all Apaches filled all child slots and the site appeared dead. That's been dealt with now. Not a Memcached issue as such - just the usual don't let the box swap to death situation. While tracking this down assorted other Memcached-related things were improved.
    • As a temporary workaround, the two Memcached machines we were using at that time had Apache stopped on them. That left us four Apaches short for a few days. That took us beyond the critical Apache CPU shortage point and Apache load and wait times rose significantly. Better than a dead site but not at all good. Response time drop with loss of machines isn't linear beyond a certain point and four more Apaches out of service took us beyond that point.
    • Also during that week the handling of database updates was substantially improved, so lock waits are mostly gone, wh