Slashdot Mirror


How Google Saved USENET

Masem writes: "Salon has a well-written article on the recent revival of much of the USENET archives from '81 to '90 by Google. It mentions that much of the recovery was thanks to years of work in transferring data off 140-some 10" magnetic tapes (~120megs of data) to a more conventional format in order to recover much of the early posts. Even a reference to the previous Slashdot story is made." Update: 01/07 23:52 GMT by T : btempleton adds: "O'Reilly Network asked me to do an article on similar themes and remembrances of USENET history." Thanks, Brad.

19 of 280 comments

  1. Oooh 10" magnetic tapes! by TheLocustNMI · · Score: 5, Informative
    Having had to work with those bastards, I'd have to give extra kudos to Google! There are few places in the United States that can actually read them, and get you the data from them anymore, and they must've been lovingly cared for, with some of them being 20 years old!

    I think I speak for everyone when I say "Thank you Google for arming me with the information contained in old USENET posts to bring up embarrassing teenage posts to my friends!"

    1. Re:Oooh 10" magnetic tapes! by RadioheadKid · · Score: 5, Informative

      Actually, Google didn't do much, if any, of the magnetic tape work. It was Bruce Jones, a grad student, who transferred 107 tapes in two weeks, and then David Wiseman did the rest over the next ten years. Google just downloaded them from him...

      --
      "Karma can only be portioned out by the cosmos." -Homer Simpson
  2. Didn't search USENET as much before Google. by reaper20 · · Score: 5, Insightful

    Google Groups is awesome, especially when searching for some obscure piece of hardware advice or settings.

    I don't have to worry about getting and setting up a news client, and it's just one tab over from my default search engine.

    Google did save USENET for me - though I never post, searching through all the linux and comp newsgroups is usually faster than looking up a HOWTO.

  3. groups.google.com always has the answers... by ThomasMis · · Score: 5, Insightful

    As a software developer, no matter what problem I run into, somebody else has already run into that problem and has asked my question and received an answer on groups.google.com. Whenever I get stuck on anything at all, it's the first place I run to. groups.google.com is the single most useful site you can point your browser (konqueror!!!) towards. I'm not sure how they make money over there at google, but what a great service they are providing!

    --
    Check out my podcast: DreamStation.cc Video Game Show
    1. Re:groups.google.com always has the answers... by aussersterne · · Score: 5, Insightful

      This is absolutely true. I am often asked "What book(s) can I buy, to learn what you've just told me? How do I gain the knowledge in [subject X] that you have? I don't care if it takes me a decade, I just want to learn it, but I can't seem to find out where. Is it written down?"

      I tell them: it is a decade's worth of learning, and then some, but not from books. It is all from USENET. I became a competent C programmer who writes more efficient code and makes fewer fundamental mistakes thanks to usenet. I learned to use BSD and then to use Linux as fast and furious as I can type and to get myself out of any system problem, save my data from nearly any corruption thanks to usenet. I am able to network these odd things, build these robots, and have this "cool stuff" that you like so much that works so well thanks to usenet. I can make nearly any computer go, no matter how old or weird or what media or operating system it uses (a feat which makes you a legend in your own department) thanks to usenet.

      It's not my knowledge... I humbly picked it up in the mid and late '80s and early '90s and still constantly refer to it, first through Deja and now through Google. It is our knowledge, collective and stretching backward in time. To ever lose the news archive would be a tragedy -- the amount of searchable data on everything from chemistry and biology to computing and electronics to literature and politics is truly stunning. With the news archive, you can learn to hotwire together any two things so long as they have *wires* to do something useful; you can learn to brew just about anything including some of the best beer ever; you can learn just what the HELL James Joyce is talking about at times in Ulysses. Every question has been answered before you even asked.

      The only sad thing has been the degree to which the groups have been turned into a boulevard of endlessly flashing neon porn signs in the last few years, almost to the degree that anything else is drowned out by the brightness.

      Study USENET. Use USENET. Live and learn. Amen.

      --
      STOP . AMERICA . NOW
  4. Archaic Technology by irregular_hero · · Score: 5, Interesting
    The article isn't kidding about the difficulty of finding a reader for your typical nine-track tape these days. I spent lots of bucks on a SCSI nine-track a few years ago for archiving system and application software on nine-track from old computer systems. And although the purchase helped, there are still occasions when I have to fire up some very old Big Iron to read one tape or another.

    An interesting thing about these tapes: They stretch over time and can sometimes become unreadable because of that. There are times when, to extract the information on the tape, I would put a number of them in my freezer for an hour or so, then try again. Nine times out of ten that would actually work.

    Another note about the article: I can still remember discussions with others who had modems about 1200 baud being just "too fast". The reasoning was that the average person couldn't read much faster than 300 baud. :)

  5. Repeat I know, but a great read by C.+Mattix · · Score: 5, Interesting

    I know this is a repeat but this is a great read. Dr. Gene Spafford's farewell posting. If you don't know who that is, look it up.

    ===
    From: spaf@cs.purdue.edu
    Newsgroups: news.announce.newusers,news.misc,news.admin.misc,news.groups,soc.net-people
    Subject: That's all, folks
    Followup-To: poster
    Date: 29 Apr 1993 19:01:12 -0500
    Message-ID:

    [ I originally was going to post nothing on this topic. I'm burned
    out, and I don't want my fatigue to appear like I'm posting
    self-indulgent garbage. However, several people have argued with
    me, and convinced me that maybe I should make a statement to "end an
    era," and as a piece of net "history." At the least, even if it is
    perceived as self-indulgent garbage, it will fit right in with the
    rest of the net. ]

    There is a Zen adage about how anything one cannot bear to give up is
    not owned, but is in fact the owner. What follows relates how I am
    owned by one less thing....

    About a dozen years ago, when I was still a grad student at Georgia
    Tech, we got our first Usenet connection (to allegra, then being run
    by Peter Honeyman, I believe). I'd been using a few dial-in BBS
    systems for a while, so it wasn't a huge transition for me. I quickly
    got "hooked": I can claim to be someone who once read every newsgroup
    on Usenet for weeks at a time!

    After several months, I realized that it was difficult for a newcomer
    to tell what newsgroups were available and what they covered. I made
    a pass at putting together some information, combined it with a
    similar list compiled by another netter, and began posting it for
    others to use. Eventually, the list was joined by other documents
    describing net history and information.

    In April of 1982 (I believe it was -- I saved no record of the year,
    but I know it was April), I began posting those lists regularly,
    sometimes weekly, sometimes monthly; the longest break was for 4
    months a few years ago when I was recovering from pneumonia and poor
    personal time management. (Tellingly, only a few people noticed the
    lack of postings, and almost all the mail was "When will they come
    out?" rather than "Did something happen?") As time went on, people
    began to attach far more significance to the posts than I really
    intended. It was flattering for a very short time, and a burden for
    most of the rest; there is no telling how much time I have devoted
    over the last decade to answering questions, editing the postings, and
    debating the role of newsgroup naming, to cite a few topics. I really
    tired of being a "semi-definitive" voice.

    Starting several years ago, at about the time people started pushing
    for group names designed to offend or annoy others, or with a lack of
    concern about the possible effects it might have on the net as a whole
    (e.g., rec.drugs and comp.protocols.tcp-ip.eniac) I began to question
    why I was doing the postings. I have had a growing sense of futility:
    people on the net can't possibly find the postings useful, because
    most of the advice in them is completely ignored. People don't seem
    to think before posting, they are purposely rude, they blatantly
    violate copyrights, they crosspost everywhere, use 20 line signature
    files, and do basically every other thing the postings (and common
    sense and common courtesy) advise not to. Regularly, there are postings
    of questions that can be answered by the newusers articles, clearly
    indicating that they aren't being read. "Sendsys" bombs and forgeries
    abound. People rail about their "rights" without understanding that
    every right carries responsibilities that need to be observed too, not
    least of which is to respect others' rights as you would have them
    respect your own. Reason, etiquette, accountability, and compromise
    are strangers in far too many newsgroups these days.

    I have finally concluded that my view of how things should be is too
    far out-of-step with the users of the Usenet, and that my efforts are
    not valued by enough people for me to invest any more of my energy in
    the process. I am tired of the effort involved, and the meager --
    nay, nonexistent -- return on my volunteer efforts.

    This hasn't happened all at once, but it has happened. Rather than
    bemoan it, I am acting on it: the set of "periodic postings" posted
    earlier this week was my last. After 11 years, I'm hanging it up.
    David Lawrence and Mark Moraes have generously (naively?) agreed to
    take over the postings, for whatever good they may still do. David
    will do the checkgroups, and lists of newsgroups and moderators
    (news.lists), and Mark will handle the other informational postings
    (news.announce.newusers).

    I'm not predicting the death of the Usenet -- it will continue without
    me, with nary a hiccup, and six months from now most users will have
    forgotten that I did the postings...those few who even know now, that
    is. That is as it should be, I suspect. Nor am I leaving the
    Usenet entirely. There are still a half-dozen groups that I read
    sometimes (a few moderated and comp.* groups), and I will continue to
    read them. That's about it, though. I've gone from reading all the
    groups to reading less than ten. Funny, though, the total volume of
    what I read has stayed almost constant over the years. :-)

    My sincere thanks to everyone who has ever said a "thank you" or
    contributed a suggestion for the postings. You few kept me going at
    this longer than most sane people would consider wise. Please lend
    your support to Mark and David if you believe their efforts are
    valuable. Eventually they too will burn out, just as the Usenet has
    consumed nearly everyone who has made significant contributions to its
    history, but you can help make their burden seem worthwhile in
    between.

    In closing, I'd like to repost my 3 axioms of Usenet. I originally
    posted these in 1987 and 1988. In my opinion as a semi-pro
    curmudgeon, I think they've aged well:

    Axiom #1:
    "The Usenet is not the real world. The Usenet usually does not even
    resemble the real world."
    Corollary #1:
    "Attempts to change the real world by altering the structure
    of the Usenet is an attempt to work sympathetic magic -- electronic
    voodoo."
    Corollary #2:
    "Arguing about the significance of newsgroup names and their
    relation to the way people really think is equivalent to arguing
    whether it is better to read tea leaves or chicken entrails to
    divine the future."

    Axiom #2:
    "Ability to type on a computer terminal is no guarantee of sanity,
    intelligence, or common sense."
    Corollary #3:
    "An infinite number of monkeys at an infinite number of keyboards
    could produce something like Usenet."
    Corollary #4:
    "They could do a better job of it."

    Axiom #3:
    "Sturgeon's Law (90% of everything is crap) applies to Usenet."
    Corollary #5:
    "In an unmoderated newsgroup, no one can agree on what constitutes
    the 10%."
    Corollary #6:
    "Nothing guarantees that the 10% isn't crap, too."

    Which of course ties in to the recent:

    "Usenet is like a herd of performing elephants with diarrhea --
    massive, difficult to redirect, awe-inspiring, entertaining, and a
    source of mind-boggling amounts of excrement when you least expect
    it." --spaf (1992)

    "Don't sweat it -- it's not real life. It's only ones and zeroes."
    -- spaf (1988?)

    --
    Gene Spafford, COAST Project Director
    Software Engineering Research Center & Dept. of Computer Sciences
    Purdue University, W. Lafayette IN 47907-1398
    Internet: spaf@cs.purdue.edu phone: (317) 494-7825
    ===

  6. We don't compare. by Henry+V+.009 · · Score: 5, Funny

    Ye Gods!

    The modern slashdot nerd trembles in the presence of those ancient USENET nerds of old.

    A 300 pound slashdot weakling is easily flung aside by the 500 pound USENET god. Who at slashdot keeps taped archives of every post for the nerds of future generations? Truly those were nerds.

  7. i dont know how i feel about this by Anonymous Coward · · Score: 5, Funny

    i'm a tad concerned about the posts i made in the early 90's when i was an asshole know it all teenager coming back to haunt me... i wish google never uncovered those... i cringe when i read them now...

    1. Re:i dont know how i feel about this by ktakki · · Score: 5, Funny

      John Walker Lindh? Is that you?

      k.

      --
      "In spite of everything, I still believe that people are really good at heart." - Anne Frank
  8. My father was a Computer Scientist by sinserve · · Score: 5, Funny

    In a major university, and I decided to honor his
    soul and follow his foot steps.
    And now, thanks to google, I find myself battling
    the flame wars he started.

    Better go back and do him and VI an honor .. alt.emacs, here I come.

  9. Save the posts by Kefaa · · Score: 5, Insightful

    I am sorry they will allow requestors to delete their own postings. While we might wish it otherwise, 10, 20, 50 years later, this may be the real historical value. To purge, seems the equivalent of having a letter to the editor removed from newspaper archives.

    To those who feel like "they are walking around with their baby picture stapled to their forehead", we all mature. What I thought at 20, 30, and 40 shows how I grew. What other archive in human history can provide the transitional opinions, discussions, and outright imbecilic flame wars?

    While we would hate to have someone pull out our post in support of the flat earth theory, to act as though we all believed the earth was round is rewriting history. Convenient for us, but misleading to the future.

    The question now becomes, what happens after Google and Slashdot, when the archive is terabytes large? Will it take 100 years for the next conversion?

  10. Me, too!!! by ideut · · Score: 5, Funny

    The first "me too" post isn't until two years into the archive. I suppose that says something about the intelligence of the usenet demographic back then.

  11. The One Engine by Saint+Aardvark · · Score: 5, Funny

    Three tapes for rec.singles desperate
    Seven for alt.swedish.chef.bork.bork.bork
    Nine for comp.sci compiling late
    One for Google's engine dark
    In their Linux cluster where the shadows lie.
    One engine to search them all, one engine to bind them
    One engine to index them all and in the darkness find them
    In Google's cluster where the shadows lie.

  12. The next story by ortholattice · · Score: 5, Funny
    Since Salon's revenue is based on page hits, the next story will be:

    How Slashdot Saved Salon

  13. Re:I've really got to wonder... by krogoth · · Score: 5, Interesting

    I think google should be paid just for being so damn cool. They deserve spontaneous income for things like the groups (with the history they now have), having a '1337-h4x0r' language you can use (http://www.google.com/intl/xx-hacker/), changing their banner for special days (anyone else see the christmas thing?)...

    There are a lot of companies right now that should be punished for doing stupid things, but Google is the complete opposite; I'd like to see Microsoft, the RIAA, and the MPAA have to donate 20% of their money to google :)

    --

    They that quote Benjamin Franklin on liberty and safety deserve neither.
  14. Re:Just think... by ideut · · Score: 5, Informative
    Reading your first link, it's amusing to see that even ten years ago there were a lot of ridiculous IP shenanigans. Such as

    "Ashton-Tate is once again pushing its case for a copyright on the programming language used in DBase. ".

    And the numerous silly patents, such as

    'Emacs is threatened by IBM patent number 4,674,040 which covers "cut and paste between files" in a text editor. Many Emacs features are threatened by patent number 4,458,311, which covers "text and numeric processing on same screen." Patent 4,398,249 covering the general spreadsheet technique known as "natural order recalc" stops us from using it in GNU '

  15. Message forums (Slash) are killing off Google. by BrookHarty · · Score: 5, Insightful

    A lot of the good gurus are moving over to Slash-run message forums. Talking to a guy who is a perl guru, he has moved most of his perl help requests from usenet to Perl Mongers. I've been seeing this trend in the last few years, as independent subjects are moving over to website-based forums. I even spend more time reading 5 mailing lists and a dozen message forums, and don't touch usenet anymore.

    With these message forums and mailing lists not linked to a usenet group, there is a lot of wasted knowledge that is not shared. I would love to see a slash-mod or some type of mailing list enhancement that posts an overview or some kind of daily message post to usenet.

    The whole idea of usenet was knowledge sharing, not binaries and spam ads. Glad google has saved usenet, but some effort is needed to start using it again.

    Hmm, maybe Slashdot should enhance a usenet forum? Though 5-20,000 postings a day on a usenet group might be a little much. Maybe only 2+ posts make a moderated usenet group.

  16. The Usenet archive is not saved yet by osswid · · Score: 5, Insightful
    Google is a private startup. They might still go out of business, or be bought by someone. Even if they have a successful IPO, these could still happen later.

    What happens to the archive when they're bought by someone else, or end up in bankruptcy court? Will it go the way of the online digital photo storing sites, vanishing one day without a trace, taking irreplaceable data -- data of immense academic historical interest -- with it?

    Google should promise to donate the archive to the Library of Congress, do the transfer now, and make a social contract with the net community to turn over the reins on this project if they're acquired or go out of business.