Slashdot Mirror


All of Gopherspace Available For Download

An anonymous reader writes "Cory Doctorow tells us that '[i]n 2007, John Goerzen scraped every gopher site he could find (gopher was a menu-driven text-only precursor to the Web; I got my first online gig programming gopher sites). He saved 780,000 documents, totalling 40GB. Today, most of this is offline, so he's making the entire archive available as a .torrent file; the compressed data is only 15GB. Wanna host the entire history of a medium? Here's your chance!' Get yourself a piece of pre-Internet history (torrent)." Update: 04/30 00:16 GMT by T: As several readers have pointed out below, our anonymous friend probably meant to say "pre-Web," rather than "pre-Internet."

54 of 200 comments (clear)

  1. Shame on Slashdot by suso · · Score: 4, Insightful

    Here's your chance!' Get yourself a piece of pre-Internet history

    I think Jon Postel is rolling in his grave right now.

    1. Re:Shame on Slashdot by commodore64_love · · Score: 4, Informative

      Beat me to it. The summary should read "Get yourself a piece of pre-world wide web history," since gopher came AFTER the birth of the internet (1981) but before the widespread usage of the web (circa 1993).

      --
      "I disapprove of what you say, but I will defend to the death your right to say it." - historian Evelyn Beatrice Hall
    2. Re:Shame on Slashdot by Daniel+Dvorkin · · Score: 2, Insightful

      Seriously. The story submitter was anonymous (probably a good thing!) but I'm really shocked that any Slashdot editor could let that line go through without comment. And spare me the "you must be new here" line -- I know perfectly well that /. editing standards can get pretty sloppy, but this is particularly egregious. Calling Gopher "pre-internet" is the kind of crap I'd expect on a mainstream news site, not from "News for Nerds."

      --
      The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
    3. Re:Shame on Slashdot by suso · · Score: 2, Funny

      since gopher came AFTER the birth of the internet (1981) but before the widespread usage of the web (circa 1993).

      I hope you don't mean the birth of the Internet was in 1981. Or maybe you typoed 1991 (when Wikipedia says gopher was released)? I thought gopher was actually a bit older than that.

      I just wish people would stop holding onto FTP like they were Charlton Heston.

    4. Re:Shame on Slashdot by mobby_6kl · · Score: 2, Insightful

      Really? You are shocked that a slashdot editor doesn't check and correct the stories he posts? You must be new here.

    5. Re:Shame on Slashdot by yotto · · Score: 3, Insightful

      Really? You think that someone asking if you're new here means they think you're new here?

      Are you new here?

    6. Re:Shame on Slashdot by whoop · · Score: 3, Informative

      I've been around a while, and I can't think of any time a Slashdot editor fact-checked, spell-checked, or proofread a submission. Look at it, they put the entire thing into a quote. That way they can just say they're quoting the submitter and that's what he said.

      They might add the "UserXXX writes," part themselves, but a couple characters of perl could probably do that part just as well.

    7. Re:Shame on Slashdot by Grant_Watson · · Score: 2, Informative

      This definition is probably looser than most, but here's a quick and dirty view:

      The Web is a huge collection of interlinked documents addressable by URLs and served with HTTP. The Internet is the world-wide TCP/IP network over which the Web and many other services operate.

    8. Re:Shame on Slashdot by pizza_milkshake · · Score: 3, Informative

      Here's my explanation in graphic form: http://parseerror.com/images/explain/internet-vs-web.jpg

    9. Re:Shame on Slashdot by suso · · Score: 2, Informative

      Um, the generally accepted start of the Internet is by activities surrounding the start of ARPANET in the late 1960s. ARPA in its name still lives on as part of reverse DNS entries. Some people say it started in 1967, some say 1969, either way, it was much earlier than 1981 and there are a lot more protocols that are part of what we call "the Internet" than just TCP/IP, although of course not all of it is routed globally. Check your /etc/protocols file sometime, the first line says Internet (IP) protocols.

    10. Re:Shame on Slashdot by Gorobei · · Score: 3, Insightful

      I have an EE degree. What's a good 2nd degree? CMP ENG or Comp Sci? I want to be eligible to apply for more jobs.

      You are eligible to apply for all jobs now. The trick is actually getting one.

      Second degrees are a net loss in the market. One degree means you are of at least average intelligence and can show up on time when it counts. Two degrees means pretty much the same thing.

    11. Re:Shame on Slashdot by nacturation · · Score: 2, Informative

      History of the Internet from 1957 to present:

      http://vimeo.com/2696386?pg=embed&sec=2696386

      Quite educational, even if you think you know all about it.

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    12. Re:Shame on Slashdot by nacturation · · Score: 2, Informative

      Having just watched it again, it may not fully answer your question. With what you learned from the video in mind, the OSI model is the layers the video talked about. There are seven layers altogether, with the lowermost layer being the physical hardware everything runs on, followed by the network connecting the hardware, then how data is passed over the network, and so on until you get to the application layer. You've heard of TCP/IP? That's TCP (layer 4) running on top of an IP (layer 3) network. ICMP is a different network which is what things like 'ping' (ICMP echo) and 'traceroute' run over. You've heard of UDP? That's another layer 4 protocol different from TCP.

      What runs on the application layer is things you're already familiar with. SMTP (email), telnet, FTP, DNS, NTP (network time protocol), and so on including HTTP. HTTP is effectively the web -- it's what a world wide web browser ("web browser", or now just "browser" for short) uses as its primary protocol and why you see URLs starting with http: . So HTTP or "the web" is an application that runs on top of everything below it. You still need the physical hardware, the network connecting the hardware, the various transmission protocols and so on to deliver the data used by the web. Similarly, SMTP or commonly just "email" is an application that runs on top of everything below it.

      Think of the acronyms if that will help you understand it better. SMTP is Simple Mail Transfer Protocol, a protocol for transferring simple mail. HTTP is HyperText Transfer Protocol, a protocol for transferring hypertext. FTP is File Transfer Protocol, a protocol for transferring files. NNTP is Network News Transfer Protocol, a protocol for transferring network news, what you've likely heard of as simply Usenet or "newsgroups". You get the idea.

      That's the simplistic view of things. In reality, HTTP has been extended to transfer more than just hypertext. Through the use of MIME types (image/gif, image/jpeg, text/html, text/xml, image/binary, and so on) you can transfer arbitrary things that browsers and other applications can understand.

      Hopefully that makes a bit more sense.

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    13. Re:Shame on Slashdot by zmollusc · · Score: 2, Funny

      Where do the tubes come in? Are they buried under the superhighway?

      --
      They whose government reduces their essential liberties for temporary security, receive neither liberty nor security.
    14. Re:Shame on Slashdot by jeremyp · · Score: 2, Informative

      Actually, the Internet is the world-wide IP network. TCP is just one of many protocols that are used to transmit information across it.

      --
      All I want is a secure system where it's easy to do anything I want. Is that too much to ask ~~ Randall Munroe
  2. Far cry from "all of gopherspace" by damn_registrars · · Score: 4, Insightful

    This was just all that was available in 2007. Had he done the same in 1997 it would have been quite a bit different - I'd suspect it would have been quite a bit larger then as well.

    --
    Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
    1. Re:Far cry from "all of gopherspace" by migla · · Score: 2, Informative

      "Do you have any facts or figures underpinning your statements ?"

      That would indeed be interesting, but GP makes a reasonable assumption, akin to "There were more horse carriages out and about before the automobile." No?

      --
      Some of my favourite people are from th US; Vonnegut, Chomsky, Bill Hicks.
    2. Re:Far cry from "all of gopherspace" by rtaylor · · Score: 4, Interesting

      On a regular basis? Yes. Than exist in barns today for special occasions or limited use, possibly not.

      It has been indicated that more people know how to properly shoe a horse today than in the late 1800's. Lower percentage of the population, and not something they do every-day, but a larger total number of people.

      I wouldn't be surprised if the total number of documents on Gopher continued to climb despite the percentage of content on Gopher decreasing rapidly. The cost to host has rapidly decreased and amount of content in general has increased significantly that the total number of items could still be higher today than in the 90's.

      --
      Rod Taylor
    3. Re:Far cry from "all of gopherspace" by Obfuscant · · Score: 5, Informative
      Do you have any facts or figures underpinning your statements ?

      Yes.

      In 1997 we had a 100Gb disk array holding the research data from our lab, all of which was available via gopher (and ftp, and the web). We moved to a 200Gb array shortly after, and then a 400Gb after that. And then 3Tb, around 2008.

      Sometime around 2007 or 2008 the SunOS system that ran the gopher server died permanently and was replaced by a virtual linux server without gopher. Even without that server, I found not long ago that I was still creating .cap files -- which were gopher, as I recall, but maybe archie.

      Quantitatively, online currently I have more than 15Gb of data for just 1997, all of which was gophered at the time. In 1998, another 18Gb.

      So, I would say, had the gopher scraping been done in 1997 instead of 2007, the result would have been a lot more data. In fact, a few months earlier in 2007 and it might have BEEN a lot bigger.

    4. Re:Far cry from "all of gopherspace" by omglolbah · · Score: 3, Funny

      Because as most users of the internet he wasnt accurate about the unit.

      From the context one can assume (without that big a risk of error) that he is indeed speaking about gigabyte, and not gigabit.

      Buuut, anal responses are more important than content. We know this :-p

    5. Re:Far cry from "all of gopherspace" by badboy_tw2002 · · Score: 4, Funny

      Stop making fun of him, he's not Turing tested yet. In a few years, he'll start noticing some changes and then he'll grow up to be a big boy AI that can interact with the rest of us.

  3. The Ultimate Lesson in Open Source and Standards by eldavojohn · · Score: 5, Interesting

    In a bizarre case of ineptitude, my alma mater (due to financial problems or something) announced they would charge licensing fees for the use of its implementation of the Gopher server in February of 1993. This caused people to worry that eventually the standard and protocol itself would also be licensed. It did have other technical flaws but I think a lot of people thought Gopher could have become the internet had Beners-Lee not released a free for public use implementation of the hypertext concept.

    That move by the U of MN is a great lesson in how licensing can kill innovation. Standards should always be open and guaranteed open.

    --
    My work here is dung.
  4. Pre-internet history? by nebaz · · Score: 4, Insightful

    The web is NOT the internet. (Though sadly it essentially has become so, nowadays.)

    --
    Rhymes that keep their secrets will unfold behind the clouds.There upon the rainbow is the answer to a neverending story
    1. Re:Pre-internet history? by brainboyz · · Score: 2, Insightful

      Because the internet is not restricted to what you can do on a handful of ports with little more than a handful of protocols. That so many technical professionals limit themselves to the "web" tends to restrict creativity.

    2. Re:Pre-internet history? by Anonymous Coward · · Score: 3, Insightful

      The web is NOT the internet. (Though sadly it essentially has become so, nowadays.)

      Hardly. Most traffic is bittorrent and email (mostly spam).

    3. Re:Pre-internet history? by calmofthestorm · · Score: 2, Funny

      They teach us the difference and why it no longer matters;P

      --
      93rd rule of Slashdot: No matter how obvious my sarcasm is, my comment will be taken seriously by someone.
    4. Re:Pre-internet history? by SavvyPlayer · · Score: 2

      All of which (except SSH) typically run on port 80 -- we routinely teach our firewalls the difference, and that it no longer matters...

  5. Gopher isn't dead. by Anonymous Coward · · Score: 4, Interesting

    http://www.tekeeze.com/geeky/7-fun-sites-you-can-only-find-on-the-gopher-internet/

    Includes things like Twitpher (which might not be working right now) Twitter for Gopher.

    Firefox (others?) supports gopher://

  6. Gopher by MrEricSir · · Score: 5, Funny

    So does this mean we're getting 6 more weeks of winter or not?

    --
    There's no -1 for "I don't get it."
    1. Re:Gopher by forkazoo · · Score: 4, Insightful

      So does this mean we're getting 6 more weeks of winter or not?

      No, just another ten years of November.

    2. Re:Gopher by Anonymous Coward · · Score: 5, Informative

      So does this mean we're getting 6 more weeks of winter or not?

      No, just another ten years of November.

      I believe you mean September.

  7. Re:Wrong by gyrogeerloose · · Score: 3, Informative

    To a lot of people, WWW=Internet. Us old greybeards who remember when the Internet was telnet, FTP, e-mail and Usenet know better.

    --
    This ain't rocket surgery.
  8. Re:Interesting by commodore64_love · · Score: 3, Interesting

    I thought Gopher was okay, but not near as exciting as my first exposure to Amiga Mosaic web browser. After all, it had teh 4000-color pron. ;-) Plus exciting sites like this one: http://web.archive.org/web/19961114151757/http://scifi.com/ - I mean how cool is that? It's animated and colorful. :-)

    Aside -

    Looking at that schedule reminds me how cool Sci-Fi Channel used to be. Weekend Anime. Inside Space reports. Sci-Fi Trader. Sci-Fi Buzz. FTL Newsfeed. It was like a geek paradise for fandom. Today's channel is more akin to watching the TNT channel - ordinary and nothing special.

    --
    "I disapprove of what you say, but I will defend to the death your right to say it." - historian Evelyn Beatrice Hall
  9. Re:The Ultimate Lesson in Open Source and Standard by Hatta · · Score: 2, Informative

    That's more the fault of the clients than the protocol. There's no reason you can't serve hypertext documents over gopher, and no reason a gopher client couldn't display graphics.

    --
    Give me Classic Slashdot or give me death!
  10. Re:Wrong by uglyduckling · · Score: 2, Informative

    IIRC Usenet wasn't a network so much as local repositories which synced. Your local Usenet server would sync up with other peer servers on a schedule, I suppose a bit like a massive distributed email system. Some Usenet sites weren't strictly Internet connected, but many used the Internet as the means to communicate with peer servers.

  11. Re:The Ultimate Lesson in Open Source and Standard by cwgmpls · · Score: 3, Informative

    In gopher, everthing is either a link or text. There is no way to embed a link into a body of text -- what is now called "hypertext".

  12. Index anyone? by avm · · Score: 2, Interesting

    Is there a plaintext index of URLs this archive includes anywhere? I'm connected via 3G and pulling a 15gig torrent isn't feasible. I'd love to wander thru some of my personal archived bookmark lists and such just to see if any of them wound up being preserved.

  13. Re:Wrong by gandhi_2 · · Score: 2, Funny

    I remember when fingering the gopher was totally normal.

  14. Re:Wrong by elfprince13 · · Score: 2, Informative

    UUCP was the original method used for Usenet transfer, and was distinct from the Internet, but it was hooked up to the Internet at various locations to make contact with servers outside the local UUCP network. This was an era when email (transferred via UUCP) could take longer than snail mail to make it to its intended user (and the addresses were more like a full trip-map than just an address)

  15. Re:The Ultimate Lesson in Open Source and Standard by Hatta · · Score: 4, Insightful

    There's no markup for hypertext in HTTP either.

    --
    Give me Classic Slashdot or give me death!
  16. Re:Anyone remember ARCHIE servers? by mtippett · · Score: 3, Funny

    Sure, back when people knew the IPs of their local archie and simtel archives.

    Those where the days...

  17. Re:Wrong by Rene+S.+Hollan · · Score: 3, Informative
    Usenet carried posts and articles in newsgroups. Synchronization took place via abstracted mechanisms, most commonly uucp over serial modem links.

    So, yes, Usenet preceded the Internet in the sense that it did not rely in IP, though both generally evolved around the same time.

    But, there was a rather vibrant pre-WWW internet where the protocols of choice were smtp (mail), ftp (file transfer), and gopher and archie for repositories of places to find stuff. News could be carried via nntp (net news transfer protocol).

    What some may not know was that sendmail could work over transiently connected points as well, rather like usenet. Anyone still remember bang path notation? One would address mail using the sequence of hosts required to get it from one's own to the destination, using names understood by each successive host in the sequence. One of the reasons sendmail configuration files were so horrendous was to permit relaying between networks using different host naming conventions.

    --
    In Liberty, Rene
  18. Re:Copyright by Anonymous Coward · · Score: 2, Insightful

    Does anyone give a frak?

  19. Re:The Ultimate Lesson in Open Source and Standard by blincoln · · Score: 2, Funny

    I prefer to believe that Gopher failed because the world wasn't ready for the awe-inspiring virtual reality experience that was TurboGopher VR.

    --
    "...always new atoms but always doing the same dance, remembering what the dance was yesterday." -Richard Feynman
  20. Re:Wrong by commodore64_love · · Score: 2, Informative

    Ahhh the good old days.

    You post a question on rec.arts.tv like, "When does the new season of TNG start?", wait for the midnight syncing between your local BBS and the rest of the nation, and then you come back tomorrow morning to learn the answer. If you're lucky. Sometimes you had to wait 2 days for a reply.

    --
    "I disapprove of what you say, but I will defend to the death your right to say it." - historian Evelyn Beatrice Hall
  21. Re:Wrong by commodore64_love · · Score: 2, Insightful

    >>>I always accessed my Usenet groups via the Internet anyway

    I used a 1 kbit/s modem (yes very slow). My messages are still archived on google groups, and I wish there was a way to erase them, because it's somewhat embarrassing to read posts from your teenage self 25 years ago (especially the typos). ;-)

    --
    "I disapprove of what you say, but I will defend to the death your right to say it." - historian Evelyn Beatrice Hall
  22. Gopher lives! by John+Hasler · · Score: 4, Informative

    ...gopher was a menu-driven text-only precursor to the Web...

    What do you mean, "was"? Gopher still works fine. There are dozens of servers out there. See quux.org or just install your Linux distribution's gopher package and fire it up.

    --
    Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
    1. Re:Gopher lives! by daveime · · Score: 3, Funny

      Dozens of servers ought to be enough for anybody.

  23. Re:What a terrible Marketting line by McGiraf · · Score: 2, Funny

    new furniture?

  24. Re:The Ultimate Lesson in Open Source and Standard by nxtw · · Score: 4, Informative

    There's no markup for hypertext in HTTP either.

    The original pre-RFC HTTP states that a response is an HTML message.

  25. Re:Interesting by cyclomedia · · Score: 3, Interesting

    Ever since the RSS vs ATOM war peaked (and fizzled) i've been waiting for a re-gopherisation of the internet, where files, videos, music, audio and pictures are all linked and indexxed by interconnected RSS feeds that dont include all the crud you have to wade through in web pages to get anywhere. Something akin to MRSS with png thumbnails and optional links to "buy the dvd box set now" where you could create your own Channels (feeds full of links to shows directly, or other feeds) and then browsable from your telly directly, i think i'm rambling

    --
    If you don't risk failure you don't risk success.
  26. Re:The Ultimate Lesson in Open Source and Standard by Trepidity · · Score: 2, Informative

    To be somewhat more accurate, it's not "now" called hypertext: it was called hypertext before gopher even existed. Gopher was first released in 1991, while Ted Nelson coined "hypertext" in 1965, and there were dozens of implementations before the WWW (the most popular outside academia was probably Apple's HyperCard, released in 1987).

  27. Re:Wrong by Alioth · · Score: 5, Interesting

    A funny thing happened to me a while back.

    I was trying to build Nethack for a server, and it was failing linking on some missing curses library. So I did a google search to try to find out which library I was missing so I could find which -dev package I needed to install to get this library.

    The first Google search result was... ...a post by *me* asking *exactly* the same question ("Which lib do I need") almost 15 years earlier on one of the linux newsgroups!

  28. Re:The Ultimate Lesson in Open Source and Standard by kaszeta · · Score: 2, Interesting

    Yeah, I was going to make a similar comment, since I was a sysadmin at the University of Minnesota during the later Gopher years (what I call "the pathetic self-pity era".) Highlights included being required to run a Gopher server (since until late '97 all official department content was supposed to made available on Gopher as well as HTTP, we had this bastard of a server called GN that would serve the same content to both), suffering through the "Gopher World Tour", listening to several of the Gopher team carp about how this WWW thing was overrated and people would come back to Gopher, etc. I think the best is when, in 1996, someone from the computer lab told me I shouldn't be telling my users about Netscape, I should be showing them TurboGopher VR.