Slashdot Mirror


The Linux Kernel Archives

Jeremy Andrews writes "KernelTrap offers an interesting look at the history behind the Linux Kernel Archives, home of the Linux kernel. They start from the beginning in 1997, when kernel.org ran on a generic "white box PC" using a shared T1, to the present where it runs on multiple quad Opterons each with 24 gigabytes of RAM, 10 terabytes of disk space, and a gigabit link to the internet. Much of the article is based on an interview with Peter Anvin, also including quotes from Linus Torvalds, Paul Vixie of Internet Systems Consortium, Inc who donates the bandwidth, and Matt Taggart of Hewlett-Packard who donated the hardware."

96 of 154 comments (clear)

  1. Hmm... by Anonymous Coward · · Score: 2, Funny

    Boy what I could do with that and BitTorrent... *rubs palms together*

    1. Re:Hmm... by Anonymous Coward · · Score: 1, Funny

      If you are like the typical, ethical P2P-advocate on Slashdot, you will probably download the free-as-in-beer-and-free-as-in-free-sex-Linux-dist ro-of-the-Day or sharing your newest scientifical discoveries, not stealing movies.

      We all know that.

  2. Re:Cool by ari_j · · Score: 2, Funny

    Yes, I was very surprised to discover that, in the past 8 years, kernel.org has had to expand their server performance and their bandwidth allocation. Why didn't they just keep the T1 and Pentium-66? I guess I will have to RTFA to find out why.

  3. Nice to see folks getting credit... by tcopeland · · Score: 3, Interesting

    ...especially having dealt with something like this (on a much smaller scale) recently.

    We were having bandwidth limitations on RubyForge; it was getting up to 80 GB per month at the end of 2004. Mirroring out releases helped get usage back down to 15 GB per month. Many thanks to our mirror providers!

    1. Re:Nice to see folks getting credit... by XiQ · · Score: 1

      Today it will probably be 15*80 GB...

    2. Re:Nice to see folks getting credit... by Kent+Recal · · Score: 1

      80GB. Per month?
      Don't get me wrong but that sounds pretty low (nowadays!).
      You can get a cheap dedicated server for $30/month with 100GB traffic incl.

      A website that I babysit sends 1.5TB/month over the wire.
      We pay well under $1000 for 1/2 rack, power *and* traffic.

    3. Re:Nice to see folks getting credit... by tcopeland · · Score: 1

      > 80GB. Per month?

      Yup, although now with the mirrors in place the system is doing about 200 GB per month.

  4. Has the /. effect met its match? by NixLuver · · Score: 2, Funny

    Good Grief that's a lot of pipe! Saturating a PAIR of gig links? Certainly tends to make one stop and consider how many people are actually USING linux nowadays. Good to see!

    1. Re:Has the /. effect met its match? by |<amikaze · · Score: 1


      Meh, it sounds like a lot until you consider the number of people with broadband. For example, 2000 people with 1Mbit connections could saturate a pair of gigabit links.

    2. Re:Has the /. effect met its match? by GuidoW · · Score: 1

      Yes, but for how long?

      With a 1 MBit/s link, you can download 1 GB in roughly 2.5 hours. Thus, 2000 people with 1Mbit connections could saturate a pair of gigabit links for maybe 5 hours when downloading FC.
      (Sidenote: I have no idea how large FC 4 is actually going to be.)
      The article says they expect the links to be saturated for about 3 to 4 days.

      --
      If it's so secret, then how come I've never heard of it?
  5. Only 1 Gb/sec? no wonder I can't get new releases by Anonymous Coward · · Score: 2, Funny

    Way to slow.

    Mod +5 funny -5 irreverant

  6. Re:Bandwidth and Slashdot Effect on kernel.org by MrAnnoyanceToYou · · Score: 1

    Err... I see ads... Is there a chance they break even on bandwidth if they get traffic of the level Slashdot gives instead of losing $$?

  7. Yes but... by PR_Alistair · · Score: 5, Funny
    multiple quad Opterons each with 24 gigabytes of RAM, 10 terabytes of disk space, and a gigabit link to the internet
    ...does it run Linux?
    1. Re:Yes but... by tehshen · · Score: 2, Informative

      At this time, the servers run Fedora Core and use the 2.6 kernel provided by RedHat.

      Yup.

      --
      Guy asked me for a quarter for a cup of coffee. So I bit him.
    2. Re:Yes but... by daveKfs · · Score: 3, Funny

      More importantly, does it have a stopwatch?

      --
      i r baboon
    3. Re:Yes but... by caluml · · Score: 1
      servers run Fedora Core and use the 2.6 kernel provided by RedHat.

      Seriously? I thought they ran Debian. Did they change? If so, it's a bit of a kick in the face for Debian. And the standard kernel from Redhat? Why?

    4. Re:Yes but... by hpa · · Score: 2, Informative

      mirrors.kernel.org takes up most of the disk space. 10 TB obviously includes plenty of room to grow.

    5. Re:Yes but... by Infinite+Entropy · · Score: 1

      The article said that it was for ease of updates.

    6. Re:Yes but... by Nutria · · Score: 1

      The article said that it was for ease of updates.

      Somebody's smoking a bit too much weed, if they think that RH is easier to upgrade than Debian.

      IIRC, though, I think it's simply because the admins know/like/use RH.

      --
      "I don't know, therefore Aliens" Wafflebox1
    7. Re:Yes but... by brsmith4 · · Score: 1

      yum update is no more difficult than apt-get update. Plus, it's fewer characters so yeah, it is easy. =-P

    8. Re:Yes but... by Nutria · · Score: 1

      yum update is no more difficult than apt-get update.

      RPMs tend to be much less granular than DEBs. So, you need to upgrade or newly install more stuff, even if you don't need/want it..

      --
      "I don't know, therefore Aliens" Wafflebox1
  8. Re:Bandwidth and Slashdot Effect on kernel.org by HaydnH · · Score: 2, Interesting

    "The normal bandwidth used by kernel.org is between 150 to 200 megabits per second"...

    "When asked about viewing the actual access logs, Peter explained that although they do occasionally get requests from various sorts of researchers, they generally don't make them available for privacy reasons." ... "There has been discussion about making the logs available in an anonymized form, but it's not the top priority."

    Perhaps the anonymous logs should be sold to pay for some of this juicy bandwidth they're consuming?

    Haydn.

    --
    Time is an illusion. Lunchtime doubly so. - Douglas Adams
  9. Slashdot history! by Dante · · Score: 4, Interesting

    This was a great article! I can attest the there is quite a difference with the new hardware, I got a 500KBps download last night while downloading rc3-mm2.

    Can we please have the same kind of article about slashdot hardware?

    --
    "think of it as evolution in action"
    1. Re:Slashdot history! by VistaBoy · · Score: 4, Funny

      Oh, you mean the Macintosh Plus that CmdrTaco has in the corner of his living room?

    2. Re:Slashdot history! by Dante · · Score: 1

      I think it was a alpha.....

      --
      "think of it as evolution in action"
    3. Re:Slashdot history! by Dielectric · · Score: 1

      DEC Multia, to be exact. Not to be confused with something cool like an AlphaServer, they were very small form factors with not a lot of proc power. Pentium class, mostly.

    4. Re:Slashdot history! by Nutria · · Score: 1

      DEC Multia, to be exact. Not to be confused with something cool like an AlphaServer,

      The Multia is Alpha, albeit rather anemic even back then.

      http://www.obsolyte.com/dec/multia/

      --
      "I don't know, therefore Aliens" Wafflebox1
    5. Re:Slashdot history! by 0racle · · Score: 1

      Yes but its not cool like an AlphaServer

      Last I checked a Multia wasn't an Alpha Server

      --
      "I use a Mac because I'm just better than you are."
    6. Re:Slashdot history! by Nutria · · Score: 1

      Yes but its not cool like an AlphaServer

      Point taken.

      Last I checked a Multia wasn't an Alpha Server

      Or this one? http://h18002.www1.hp.com/alphaserver/gs320/ We have a couple (running OpenVMS, of course), and they're pretty sweet.

      --
      "I don't know, therefore Aliens" Wafflebox1
  10. Interesting quote by LiENUS · · Score: 5, Interesting

    The 'kernel.org' domain name was picked because by that time in 1997 the more logical seeming Linux dot names were already taken. The Transmeta domain was intentionally not used to avoid creating the false perception that Transmeta owned Linux.

    I wonder what would have happened with Transmeta and Linux if they had used the Transmeta domain to host the kernel archives. Would IBM have gotten involved with Linux? Would SCO have sued Transmeta instead of IBM? Would Linus have left Transmeta?

  11. Alternative Resources by Drunken+Buddhist · · Score: 1

    When Linus Torvalds purchased his first computer on which he began writing the Linux kernel, the state-of-the art PC with 4 megabytes of RAM and running at 33 megahertz was too expensive for him to buy outright.

    Oh my god, it's a diesel!

    --
    -1, Disagree is not a valid option. Troll, Flamebait and Offtopic are not a substitute.
  12. Slashdotting? by TheKubrix · · Score: 3, Funny

    ...it runs on multiple quad Opterons each with 24 gigabytes of RAM, 10 terabytes of disk space, and a gigabit link to the internet...

    Do I smell a challenge?

    1. Re:Slashdotting? by tomhudson · · Score: 1
      Do I smell a challenge?
      No, you smell your average home machine in about 4 years.

      After all, 4 years ago, a pimped-out machine had less than 20gig of hd space, 128meg of ram, and 500-800 mhz. DVD burner? forget it, not at $3000 each.

      Nowadays even entry-level boxes are better than that.

    2. Re:Slashdotting? by TimSee · · Score: 1

      I seriously doubt the average home machine is going to cost > $50K in 4 years. While a future home PC might have that kind of compute power, it won't cost ANYWHERE near that much.

      A quick check at HP.com shows a ProLiant DL585 with 2 800-series dual-core Opteron processors starts at $12K. Add in 24GB of RAM and 10TB (two MSA30 each with 14 drives) and you're north of $50K in a hurry...

    3. Re:Slashdotting? by Daniel+Phillips · · Score: 1

      "you smell your average home machine in about 4 years"

      Multiple 4 way opterons? 24 GB memory? At least 6 years way, assuming that the average home machine does not simply drop in cost at the expense of specs.

      --
      Have you got your LWN subscription yet?
    4. Re:Slashdotting? by tomhudson · · Score: 1
      Costs are coming down way quicker than Moores Law would have predicted.

      How much would a terrabyte of storage have cost you 4 years ago? You would have needed to stack 50 x 20gig drives, plus the cases, controllers, and power supplies, etc. Today? Under a grand, as an off-the-shelf item. Terabyte drives will be in people's boxes by Christmas of next year, if not this year. You can buy 2 TB of storage today for less than what I paid for an 80 meg hd 15 years ago.

      Same thing for cpus. You'll have a hard time NOT buying a 64-bit dual-core within a couple of years.

      Ram? Faster, cheaper, and able to stuff more in the box each generation. A lot of 5-year-old boxes can't handle more than 512 meg, but today a lot of retail boxes can handle 2 to 4 gig, and you can buy a motherboard that will handle 16 gig here for less than $400.

      5 years ago nobody needed 256 meg of ram, or video cards with more than 12 meg. Things change. That $50,000 box will be less than $5,000 in 5 years.

      Ch..ch..ch..changes ...

    5. Re:Slashdotting? by tomhudson · · Score: 1

      1998 most people were "making do" with drives in the 2 to 4 gig category, video cards with 4 to 8 meg, and 16 to 64 meg of ram. 300 mhz was the limit, and 233 or 266 were much more common (450 and 500 mhz pentiums were only introduced in February 1999).

    6. Re:Slashdotting? by FidelCatsro · · Score: 1

      I remember my PC of 4 years ago , Brand spanking new Athlon thunderbird with an awe-inspring 512MB of ram and a great GF2 pro , and a hugh twin -30GB HDD in a raid array (that and my iMac g3 , guess which one i still use as a media server).
      it cost around 3 grand in total . Today i could build a system that with an 80GB HDD , an athlon-xp 2500+ , 512MB DDR RAM a better graphics card all for around 1/15th the price , so what cost me then 3000 will today (for something alot better) cost 150-200 (self built ofcourse , 300- if i bought it as a complete PC).

      I currently have sitting in my room a sun Ultra-sparc server from 98 or 99 which retailed i belive for about 20,000 back then (no idea the exact price), cost me 200 a couple of years ago .

      Just some anicdotal evidence for you

      --
      The only things certain in war are Propaganda and Death. You can never be sure which is which though
    7. Re:Slashdotting? by Dan+Ost · · Score: 1

      Actually, I bought my wife's machine 4 years ago and it came with a 80GB HD,
      1.6GHz AMD CPU, 128meg RAM, CDRW drive, and a DVD drive. Cost about $1500.
      Except for the RAM, it's actually quite comparable to today's entry level
      boxes. Still runs like a champ, too.

      --

      *sigh* back to work...
    8. Re:Slashdotting? by shutdown+-p+now · · Score: 1

      No, it's the smoke from their server.

    9. Re:Slashdotting? by tomhudson · · Score: 1
      But back in May of 2001 it was the fastest amd you could buy was a 1.4ghz - they broke the 1ghz barrier in June of 2000, and 1.6ghz in October of 2001 http://john1701a.com/cpu/AMD_High-Performance_Desk top.htm

      It's amazing how quickly we take for granted what was cutting-edge just a few years ago.

    10. Re:Slashdotting? by tomhudson · · Score: 1
      Do you know ANYBODY (at least anybody with a pulse) who actually bought the dog Pentium 2?

      It was way overpriced.

    11. Re:Slashdotting? by tomhudson · · Score: 1
      They were "making do" because that was all that was available at a reasonable cost. You couldn't buy a 2 ghz machine retail because they didn't exist. Neither did 80gig hds, never mind 300 and 400 gig. I gig ram chips were still in the labs.

      For what the average computer cost back then (2-3k), you can get a decently pimped-out box today. Not only have prices come down, the barrier has been lowered as well. And its just going to get better.

      Look at how quickly 64-bit cpus went from esoteric to "how many do you want"?

    12. Re:Slashdotting? by tomhudson · · Score: 1
      8/2001 is less than 4 years ago. That was the "order date". You couldn't get them in any quantity at that time. And at $562 just for the cpu, nobody was building 1.8 ghz systems for a grand in May of 2001.

      Even the bargain-basement celerons hadn't broken 1ghz by August.

  13. Awesome by caluml · · Score: 1

    I'm in awe of that box. It just pushes so much data, all the time. And 1000Mb/s of bandwidth?! That's more bandwidth than Google!*


    * I strongly suspect this not to be true.

    1. Re:Awesome by moz25 · · Score: 2, Interesting

      Seeing as google has thousands of boxes, my estimate would be that the combined google services pump out over 10Gb/s, rather than just 1Gb/s.

    2. Re:Awesome by ldspartan · · Score: 1

      What amazes me is that on the new cisco ultra-router, the CRS-1, the slowest port it has is 10GigE. That's silly.

      --
      lds

    3. Re:Awesome by wolrahnaes · · Score: 1

      "What amazes me is that on the new cisco ultra-router, the CRS-1, the slowest port it has is 10GigE. That's silly."

      How so? That thing is designed for serious backbone duty, and should thus be built to handle insane amounts of data. That thing makes a business-class router on a GigE line look like a Linksys on a DSL line (not knocking Linksys, I love my WRTs).

      --
      I used to get high on life, but I developed a tolerance. Now I need something stronger.
    4. Re:Awesome by ldspartan · · Score: 1

      I meant silly as a synonym for "awesome" :).

      I'm very excited by having gotten to touch an engineering model of the CRS-1!

      --
      Phil

  14. Re:Bandwidth and Slashdot Effect on kernel.org by A+beautiful+mind · · Score: 1

    Exactly, why is copy pasting a paragraph from the linked article + crap formatting is "Interesting"?

    I don't think that the linked article (last time i checked it wasn't) is slashdotted.

    --
    It takes a man to suffer ignorance and smile
    Be yourself no matter what they say
  15. load average 1024??!!! by teeker · · Score: 4, Interesting

    Referring to 32-bit systems, Peter noted, "we learned that the Linux load average rolls over at 1024. And we actually found this out empirically."

    Can you even get the server to TELL you what the load is when it's that high?? That's INSANE!

    --
    teeker
    1. Re:load average 1024??!!! by Dante · · Score: 2, Insightful


      sure;
      The proccessors could actualy be doing very little other then waiting on the disks.. I'd be more interested in output from vmstat.

      --
      "think of it as evolution in action"
    2. Re:load average 1024??!!! by jeffmeden · · Score: 1

      It depends on how fast the processes are making it to runtime. If you have 1000 jobs that will each take a few msec, but new ones are created at that same rate, you will still get responsiveness out of the system. If they each take a few sec, on the other hand, you have a problem... Oh how I wish everyone understood the difference between load average and load on the cpu.

    3. Re:load average 1024??!!! by opqdonut · · Score: 1

      Would you care to elaborate on that?

      AFAIK load avg is avg number of processes waiting for execution and cpu load can be measured as how much non-idle work the processor is doing.

      --
      yes > /dev/dsp
    4. Re:load average 1024??!!! by diegocgteleline.es · · Score: 1

      Can you even get the server to TELL you what the load is when it's that high??

      Of course you can. "Normal people" often get high load values when they run out of mem and the box starts swapping and you can't control the box. Try running some thousands of "cp /dev/zero /dev/null" or something like that to get high load value, you still will be able to control the box - specially with a reniced root shell.

    5. Re:load average 1024??!!! by jon787 · · Score: 1

      I've seen load averages well over 1500 on Solaris boxen. The CS dept at my college has some issues with giving us sane limits, so programming assignments occasionally go a little crazy.

      Uptime usually works well into 2k, but "ps" and even ls in the /proc folder crap out around 500.

      --
      X(7): A program for managing terminal windows. See also screen(1).
    6. Re:load average 1024??!!! by Kent+Recal · · Score: 1

      Yup, they should put up some pretty graphs for kicks. I wonder whether that would be even doable - maybe some of the counters in /proc are just overflowing all the time?

      Would definately look funny. The scale alone would be good for some jawdropping at second glance...

    7. Re:load average 1024??!!! by drsmithy · · Score: 1
      Can you even get the server to TELL you what the load is when it's that high?? That's INSANE!

      Someone needs to check up on what the load average actually means :).

  16. So then... by advocate_one · · Score: 1
    what does the increase in traffic say about the level of Linux adoption out there...

    As we have some figures for the numbers of machines in the early days and surely we have the traffic figures for then as well...

    We should be able to make a reasonable guess at the number of machines out there with Linux on them...

    --
    Donald 'Duck' Dunn: We had a band powerful enough to turn goat piss into gasoline.
  17. Hat's off to HP by oni · · Score: 4, Insightful

    I'm really happy to see HP giving so much support. I'll definitely remember this the next time someone asks my opinion about what server hardware to buy.

    1. Re:Hat's off to HP by ntsucks · · Score: 1

      Second that here, their generosity to Linux users should be recognized. We all know that IBM contributes heavily to Linux, but they are far from the only ones.

      BTW-- I can attest that the dual Opteron DL145's from HP are rocking boxes for Linux.

      --
      Those who can do. Those who can't sue.
    2. Re:Hat's off to HP by Polycom+Sucks · · Score: 3, Informative

      I'll second this. Their support for many things is outstanding. HP really does seem to care about the end users.

      I have seen them support networking equipment purchased second (third?) hand off of eBay without asking "Where did you buy it?" or anything like that.

      In another instance, one of their supprot people searched through several dozen models of laptops and found one in a different product line that had compatible drivers so that an OS besides Windows XP could be used. No other company has come close to providing this level of support.

  18. Slashdot effect minor compared to .tar downloads by moz25 · · Score: 3, Interesting

    It doesn't surprise me that being linked from slashdot is just a minor effect. A kernel package is tens of megabytes, while a single visit will likely consume less than 100KB.

  19. So, what they need to do... by jd · · Score: 2, Funny

    ...is get themselves mentioned on Slashdot on the same day that there's a simultaneous release of a major distribution and a Linux kernel.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  20. Re:Bandwidth and Slashdot Effect on kernel.org by PReDiToR · · Score: 1

    Probably not.

    If we're savvy enough to need a kernel tarball, we're savvy enough to run AdBlock, no?

    --

    Do not meddle in the affairs of geeks for they are subtle and quick to anger
  21. Re:Bandwidth and Slashdot Effect on kernel.org by MrAnnoyanceToYou · · Score: 1

    Unless we're on a corporate link where we can't use anything but g*d*f*ing IE because the tech department is lazy, pathetically understaffed, stupid, incompetent, or REALLY pathetically understaffed and run by someone who doesn't quite care....

  22. Bittorrent as a large-scale software distribution? by dugenou · · Score: 1
    With the increased bandwidth generated by the git repositories, better than ftp mirroring, distributed sharing would come handy, releaving feed pipe and average load too.

    However, in this post from hpa, it looks like the tools are not ready.

    --
    Love salty crackers? catchy electronica? Try !
  23. noatime interesting by redelm · · Score: 4, Interesting
    More people should look into `noatime` for file-intensive systems. Peter said all the access time updates doubled his load average, and I've seen worse. Try running the `updatedb` to freshen the locate database. Takes minutes. remount FS noatime, flush buffers with a grepbomb, and it takes seconds. Remount with atime, back to minutes.

    1. Re:noatime interesting by cloudmaster · · Score: 1

      More people should look in to noatime in general. The only use I have for atime is rendered moot by stupid network clients that feel compelled to recursively hit every file in a directory when they're browsing. Cough apple calculating size of folders via netatalk cough. Etc.

    2. Re:noatime interesting by cpeterso · · Score: 1


      Do any Linux distros disable noatime by default? If someone cares about noatime, let them manually enable it. Currently, we are all paying the price for those users!

    3. Re:noatime interesting by redelm · · Score: 1
      AFAIK, no distros have `noatime` as default in /etc/fstab.

    4. Re:noatime interesting by Al+Al+Cool+J · · Score: 2, Informative

      Gentoo has noatime as default. There's even a comment in /etc/ftsab about how it improves performance.

    5. Re:noatime interesting by ozbird · · Score: 1

      The sense should be reversed - you should be required to specify "atime" in your mount options if you want it.
      (atime hurts performance is because of the extra drag from the collapsing quantum wavefunctions.)

    6. Re:noatime interesting by runderwo · · Score: 1

      It's default noatime im Debian sarge.

  24. Re:here's an idea by diegocgteleline.es · · Score: 4, Interesting

    DONT USE APACHE.

    This was suggested. The kernel.org people didn't seem to have interest in it. Those light http servers are probably good for lots of small static html files. kernel.org is not like that - it needs to serve + 20 MB files and CD ISOs. Your benchmarks don't measure that. I can bet the kernel.org people knows what they're using and they'd have switched if it'd be really useful.

  25. Re:What? by TheKubrix · · Score: 1

    I agree. Whats worse is that HP really sucks at making computers. They're so focused on price that they tend to put low quality product in their machines......

  26. Re:fine but... by Anonymous+Cowpat · · Score: 1

    it DOES run linux, and they've got several so you can't ask us to imagine a beowulf cluster either...

    Does it run MSDOS?

    --
    FGD 135
  27. I worked at Globix by assantisz · · Score: 1

    I worked at Globix when we offered free bandwidth to kernel.org. In the beginning, when things were going well and we had hundreds of millions of dollars to spend, we used this to leverage our poision in the open source community. Of course, when the bubbled bursted Globix tried to get rid of all the free riders first. It was done very selectively, though. While some were cut loose as fast as possible (like kernel.org), others were kept because they had better connections to some of the executives. I don't want to name names, just that much: it is one of the better known nudie magazines. It was quite a qide to work at Globix. They are still around, barely.

  28. Re:Slashdot effect minor compared to .tar download by A+beautiful+mind · · Score: 1

    If it's an announcement of a new kernel, it is likely that at least some percent of the /. crowd will download the new kernel.

    --
    It takes a man to suffer ignorance and smile
    Be yourself no matter what they say
  29. Obligatory... by rrosales · · Score: 1

    "where it runs on multiple quad Opterons each with 24 gigabytes of RAM, 10 terabytes of disk space, and a gigabit link to the internet"
    Can you image a cluster of these...no wait...

  30. That's what rsync does by trb · · Score: 3, Interesting
    Serving data with http and ftp is is not very CPU intensive, but over time the amount of rsync traffic being fed by the kernel.org server continued to increase, and rsync is CPU intensive. "That's what rsync does" Peter said, "it trades bandwidth for CPU horsepower...

    I don't have occasion to use rsync, and I'm not too familiar with its design, but I think it synchs directories by checksumming the files in them to see if they differ. So Peter is saying above that the server's bottleneck is checksumming. I would think that on a server like this, checksums could be cached - why checksum a stable file more than once? Once you have a checksum for linux-2.6.0.tar.bz2, why calculate it again?

    This would require a bit of bookkeeping when files change, but wouldn't it be worth it on such a busy system? (Or am I confused?)

    1. Re:That's what rsync does by cloudmaster · · Score: 1

      There's also the overhead of compression, which admittedly doesn't make much sense on a file that's already bzipped. What they need is a caching rsync proxy that checks the file to see if it's changed since the last access. :)

    2. Re:That's what rsync does by Feyr · · Score: 1

      rsync uses rolling hashes for segments of the file. iirc segment boundaries are decided on by the client (i could be wrong on that part)

    3. Re:That's what rsync does by hpa · · Score: 3, Informative

      Fortunately, rsync has a compression exclude list.

    4. Re:That's what rsync does by cloudmaster · · Score: 1

      Yeah, and busy rsync sites regularly exclude everything in order to keep CPU overhead down - ignoring the observation that compression is handy when you're transferring parts of a file as opposed to the whole thing...

  31. Interesting they use Fedora and not RH EL by Glasswire · · Score: 3, Funny

    ...I guess that:
    a) kernel.org doesn't think the 'enterprise readiness' of RH Enterprise Linux is that great, (vs what Fedora offers) even in what should be considered one of the most mission-critical sites in the Linux ecosystem (or that the difference with Fedora is worth paying for)
    b) No one at RH is bright enough to be embarassed by this and offer kernel.org some free licenses...

  32. It's About Marketing, DUH by zeromemory · · Score: 1

    I'm really happy to see HP giving so much support. I'll definitely remember this the next time someone asks my opinion about what server hardware to buy.

    I believe that was the reaction HP's marketing department also expected. Admittedly, providing the hardware was a very nice gesture, but in reality, it's a brilliant marketing move.

    Furthermore, I hope you will take other factors and datapoints into consideration when someone asks you for your advice, though. The servers donated were relatively high-end -- they might or might not be reflective of all HP hardware.

  33. Re:here's an idea by hpa · · Score: 4, Informative

    We have, indeed, considered that, but it'd not really buy us anything. Earlier Apaches would sit on a lot of memory while serving large files, but current versions just have a thread sitting in sendfile(), which is just about as lightweight as you get.

    Sure, the startup cost of the transaction is higher than for a lightweight HTTP server, but the startup cost of the transaction isn't a big deal for us, and we appreciate the flexibility that Apache offers.

  34. CVS + p2p by jlebrech · · Score: 1

    thats where we need somekind of link between cvs and a peer to peer system.

  35. Re:What? by Nutria · · Score: 1

    Please name the last president that hasn't placed "cronies" in high ranking positions. I doubt you will find one after 1970.

    1970???? Try 1829 (Andrew Jackson).

    And don't forget that Kennedy put his brother in as Attorney General.

    --
    "I don't know, therefore Aliens" Wafflebox1
  36. Re:Bittorrent as a large-scale software distributi by Nutria · · Score: 1

    the torrent would sure be a releaf.

    Must ... resist ... urge ... to ... ask ... why ... bit ... torrent ... would ... put ... leaves ... back ... on trees.

    --
    "I don't know, therefore Aliens" Wafflebox1
  37. load average includes IO by r00t · · Score: 1

    At least on Linux, the load average includes
    processes waiting for in uninterruptable sleep.
    That would be disk IO, mostly.

  38. Re:here's an idea by Kent+Recal · · Score: 1

    Without having ever looked at the apache code I'd like to ask anyways:

    Doesn't apache do some additional stuff per request that others don't?
    Things like check whether authentication should be requested and the like?

    I might be mistaken but if not - don't these things amount to a significant overhead, esp. when talking.. uh.. "lots" of requests per second?

  39. Re:Bandwidth and Slashdot Effect on kernel.org by PReDiToR · · Score: 1

    Can't you plug in a Keychain?

    I know where my wife works they won't let her use USB ports, which is a real bind when it comes to taking files to work sometimes.

    --

    Do not meddle in the affairs of geeks for they are subtle and quick to anger
  40. Re:Bandwidth and Slashdot Effect on kernel.org by MrAnnoyanceToYou · · Score: 1

    Err... I think they've got the DNS passworded or something. I'm not that attached to any of the data I use at work, seeing as I'm a temp / contractor / whatever, so I don't give a damn what kind of spyware is on my machine... I like firefox quite a bit more, but I get a NT security style login prompt every time I try to load it up. I guess I could track down whoever needs this password and use the other browser, but wI spend so much time messing with red tape I can't imagine spending any more for software I want to use on my own.

  41. rsync is more than a simple checksum by toby · · Score: 1
    Once you have a checksum for linux-2.6.0.tar.bz2, why calculate it again?

    What's different about rsync is that it does not ordinarily use a single file checksum (and therefore copy whole files if changed). Instead, to save bandwidth, it uses a more sophisticated system to ensure that only changed parts of a file are transmitted - and it detects changed parts by comparing (many) checksums, I believe. The report sums it up like this:

    The algorithm identifies parts of the source file which are identical to some part of the destination file, and only sends those parts which cannot be matched in this way. Effectively, the algorithm computes a set of differences without having both files on the same machine.

    (Disclaimer, I have only skimmed the rsync report and that was some time ago, but I am a longtime and happy rsync user.)

    --
    you had me at #!