Slashdot Mirror


Linux 2.4 VM Documentation

popoutman writes "Mel Gorman has announced the availability of a guide to the 2.4 kernel VM including a walkthrough of the VM code. Anyone interested in obtaining a solid understanding of the Linux 2.4 VM will certainly want to take a look at this documentation. Mel says that the effort is at least several weeks from being finished, but that he's releasing it now with the hopes of getting feedback to be sure he's on the right track. He also notes that the 2.5 VM is still too much of a moving target for him to document it just yet." See also a Kerneltrap story.

115 comments

  1. Here's a tip for the author. by gpinzone · · Score: 2, Insightful

    Somewhere in your document, you ought to explain what "VM" stands for. That goes for the poster as well.

    1. Re:Here's a tip for the author. by Anonymous Coward · · Score: 0

      See the title

      Code Commentary On The
      Linux Virtual Memory Manager
      Mel Gorman

      VM = Virtual Memory, I guess

    2. Re:Here's a tip for the author. by tprox · · Score: 2, Interesting

      It stands for Virtual Machine, though I know what you mean about explaining acronyms.

    3. Re:Here's a tip for the author. by Anonymous Coward · · Score: 1, Funny

      Choose one :

      VM = Value Management
      VM = Variable Metric
      VM = Velocity Management
      VM = Velocity Meter
      VM = Velocity Modulation
      VM = Version Manager
      VM = VIBE Magazine
      VM = Video Monitor
      VM = Vietnam (Socialist Republic of Vietnam)
      VM = View Mail (Emacs)
      VM = Virtual Machine
      VM = Virtual Madness
      VM = Virtual Memory
      VM = Vivamos Mejor (Let Us Live Better; Guatemalan agency)
      VM = Voice Mail
      VM = Voice Message
      VM = Voice Modulation
      VM = Voltmeter

      but not that one!!!

    4. Re:Here's a tip for the author. by tomhartung · · Score: 2, Informative

      Down a ways on the page we see the title of one of the documents: "Understanding the Linux Virtual Memory Manager."

      As a (an ex-)Java guy I immediately thought it stood for "Virtual Machine."

      --
      See my blog at tomwhartung.com for my resu
    5. Re:Here's a tip for the author. by stratjakt · · Score: 1

      I skimmed the article, and it seems to be about the Virtual Memory Manager's implementation in the kernel.

      I assumed it was about Virtual Machines in user land as well.

      Oh well, the fact that it's an incomplete document on something that noone but kernel hackers care about is irrelevent. It's only on /. because of the word 'Linux'.

      --
      I don't need no instructions to know how to rock!!!!
    6. Re:Here's a tip for the author. by iggymanz · · Score: 3, Informative

      heh, it can stand for a few things in the realm of computers:
      1. virtual memory (using secondary storage to allow the program to address more memory locations than there is in physical RAM)
      2. virtual machine (a machine implemented in software on a real machine, which *could* have different opcodes, or the same ones as the real machine)
      3. VM - the operating system by IBM which allows one of their mainframes to act like many (stands for virtual machine)

      ( Also, there's the VM in VMS, the DEC Vax and Alpha operating system, which stands for VIrtual Memory as in #1)

    7. Re:Here's a tip for the author. by gpinzone · · Score: 2

      Then it should be called the Linux VMM, no? I can guess at a possible meaning, too...but I shouldn't have to.

    8. Re:Here's a tip for the author. by thodi · · Score: 5, Insightful

      People who don't know what it stands for would not want to read the document anyway, don't you think?

    9. Re:Here's a tip for the author. by Anonymous Coward · · Score: 0

      Acronym? Do you say "Vee Em" or "vum"? If the former, it's NOT an acronym. Go read up on what they are (think NATO, AIDS etc.)

    10. Re:Here's a tip for the author. by Anonymous Coward · · Score: 0

      The target audience of this paper *knows* what VM means in the context of the Linux Kernel.

    11. Re:Here's a tip for the author. by Anonymous Coward · · Score: 0

      This is what Java does to you ;-)

  2. Better document... by Anonymous Coward · · Score: 1, Flamebait

    Not to sound like a bitch; but if any you want to understand the concept of a good VM, you should read this which is a proven VM, and has been proven vastly superior to the new Linux VM(s).

    1. Re:Better document... by snak0rific · · Score: 1

      yes, unfortunately i can't take fbsd places where linux goes with reckless abandon. what if i want to hax on a Hitachi SuperH ? learning fbsd is useless to me. how about if i want to target a toaster? can't run fbsd on that either! being a computer science type i'm not really interested in computers per se but "computing in general" which linux is proving to be a great OS for... in general.

      --
      -- "Put on your big girl panties and lift!"
    2. Re:Better document... by cgh4be · · Score: 2, Informative

      You're looking for NetBSD

      Platforms supported:
      acorn26 acorn32 algor alpha amiga amigappc arc arm32 atari bebox cats cesfic cobalt dreamcast evbarm evbmips evbppc evbsh3 evbsh5 hp300 hp700 hpcarm hpcmips hpcsh i386 luna68k mac68k macppc mipsco mmeye mvme68k mvmeppc netwinder news68k newsmips next68k ofppc pc532 playstation2 pmax pmppc prep sandpoint sbmips sgimips shark sparc sparc64 sun2 sun3 vax x68k x86_64

    3. Re:Better document... by AtrN · · Score: 2
      Being a "computer science guy" and being interested in "computing in general" means you should not limit yourself to a single kernel. Learn, look around, be all that you can be :)

      Also, since we're talking VM systems here, I see little need for a CPU with an MMU in a toaster and you'd likely be using the MMU-less Linux in that case. However there may be valid reasons for needing virtual memory in your toaster's OS but I really just want something that slightly burns bread (yes I know about the voice controlled one done at UW) which can be done very cheaply these days. Bettter be a fancy toaster!

    4. Re:Better document... by snak0rific · · Score: 1

      yes, but then it's already done for me. since there's no official "GE Toaster" port, i get to write it myself, and still have the ownij GNU userland! but it still won't be official, just a port to "GE Toasters" :/

      --
      -- "Put on your big girl panties and lift!"
  3. JVM by Anonymous Coward · · Score: 1, Interesting

    If I were to spend a lot of time integrating a JVM into the kernal, would it be useful?

    Kind of like FreeBSD did, fast JAVA under linux would be awesome.

    1. Re:JVM by axxackall · · Score: 5, Funny

      yeah, right. let's also integrate to the kernel Perl, Python, Tcl, Lua, Emacs (for Elisp), Guile, Hugs, OCAML, Bash, Apache (for PHP) and Gecko (I want my Mozilla to work faster too!). I wonder, why is X server still not there? And don't forget about at least two CORBA brokers: Gnorba (everyone would love faster Gnome) and OmniOrb (just for a case). Hey, let's put everything into the kernel! Ooops... It's not kernel anymore and it doesn't want to run either. What was the mistake?

      --

      Less is more !
    2. Re:JVM by Anonymous Coward · · Score: 0

      not so much a JVM integrated, but have it run bytecode instead of standard executables

    3. Re:JVM by PetiePooo · · Score: 5, Funny

      I suspect it would be a waste of your time. Lets look at a bit of history.

      There used to be a kernel-space HTTP server. It was integrated into the kernel for a specific reason: zero-copy access to the network interface memory. It was fast and relatively feature-poor. If it crashed (fortunately, a rare occurance), you got a kernel panic.
      Along came a user-space, zero-copy HTTP server. It was faster and had a few more features to boot. Being a user-space program, if it crashed, you got a core-dump. It could also be run in a chroot jail, a gigantic step more secure than running in ring-0.

      Two lessons can be read from this:
      1. Don't integrate something with the kernel unless there is a specific advantage you hope to gain from it. Will making a JVM part of the kernel really speed it up? Are you sure?
      2. Don't under-estimate the speed of a properly designed user-space Linux program. The kernel developers have done a magnificant job tuning the kernel and providing APIs for performance-critical apps.

    4. Re:JVM by CableModemSniper · · Score: 4, Funny

      OMG! You've reverse-engineered windows!

      --
      Why not fork?
    5. Re:JVM by Anonymous Coward · · Score: 0

      no if it had all those scripting languages integrated into the kernel, windows would actually be really useful and easy to automate.

  4. always nice to see some documentation by mark_lybarger · · Score: 2, Insightful

    but this kernel has been out now since 04-Jan-2001. i know it took them a while to shake out the vm system, but the kernel is now stale. anybody who needs to know the internals of how vm works in the 2.4 kernel already knows. the rest of us just argue on /. about why there wasn't lots of testing done in past releases or about how you don't make big changes in a supposidly stable release series.

    i'm sure those who need to know though are full aware of the vm workings of the 2.6 kernel (ibm, redhat, oracle, rik, google, etc).

    Linux: the big player everyone likes to root for, but noone wants to put in the lineup.

  5. This should be helpful by Jack+Wagner · · Score: 0, Troll

    Most low level coders are going to find this very useful and I for one, even though I use Gnu/FreeBSD (lower context switch latency and better pipelining performance when under heavy paging loads, etc), am certainly glad to see the Linux camp start to come around regarding the documentation issue.

    Fred Brooks in "The Mythical Man Month" states that proper documentation should be given a 20% time slice of each project if you want to have a successful product and this is certainly THE one arena where Linux is lacking, stopping them from commercial success.

    As a tip for best results I suggeest using the Extreme Documentation method when writing your docs, it's saved time on the order of Olog(n) for me and a proven time saving technique.

    Warmest regards,
    --Jack

    --


    Wagner LLC Consulting Co. - Getting it right the first time
    1. Re:This should be helpful by Anonymous Coward · · Score: 0

      Please don't use "GNU/FreeBSD" as a label for FreeBSD, the GNU parts are replaceable, in fact, the only reason it's still around is because the GCC collection is the best opensource compiler, but by no means irreplaceable.

    2. Re:This should be helpful by Anonymous Coward · · Score: 0
      "I use Gnu/FreeBSD (lower context switch latency and better pipelining performance when under heavy paging loads, etc)"

      Translation:

      I have a much bigger dick than you linux n00bs. I am 13371111, 1337 I tells you!!1111111 Fear my skillzzzz I am the 1337est of them alllll111111 I am so 133333333333333........

      (Disappears up own arsehole)

    3. Re:This should be helpful by sporadek · · Score: 5, Funny
      As a tip for best results I suggeest using the Extreme Documentation method when writing your docs, it's saved time on the order of Olog(n) for me and a proven time saving technique.

      Define your function "Olog", please. Surely Mr. "Wagner LLC Consulting Co. - Getting it right the first time" couldn't have meant O(log n)... :)

    4. Re:This should be helpful by Anonymous Coward · · Score: 0

      If he loads GNU versions of the BSD system tools, then he is able to call it GNU/FreeBSD. But I doubt that's what he did because he would have had to write some of his own tools in that process. Haha, he probably loaded GNU tar and GNU emacs the same way people do for Solaris, but he just trolls away with FreeBSD.

    5. Re:This should be helpful by glrotate · · Score: 2

      FreeBSD has had commercial success?

    6. Re:This should be helpful by uid8472 · · Score: 1

      Define your function "Olog", please. Surely Mr. "Wagner LLC Consulting Co. - Getting it right the first time" couldn't have meant O(log n)... :)

      I believe the OP was in fact referring to the Olog-hai, the race of trolls enhanced by Sauron. HTH.

    7. Re:This should be helpful by be-fan · · Score: 2

      Uruk-Hai and it was Saurumon.

      --
      A deep unwavering belief is a sure sign you're missing something...
    8. Re:This should be helpful by Anonymous Coward · · Score: 0

      If I had to run FreeBSD I'd replace the userland tools with the GNU ones (GNU fileutils, sh-utils, etc) because I hate BSD's. Obviously FreeBSD's C library would stick around, but I'd make the system as GNU as I could.

    9. Re:This should be helpful by Richard_at_work · · Score: 1

      the Iomega NAS series of rack servers run a modified freeBSD OS.

    10. Re:This should be helpful by Luminous+Coward · · Score: 1

      According to Robert Foster, the Uruk-hai were bred by Sauron in Mordor, not by Saruman (as was shown in Peter Jackson's movie).

  6. Is late documentation a problem with Open Source? by WarSpiteX · · Score: 1

    I'm no expert on Linux/Open Source, this is purely my curiosity at work. But it seems like many of the smallest projects, or those with large teams don't get documentation well after release. Or this is a misconception of mine?

    --


    I'm a little segfault, short and stout.
  7. An Idiot Savant's Book by Anonymous Coward · · Score: 0

    ooo kernel wapner ... need to read book....

  8. Where I work by oliverthered · · Score: 5, Funny

    We design aned document things first, work out the bugs in the high level design and then code.

    Well maybe not all the time and with serveral itterations, but I only manage your credit raiting, not you kernel VM.

    --
    thank God the internet isn't a human right.
    1. Re:Where I work by Anonymous Coward · · Score: 0

      So, you clearly don't work for any real software company in existence today.

      I'm betting the code is written in LISP. You don't have a deadline... and the code doesn't have to do something useful. You are an academic, and I claim my five pounds.

    2. Re:Where I work by Anonymous Coward · · Score: 0

      I work for a large multi-national company,that processes and deals with you credit history amoungst other things.

      We mainly use Cobal and XSL.

      There are 50 programmers/dbs's on the projedct I'm currently working on, we all refer to the documentation, each other and the client. If there's something vague or inconsistant/contradictory in the documentation it gets fixed before any serious work is done.
      If what's asked for seems a little odd then it's consulted to make sure the client and the project team know what's required (ofter the client doesn't know they only think).

  9. every developer worth his salt knows... by syle · · Score: 5, Funny

    ...that virtual memory works because of small, magical faeries and gnomes.

    --

    /syle

    1. Re:every developer worth his salt knows... by SpaceLifeForm · · Score: 2
      --
      You are being MICROattacked, from various angles, in a SOFT manner.
    2. Re:every developer worth his salt knows... by FroMan · · Score: 2, Funny

      How would this be modded informative... I'd like to meta-meta mod that moderation to +1 Funny along with this post as +1 Funny.

      --
      Norris/Palin 2012
      Fact: We deserve leaders who can kick your ass and field dress your carcass.
  10. Links to actual documentation by braz · · Score: 1

    Congratulations to Mel Gorman for actually managing to get down to this low a level and still explain it sensibly.

    Here are the links for interested readers:
    http://www.kerneltrap.com/node.php?id=55 5 (kerneltrap article)

    Actual documentation:
    The documentation comes in two parts. The first is "Understanding theLinux Virtual Memory Manager" and it does pretty much as described. It isavailable in three formats, PDF, HTML and plain text.Understanding the Linux Virtual Memory Manager
    PDF: http://www.csn.ul.ie/~mel/projects/vm/guide/pdf/un derstand.pdf
    HTML: http://www.csn.ul.ie/~mel/projects/vm/guide/html/u nderstand/
    Text: http://www.csn.ul.ie/~mel/projects/vm/guide/text/u nderstand.txtThe second part is a code commentary which is literally a guided tourthrough the code. It is intended to help decipher the more crypticsections as well as identify the code patterns that are prevalent throughthe code. I decided to have the code separate from the first document asmaintaining the code in the document would be too painfulCode Commentary on the Linux Virtual Memory Manager
    PDF: http://www.csn.ul.ie/~mel/projects/vm/guide/pdf/co de.pdf
    HTML: http://www.csn.ul.ie/~mel/projects/vm/guide/html/c ode
    Text: http://www.csn.ul.ie/~mel/projects/vm/guide/text/c ode.txt

    1. Re:Links to actual documentation by Anonymous Coward · · Score: 0

      there should be a new moderation category: "karma whore" :)

    2. Re:Links to actual documentation by Anonymous Coward · · Score: 0

      braz's article cleaned up, and with links for easy clicking (and AC to avoid karma-whoring)...
      ========

      Congratulations to Mel Gorman for actually managing to get down to this low a level and still explain it sensibly.

      Here are the links for interested readers:
      The kerneltrap article

      Actual documentation:
      The documentation comes in two parts. The first is "Understanding the Linux Virtual Memory Manager" and it does pretty much as described. It is available in three formats, PDF, HTML and plain text.
      "Understanding the Linux Virtual Memory Manager" as PDF, HTML, and Text.
      The second part is a code commentary which is literally a guided tour through the code. It is intended to help decipher the more cryptic sections as well as identify the code patterns that are prevalent through the code. I decided to have the code separate from the first document as maintaining the code in the document would be too painful Code Commentary on the Linux Virtual Memory Manager
      "VM Code Commentary" as PDF, HTML, and Text.

    3. Re:Links to actual documentation by Anonymous Coward · · Score: 0

      But is it +1 or -1?

    4. Re:Links to actual documentation by CoolVibe · · Score: 1
      I would love to have his LaTeX sources to this documentation. I would love to know how he got the code listings to work so well with latex2html. It barfs on mine, leaving ugly \par thingies in the grapics it produces...

      Or does he use a patched latex2html?

  11. MORE documentation needed. by dmelomed · · Score: 1

    This is great, however, where's the internals documentation for other subsystems of the kernel? People need documentation for JFS, ReiserFS, and other large systems. We'd see more of these in other kernels.

    1. Re:MORE documentation needed. by mark_lybarger · · Score: 1

      http://www-124.ibm.com/developerworks/oss/jfs/

      here's the first one i found relevant. i don't know if it's the current implementation or not. i don't follow that stuff too much. i just save files and use whatever doesn't fsck every time the system crashes or locks up.

      i like the method of releasing specs that implementations are to adhere to. and deviation from that should then be documented. W3C i believe was to accomplish a similar goal, but somehow they weren't too successfull.

    2. Re:MORE documentation needed. by inode_buddha · · Score: 2

      cd /usr/src/linux
      make sgmldocs psdocs pdfdocs htmldocs mandocs

      However, I also subscribe to the relevant mailing lists, and review them daily.

      HTH

      --
      C|N>K
  12. VM: Does it really matter? by XaXXon · · Score: 4, Interesting

    I've always wondered why, in today's world of gigabytes of memory in personal computers, why such a big deal is made about virtual memory..

    I'm not trying to troll, and maybe someone can explain this to me, but as far as I know, VM is used when a computer runs out of RAM. You take some of what's in the RAM and toss it on to the hard drive. When you need it again, you reload it back into RAM and use it.

    I've got 1.5GB of RAM in the box I'm typing this on, and the only reason I don't have more is that I have no use for it.

    At my work, we've got some systems that can eat up 2GB of RAM, but we just put 4GB in the box and don't worry about it. We've found that if we let anything touch VM it slows things down way too much.

    Anyways, I was hoping someone could enlighten me as to why VM has been such a big deal recently.

  13. Re:VM: Does it really matter? by Anonymous Coward · · Score: 0

    > I've always wondered why, in today's world of gigabytes of memory in personal computers, why such a big deal is made about virtual memory..

    Because the "today's world" you describe is limited to the affluent portion of computer users/buyers. Many, many many computers exist in non-affluent parts of the real world that have 32-64Mb of RAM, and virtual memory is useful there. These machines are not ignored by the various Open Source OSs, they're deliberately targeted as being ideal platforms - still useful and usable despite being "obsolete" by Microsoft/Intel standards of What A Computer Should Have To Be Useful.

  14. Re:VM: Does it really matter? by Anonymous Coward · · Score: 0

    Because "slows things way down" is a damn sight better than "crashes applications and/or the system", especially on multiuser systems where there are certainly spikes of activity.

  15. TROLL - Please mod parent down by Surreal_Streaker · · Score: 0

    This is a known troll. Mod's: please check this user's other posts and moderate accordingly.

    1. Re:TROLL - Please mod parent down by Jack+Greenbaum · · Score: 1
      One man's troll is another man's humor. I find this supposed troll particularly amusing.

      -- Jack

    2. Re:TROLL - Please mod parent down by NTT · · Score: 1

      Agreed. Anybody who puts a domain name that isn't registered in their user info is a fraud.
      See it here...
      http://www-whois.internic.net/cgi/whois?w hois_nic= wagnerconsultingllc.com&type=domain

  16. Re:Is late documentation a problem with Open Sourc by cyba · · Score: 1

    It's the other way round.
    The real problem is "release early (without documentation)" rule.

  17. Re:VM: Does it really matter? by Anonymous Coward · · Score: 0

    Two words: Graphic arts. I edit reproductions of oil paintings, and some of my image files are upwards of a gigabyte themselves (before layers), so I imagine my undo buffer plus whatever GIMP needs to run whatever filter it's running can take up quite the hefty chunk of memory... and let's not even start on the amount of RAM it takes to hold a full-length feature film at high resolution in memory *uncompressed*.

  18. Mod Parent Down by driverEight · · Score: 1

    Surreal_streaker is right this is a TROLL!

    --

    It's not the size of your .sig that matters, it's how you use it.

  19. Re:Pretty good English for a foreigner by unDiWahn · · Score: 1

    Uh -- that's a joke, right? Can't really tell...
    It's .ie as in Ireland, y'know, one of the many other English-Speaking countries out there... like, say, England?

  20. Re:VM: Does it really matter? by GGardner · · Score: 5, Informative
    I've always wondered why, in today's world of gigabytes of memory in personal computers, why such a big deal is made about virtual memory..

    This is a good question -- there are several features the VM system allows for in addition to overcommitting physical memory. If your system has enough memory to run with no swap/paging space set up, the VM system still provides these features without ever paging or swapping out memory.

    The first is demand paging. That is, only those pages of an executable that are needed are brought into memory, and on-demand. Yes, this saves memory, but more importantly, it makes program startup much faster. Without demand paging, the whole program would need to be brought into memory at startup. Nowadays, when disks are much slower relative to CPUs, than they used to be, this makes a big difference, especially in the Unix shell-pipeline style of programming, where you run many different short-lived programs.

    Secondly, the VM system allows for shared memory segments between processes. This allows for shared memory, threads and shared libraries.

    Finally, the VM system implements caching of the file system, which we all know and love as a good use for all the memory we stuff into our machines now.

  21. Re:Pretty good English for a foreigner by hughmcg · · Score: 1

    Ireland boss....

  22. Re:Pretty good English for a foreigner by xanadu-xtroot.com · · Score: 1
    --
    I'm not a prophet or a stone-age man,
    I'm just a mortal with potential of a super man.
  23. Re:VM: Does it really matter? by Jack+Greenbaum · · Score: 2, Informative
    Secondly, the VM system allows for shared memory segments between processes. This allows for shared memory, threads and shared libraries. Um. Don't you mean it enforces memory separation between processes?

    -- Jack

  24. Documentation and uncommunicative maintainer by FeatureBug · · Score: 4, Informative

    Documenting the VM is a good idea. I hope it is accepted into the /Documentation directory in 2.4.x kernel tarballs but I'm not sure whether you'd get a reply from the maintainer, Marcelo Tosatti. I sent him a carefully written email in August 2002 documenting an invalid config in 2.4.19 which causes the build process to fail. It was disappointing he never responded and apparently did not fix the bug which is still present in 2.4.20 and more recent patches.

    1. Re:Documentation and uncommunicative maintainer by Anonymous Coward · · Score: 0
      Could it be that the kernel maintainer is buried alive in email ?

      I work at the same office as Marcelo and I know for a fact that he gets so much email that it's pretty much impossible to read all of it. Now imagine a conference or a project for a client, which would mean not being able to read email for a week, ie. impossible to catch up later.

      Before criticising Marcelo, it'd be good to imagine how you would handle being bombarded with hundreds of bug reports and patches every day...

      I think Marcelo is doing a good job and shouldn't be flamed just because part of the job is impossible.

    2. Re:Documentation and uncommunicative maintainer by FeatureBug · · Score: 2

      Ok, I sympathise with his daily email overload but maybe he should delegate some of his workw , like dealing with kernel email? There are many other qualified people who have offered to help with maintaining 2.4.x kernels. That would be better than silently dropping email. Maintaining a relatively common Linux kernel series like 2.4.x is such an important job we should not have to rely on one overloaded person -- that's an unnecessary bottleneck. This is not a criticism of Marcelo's quality of work, which I think is very high. I am suggesting that he should delegate at least the work of dealing with kernel email.

    3. Re:Documentation and uncommunicative maintainer by Anonymous Coward · · Score: 0
      maybe he should delegate some of his work, like dealing with kernel email?
      This is called the Linux Kernel mailing list and it can be reached at linux-kernel@vger.kernel.org. If you mail Marcelo personally chances are he won't have time fixing bugs (he is too busy integrating bugfixes by other people), but if you mail the mailing list chances are somebody will have time to solve the bug and send the fix to Marcelo...

      You did send your bug report to the mailing list, didn't you?

    4. Re:Documentation and uncommunicative maintainer by FeatureBug · · Score: 2

      From other people's comments it sounds like Marcelo is too overloaded even to deal with all the bug fixes he receives by email, let alone the bug reports. He silently drops fixes as well as bug reports, so my point is he should delegate some of his heavy workload, such as dealing with email, to the many other qualified people who have offered to help as maintainers of 2.4.x kernels. For some reason he hasn't accepted any of those offers of help.

      LKML is a very high traffic mailing list. I'm wary of adding to that flood of email. However, due to the lack of response from Marcelo I will probably email LKML.

  25. Re:VM: Does it really matter? by GGardner · · Score: 5, Interesting
    Um. Don't you mean it enforces memory separation between processes?

    Good point -- this is such a given on Unix systems that I didn't word it very well. What I should have said is that it enforces memory separation and protection between most memory segments and most processes, and allows for sharing of segments when explicitly setup. This is perhaps the most important thing the VM system does.

  26. Re:VM: Does it really matter? by heh2k · · Score: 3, Informative

    VM does NOT mean just paging/swapping. that is a small part of it. mac os and windows users constantly misuse the term. the vm does demand paging, copy-on-write, file/page cache, buffer cache, shared mem, mmapped files, and more. the most important of which is protection.

  27. Re:VM: Does it really matter? by arjun · · Score: 1

    I've always wondered why, in today's world of gigabytes of memory in personal computers, why such a big deal is made about virtual memory..
    three basic reasons
    1. resource virtualization
    2. information isolation
    3. fault isolation

  28. Re:VM: Does it really matter? by Anonymous Coward · · Score: 0

    We have a 3 systems (unix/not linux) each with 16G of RAM, 6TB of fiber attached SAN storage. When doing a sql query, the rdbms likes to cache indicies in RAM (real or perceived). If the dba team does not have a handle on what data needs to be pinned in RAM and what not, then VM becomes very popular and important. Unless you have data that will never change and an audience that will never make any different choices, then this predicting is a crap-shoot at best.

    When to start chucking things off into paging space ? What stuff to chuck first ? File accesses ? Computational memory ? Processes ? Which processes go first ? Who's processes are more important ? Can the VM system be tuned without a reboot ? I have really been keeping my eye on these discussions for these reasons.

  29. Re:Pretty good English for a foreigner by iggymanz · · Score: 2

    ..but is it English speaking by choice? Check this out.

  30. Re:what's Gnu/FreeBSD ? by Anonymous Coward · · Score: 0


    didn't you mean BSD/FreeBSD ?

  31. Re:Is late documentation a problem with Open Sourc by Anonymous Coward · · Score: 0

    V2_OS has had plenty of documentation for a long time, but no one to code it. It might be an exception among the Open Source projects, though.

  32. Re:Pretty good English for a foreigner by lukew · · Score: 1, Offtopic

    Where the heck is .ie? He has surprisingly good English for someone who wasn't born in the US.

    Here you go fellows, an excellent example of the [hopefully minority] view that America is the center of the universe.

  33. Re:VM: Does it really matter? by Anonymous Coward · · Score: 0

    Virtual Machine --- Not Virtual Memory.

  34. Re:VM: Does it really matter? by uid8472 · · Score: 1

    VM does NOT mean just paging/swapping. that is a small part of it. mac os and windows users constantly misuse the term.

    In all fairness, if under the old MacOS you booted with "virtual memory" off then virtual memory (in the correct sense) really was disabled.

  35. Re:VM: Does it really matter? by tstoneman · · Score: 2, Informative

    If you are asking this question, it really means you have not worked on systems that do not have virtual memory, like DOS.

    Virtual Memory means virtualizing a process's access to memory such that the memory addresses it accesses isn't necessarily the same physical memory address. It allows for complete memory separation between processes, which offers a vast amount of protection so that processes can't crash other processes.

    The only alternative to Virtual Memory is to use actual physical memory addresses. This was exactly the case with DOS. In DOS you would have to manage memory and make sure your process (it's single-threaded... remember?!?) didn't overwrite memory that it didn't own. Everything in memory could be accessed from every single process.

    With virtual memory, two processes could access memory location 0x1000000 and it would map to different physical memory addresses. In fact, the OS will give each process a full memory address range that is completely usuable by the process without any fear of stepping over someone else's memory. This effectively separates processes and prevents a hell of a lot of corruption because of memory bugs in programs.

    This really simplifies programming because whenever we get a memory address from malloc, we don't have to do anything about making sure we don't step over someone else's memory information.

    What you are in fact talking about is paging, the act of having a backing store of information on hard drive. This allows you to keep track of more memory than you physically have, by swapping memory back and forth from physical memory to a hard drive. Any unused memory would be paged to disk.

    Yes, today you can have gigabytes of memory, and when running Mozilla, boy you really don't need virtual memory.

    But what about 10 years from now, when Microsoft Office takes up 50 GB of hard drive space, and 16 GB of memory?

    Virtual memory is a way of abstracting memory use on a per process basis, and it's inclusion in most modern operating systems is one of the reasons why things are relatively stable these days, compared to the old DOS days, anyway.

  36. Different VM by be-fan · · Score: 4, Interesting

    I did a lot of research on VMs about a year ago, and while the different VM systems out there really are interesting they're all kind of limited by the whole "page-level protection" model we've stuck with since the '70s. There are a lot of clever algorithms in the FreeBSD and 2.4 VMs to deal with high load and paging, but the core VM stuff (mapping memory, protection, sharing etc) is largely the same, and is limited by the MMU capabilities of current CPUs. All CPUs basically seperate processes into unique memory contexts, and map (using some sort of page table or reverse page table) physical memory pages of fixed size into the contexts. This solution is sub-optimal for object oriented systems with large scale sharing (the page tables get unweildy, and certain secondary structures, like reverse maps*, grow out of control). What I'd like to see is something new, just for the hell of it being new.

    Here's what I'm thinking. Ditch pages and memory contexts entirely. Instead, divy up a 64-bit virtual address space among individual processes, say 48 bits apiece. If a process wants to access memory outside it's 48-bits, it would need to access it through special pointers (which, thanks to a tagged-RAM architecture) could written to by the OS (allowing the OS to define its own protection and sharing mechanisms). Does anybody know of any existing systems that work even vaguely like this? Or of a different MMU architecture at all? I was hoping that AMD might at least include software TLB management, because there is some nifty stuff you can do with that, but it looks like Hammer will use the same VM mechanism that came out with the 386!

    --
    A deep unwavering belief is a sure sign you're missing something...
    1. Re:Different VM by Wesley+Felter · · Score: 2

      Here's what I'm thinking. Ditch pages and memory contexts entirely. Instead, divy up a 64-bit virtual address space among individual processes, say 48 bits apiece. If a process wants to access memory outside it's 48-bits, it would need to access it through special pointers (which, thanks to a tagged-RAM architecture) could written to by the OS (allowing the OS to define its own protection and sharing mechanisms).

      Look, it's AS/400!

      Doing something "just for the hell of it being new" doesn't seem too popular among CPU manufacturers who are trying to actually make money.

    2. Re:Different VM by be-fan · · Score: 2

      Um, it's called 'innovation.' I mean, we could have stuck with segmentation forever (or better yet, overlays!) but aren't you glad we didn't?

      PS> Hm. I knew AS/400 had a tagged RAM architecture, but I was under the impression that it was for marking crypto data. Could a scheme like the above be implemented on AS/400, and if so, where can I find some reference info about the MMU?

      --
      A deep unwavering belief is a sure sign you're missing something...
    3. Re:Different VM by Wesley+Felter · · Score: 2

      Doing something different just to be different is not innovation. Doing something different because it's better is innovation. In this case, I don't think there's much benefit to single-address-space systems.

      I knew AS/400 had a tagged RAM architecture, but I was under the impression that it was for marking crypto data. Could a scheme like the above be implemented on AS/400, and if so, where can I find some reference info about the MMU?

      My impression is that OS/400 has a single address space and a single level store. The hardware is mostly undocumented AFAIK.

    4. Re:Different VM by catbutt · · Score: 1
      Doing something different just to be different is not innovation. Doing something different because it's better is innovation.


      Maybe, but in my opinion people who just like to try different stuff (because it might be better, or because it interests them to try it, or just because...) tend to be the types who eventually come up with something better. While people who don't want to try anything because they worry they might be "reinventing the wheel" or that "maybe it won't be better" or whatever....probably are unlikely to ever create something useful.

      Usually the best ideas are not arrived at directly, but after exploring many different ideas along the way. Most of those ideas are probably not better than what they replace, but they might be a stepping stone to a really great idea.

      (sorry to go further off topic, but I just hate to see creativity squelched :) )
    5. Re:Different VM by be-fan · · Score: 1

      Actually, single address space operating systems with fine-grained (object-level) protection mechanisms suit object-oriented languages perfectly. I looked up some information, and OS/400 apparently is such a system (and is object-based to boot). The whole programming paradigm of having persistant objects that can be shared and protected naturally (without introducing something like files) simplifies the programming model *greatly*.

      --
      A deep unwavering belief is a sure sign you're missing something...
  37. Re:Pretty good English for a foreigner by Anonymous Coward · · Score: 0

    English good as am beatnick useless Americaner slave in corner. Feednick good laces and shoe, da?

  38. Re:VM: Does it really matter? by Pseudonym · · Score: 5, Informative
    I've always wondered why, in today's world of gigabytes of memory in personal computers, why such a big deal is made about virtual memory.

    Because it's important. :-)

    Some people think that virtual memory means paging to secondary memory, such as disk. It doesn't. Virtual memory systems can support this, but many OSes (e.g. QNX) support virtual memory with no disk paging. (OK, QNX does support disk paging, but only as an afterthought, so that QNX can be self-compiled. GCC takes a lot of memory.)

    Virtual memory provides a virtual address space for each process. The benefits include:

    • The system can avoid memory fragmentation because contiguous virtual pages do not need to be contiguous in physical memory.
    • Processes don't step on each others' address spaces.
    • Separate instances of the same program can share program text space and shared libraries.
    • Different processes can use the same addresses for different purposes.
    • Processes can share memory between each other in a controlled manner (for example, one process may have read-only access to some region, or memory can be shared between a parent and child process via a read-only or copy-on-write mechanism).
    • Processes on different physical machines can share memory. (This can be done using the standard Unix interface. No extra kernel support is required.)
    • Processes can view files as memory (e.g. using mmap), reading in only as much of the file as is necessary.
    • Processes can allocate large regions of memory and only have the amount of those regions which they actually use backed up by physical memory (whether that is primary or secondary memory).
    • The OS has a lot more freedom in how to lay out a process in physical memory to optimise cache usage.

    This is just off the top of my head.

    --
    sub f{($f)=@_;print"$f(q{$f});";}f(q{sub f{($f)=@_;print"$f(q{$f});";}f});
  39. Re:VM: Does it really matter? by nathanh · · Score: 2
    I've always wondered why, in today's world of gigabytes of memory in personal computers, why such a big deal is made about virtual memory.

    Because VM is not just used for swap. It's also used for memory protection, buffer caching, shared libraries, demand paging, and dozens of other performance improvements.

  40. copulating modules by ghassanm · · Score: 2, Funny

    2.1.2 Browsing the Code

    Generally, related operations are clustered together in the same file and there is not much copulation between modules.

    I hope the discovery channel will do a special on copulating modules some day. I've seen module coupling but no copulation yet.

    1. Re:copulating modules by Mel · · Score: 2, Funny

      > I hope the discovery channel will do a special on copulating
      > modules some day. I've seen module coupling but no copulation yet

      Haven't you ever heard of sexy code? Is is meant to be taken literally, isn't it?

      (typo corrected in the work in progress version)

  41. Re:Is late documentation a problem with Open Sourc by inode_buddha · · Score: 2

    I truly think this should be modded as "Insightful", it's quite true in my experience.
    The cure that I use for this is to read the code comments and subscribe to the mailing lists.

    --
    C|N>K
  42. So any clues why VM goes zombie? by Jayfar · · Score: 3, Interesting

    5 root 9 0 0 0 0 Z 0.0 0.0 0:18 kswapd

    My kswapd went zombie after ~18days uptime; it had gone zombie after about 70 days prior to that reboot. I've found references to this occurance ...going back at least through 2.2 kernels and it still happens with 2.4.18 (not the latest, but no real reason for me to build a new kernel unless it solves this). I've never read a thread on this that arrives at a solution or even a description of what is going on. I first encountered it when I went from Mandrake 8 (2.4.8 kernel) to 8.1 (2.4.18).

    Fortunately, I have plenty ram and this doesn't seem to otherwise affect anything.

    1. Re:So any clues why VM goes zombie? by tconnors · · Score: 1

      My kswapd went zombie after ~18days uptime; it had gone zombie after about 70 days prior to that reboot.

      I got the feeling this was always from bad RAM/motherboard etc.

  43. Re:VM: Does it really matter? by Anonymous Coward · · Score: 0

    While you, like us, have a Gbyte or more on your desktop system, most of the servers I've seen have an order of magnitude less. The vast majority of our 40+ servers have 128 Mbytes or less. That's because the desktops are the cheap junk we replace every couple of years. This round, we're using Dells. On our servers, we buy quality so we can't afford to upgrade nearly as often. $3,500 IBM rack-mount servers + $500 redundant power supplies + $1,500 RAID controllers + $800 or much more worth of SCSI hard drivers + ECC RAM, etc, is expensive. Over half of our servers that run Linux have an uptime of over one year. It would be very costly to replace them with something else that reliable. So, like most places, our servers end-up on the trailing edge of technology. Also, the cost of replacing servers is very expensive. The downtime and the pain of moving software is costly. We swap-out desktops in about half a day, but it took us three months to trackdown all of the little problems when we upgraded our PostgreSQL server. Small differences are painful when you have about 400 different programs that access data on that one server. So while on your desktop you only have to worry about a few applications, when we upgrade one of our servers, many more applications and many more people (including customers directly, if it's a web server!) are affected. That's why most servers have less RAM than you think they would. You just can't afford upgrade them!

    This is why Linux is so nice. It makes good use of the hardware so, in the case of our Sun servers, it has given us three years (so far) more use out of the same hardware. And, no we didn't have 1.5 Gbytes of RAM on those machines when we bought them five years ago.

  44. Re:VM: Does it really matter? by sjames · · Score: 2, Informative

    Another benefit is sparse allocation. When memory is allocated, it's all mapped to a zero filled page and marked copy on write. That allows programs to allocate massive arrays that are expected to be primarily filled with zeros without actually requiring that much memory. When a value is writen, a new page is allocated and mapped into the virtual space.

    If desired, a file can be mmaped into the array instead to provide a simple persistant storage.

    IIRC, suspend to disk is (or at least can be) also accomplished through VM.

    The VM also allows an x86 system to have more than 4GB of ram. No single process can have more than a total of 4G (counting the kernel space) but using PAE, more than a total of 4GB can be allocated to multiple processes through the VM.

    Although Linux doesn't do this, some OSes completely replace the filesystem with a VM. All files and directories are objects and are demand paged.

  45. Best course feedback I ever heard by smittyoneeach · · Score: 1

    "Instructors used too many TLAs without explaining what they mean".
    TLA=Three Letter Acronym, BTW

    --
    Get thee glass eyes, and, like a scurvy politician, seem to see things thou dost not.--King Lear
  46. Re:VM: Does it really matter? by Pseudonym · · Score: 2, Insightful
    If desired, a file can be mmaped into the array instead to provide a simple persistant storage.

    Mmap is way, way more useful than that.

    The key benefit of mmap is that on decent OSes (i.e. not NT/2000/XP), you get effectively get the buffer cache for the file mapped into your address space. Without mmap, you'd effectively have two copies of the file go through memory: one in the buffer cache and one in the application.

    This is not so serious if your file is a sequentially-read stream, because the application won't hold much at a time. However, if it's a randomly accessed file with a large working set (e.g. in a database server), the win is huge.

    --
    sub f{($f)=@_;print"$f(q{$f});";}f(q{sub f{($f)=@_;print"$f(q{$f});";}f});
  47. Re:VM: Does it really matter? by dotgain · · Score: 1

    Another cool thing 'bout virtual mem, is you no longer have a prob with memory fragmentation, or on old MacOS's "Largest available block"

  48. Re:VM: Does it really matter? by sailesh · · Score: 1
    There is a lovely paper by Andrew Appel and Kai Li which explains a lot of cute uses of VM.

    Abstract:

    Memory Management Units (MMUs) are traditionally used by operating systems to implement disk-paged virtual memory. Some operating systems allow user programs to specify the protection level (inaccessible, readonly, read-write) of pages, and allow user programs to handle protection violations, but these mechanisms are not always robust, efficient, or well-matched to the needs of applications. We survey several user-level algorithms that make use of page-protection techniques, and analyze their common characteristics, in an attempt to answer the question, "What virtual-memory primitives should the operating system provide to user processes, and how well do today's operating systems provide them?"

  49. Windows CE 2 by NigelJohnstone · · Score: 1

    Similar, but 4Gb space split into 32Mb chunks. The running process's 32Mb chunk is swapped into the bottom page, so every process thinks it runs in 0-32mb address space.

  50. Yes you did a mistake ;-) by Anonymous Coward · · Score: 0

    Java is a compiled language that produce bytecode for the Java platform and have nothing to do with perl, python or TCL ... !!!

    The bytecode is translated using a VM into native assembler either at the fly (the interpreter) or asynchronously with a JIT engine.

    There come a common mistake : "Java is interpreted" ! That's the reason 1/3 of the people still thing Java and JavaScript are the same thing.

    The Linux kernel do have a module dedicated to Java bytecode translation, but it is dog slow compare to the modern JIT engines (Hotspot with exact GC for instance).

    When people talk about Java they generally talk about the platform and not the language. Just because once compiled, the language do not care any mode (please also note, that you can build bytecode classes using alternative languages such as eiffel, basic for instance using appropriates compiler http://grunge.cs.tu-berlin.de/~tolk/vmlanguages.ht ml ).

  51. Re:VM: Does it really matter? by p3d0 · · Score: 1

    VM is not just for paging, which is what happens when you run out of RAM. It's a level of indirection between a memory address and a RAM chip. That level of indirection allows you to do all kinds of cool things.

    --
    Patrick Doyle
    I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
  52. Re:VM: Does it really matter? by spoon42 · · Score: 1

    This is all very true, and as Figure 1 clearly shows, the ISA memory hole still causes problems:

    *_g_o_a_t_s_e_x_*_g_o_a_t_s_e_x_*_g_o_a_t_s_e_x_*_
    g_______________________________________________g_ _
    o_/_____\_____________\____________/____\_______o_ _
    a|_______|_____________\__________|______|______a_ _
    t|_______`._____________|_________|_______:_____t_ _
    s`________|_____________|________\|_______|_____s_ _
    e_\_______|_/_______/__\\\___--___\\_______:____e_ _
    x__\______\/____--~~__________~--__|_\_____|____x_ _
    *___\______\_-~____________________~-_\____|____*_ _
    g____\______\_________.--------.______\|___|____g_ _
    o______\_____\______//_________(_(__>__\___|____o_ _
    a_______\___.__C____)_________(_(____>__|__/____a_ _
    t_______/\_|___C_____)/______\_(_____>__|_/_____t_ _
    s______/_/\|___C_____)___ISA_|__(___>___/__\____s_ _
    e_____|___(____C_____)\______/__//__/_/_____\___e_ _
    x_____|____\__|_____\\_________//_(__/_______|__x_ _
    *____|_\____\____)___`----___--'_____________|__*_ _
    g____|__\______________\_______/____________/_|_g_ _
    o___|______________/____|_____|__\____________|_o_ _
    a___|_____________|____/_______\__\___________|_a_ _
    t___|__________/_/____|_________|__\___________|t_ _
    s___|_________/_/______\__/\___/____|__________|s_ _
    e__|_________/_/________|____|_______|_________|e_ _
    x__|__________|_________|____|_______|_________|x_ _
    FIGURE 1

    Why this has never been fixed is a mystery to me.


    GSA PUBLIC KEY
    asl;dfhlaskhdflkhasdflkasdhkflhasldfhjaskhdkf jhasdfkhahsdfy90y08fy08aw3408g0nb0h0bh00bhf00f0abh 0a0hfh0gh0aifhsdhghkasdkkchxnvnlasdf0g8y00a8b908fd halnblldvbnxcv8b08df-0b8--a8dfganslenal34n,mfnbmnx lkbn0adfbn8a8m-d-bh-a-npsnk33nkknlaw34nlkkljlkajkb j-9cv-b90a9-c-9adf-9b9a-9-9n3nkaklnFUCK TACO FUCK TACO FUCK TACO al;skdjflkasjd0gyb08cbxh08bhlxkbhlk3halasjdhfjlhas ;dfh;jlasdfjhasgjhgjhgjhogjhjhkhjjhbnhkhjhnfjjhfjk jknjfkjgnfjkljnh jfjhfhjfjfhghjlkjjhjlasdhf;lllllhhhahhhfalsa230pa3 h8-hwpvjdfnadfbh08xc0yb80xaho;bj4fn aj3bjabj;b0xc8[0x8fbyaiobf3jlwbjlwsbjlfasdflab79xp c7aguj;xkcbvs0x8y0kjaasbkdbnkjbaxxvna0r-0s-0d--f TENTACLE RAPE jfda8sd0f0a8sdhf-8as-dg0a=[nxlkncvl;na08rgvh-a9psd g ahsld0aac-x9bxcbn CLAIM FAILED lkjasfdnogvobxcaaax-98xccccccanpigasasdgn;occ8-ax9 jdsagne4o3n------9napisdamdn;jlansld DEFAULT CATCH asl;idkn0s08a008sd8f0cchcxchhd0as0d-fa-j3nnl3knlax 0c-8jsalejnfas800dg-0afg-a8u93ha;kjhq;j123e;qmnseu dfh0agg089asd-9c-x xcuihlkhslbhmhblkhcx08c-0x-080bh3km,

    --
    --- this comment is presented in WIDE SCREEN STEREO!!!