Hardware Virtualization Slower Than Software?

Sponsored by VMWare.. what do you expect? by thegrassyknowl · 2006-08-12 20:21 · Score: 5, Insightful

See title... VMWare make software virtualisation products. Of course they're going to try and find that software methods are better.

--
I drink to make other people interesting!

Re:Sponsored by VMWare.. what do you expect? by cp.tar · 2006-08-12 20:25 · Score: 3, Insightful

Even so, they may be at least partially right.

Besides, if a hybrid approach is necessary, VMWare will need to adjust as well. Or am I missing something?

--
Ignore this signature. By order.
Re:Sponsored by VMWare.. what do you expect? by zerogeewhiz · 2006-08-12 20:26 · Score: 3, Insightful

Haven't read it, but I wonder if they were using VT/Pacifica chipsets or no...

It's like Apple's claim that their Intel jobbies are 5x faster - a bit silly and very, very specific...

And yes, VMWare are hardly likely to mention that Xen-style virtualisation is going to be better now, are they?
Re:Sponsored by VMWare.. what do you expect? by mnmn · 2006-08-12 20:48 · Score: 4, Informative

If you search back on Vmware vs Xensource, you'll see Vmware is doing everything to discredit Xen and hardware hypervisors. Instead of saying 'it doesnt work' its more effective to say it works, we have it too, it fails on its own so it needs our software too. From everything I've read about hypervisors including the Power CPU hypervisors from IBM (which have been functional for years) and the original Cambridge paper that created Xen, Hypervisors really outperform software solutions. You do need a software mini-OS as the root on top of which you'd install the OSes which is better than using Windows as the root OS.

But Vmware's agitation is understandable. They're about to lose it all to an open source project. Where have I seen this before?

--
"Give orange me give eat orange me eat orange give me eat orange give me you." -Nim Chimpsky
Re:Sponsored by VMWare.. what do you expect? by XMLsucks · 2006-08-12 20:53 · Score: 5, Insightful

VMware sells both hardware-accelerated and software virtualization products. They implemented full support for VT (how else would they benchmark it? Plus they were the first to support VT). If you run VMware on 64-bit Windows, then you use VMware's VT product. But because VMware's original software method is faster than the VT method on 32-bit, they continue to use the software approach.

VMware's paper is a typical research paper, published at a peer-reviewed conference. This means that they have used the scientific method. The chances are 99.9999% that you will easily reproduce their results, even if changing the benchmarks.

I, on the other hand, am smart enough to see that they are stating the obvious. If you read the Intel VT spec, you'll see that Intel does nothing for page table virtualization, nor anything for device virtualization. Both are extremely expensive, and besides sti/cli, are the prime candidates for hardware assists. Intel will likely solve this performance issue in future revs, but right now, VT isn't fast enough.

Hmmm, virtualisation? Do you happen to work on Xen?
Re:Sponsored by VMWare.. what do you expect? by Anonymous Coward · 2006-08-12 21:02 · Score: 5, Informative
See title... VMWare make software virtualisation products. Of course they're going to try and find that software methods are better.

Disclaimer: I work for VMware.
1. VMware already supports VT, but it's not enabled by default because for normal workloads it's slower. If VT really were faster, do you really think we'd be choosing to use a slower approach and making customers unhappy?
2. Even Intel admits the first generation of VT hardware wasn't so great and now claims that they were aiming for correctness instead of performance:
  - http://x86vmm.blogspot.com/2006/04/intel-quietly-b acking-away-from-vt.html (from Keith's weblog)
  - http://news.com.com/Intel+feeds+virtualizations+ne ed+for+speed/2100-1006_3-6048217.html
Re:Sponsored by VMWare.. what do you expect? by arivanov · 2006-08-12 21:09 · Score: 4, Informative

While they offer software virtualisation products, they are also interested in these products having hardware assistance. The AMD and Intel specs were designed with input from them (amidst other vendors).

As far as the results there is nothing surprising here. This has happened before. Fault driven emulation of 80287 was nearly 50%+ slower than compiled in emulation. There were quite a few other examples x86 which all revolve around the fact that the x86 fault handling in protected mode is hideously slow. Last time I have had a look at it in asm was in the 386 days and the numbers were in the 300 clock cycle range for most faults (assuming no wait on memory accesses). While 486 and Pentium improved the things a bit in a few places, the overall order remains the same (or even worse, due to memory waits). Anything that relies on faults in x86 is bound to be hideously slow.

Not that this matters, as none of the VM technologies is particularly caring about resources. They are deployed because there is an excess resource in the first place.

--
Baker's Law: Misery no longer loves company. Nowadays it insists on it
http://www.sigsegv.cx/
Re:Sponsored by VMWare.. what do you expect? by julesh · 2006-08-12 22:46 · Score: 4, Informative

From everything I've read about hypervisors including the Power CPU hypervisors from IBM (which have been functional for years) and the original Cambridge paper that created Xen, Hypervisors really outperform software solutions.

Note that Xen's original hypervisor implementation *is* a software solution -- it relies on rewriting the guest operating system kernel so that the kind of hardware traps that VMware are talking about here are unnecessary. Note that it worked flawlessly before the virtualisation technology (eg. Intel VT) that VMware is testing was avialable.
Re:Sponsored by VMWare.. what do you expect? by XMLsucks · 2006-08-12 23:01 · Score: 2, Interesting

Where have you seen VMware discrediting XenSource? I haven't seen that. Can you back this up with some links? Searching for "VMware vs Xensource" was fruitless for me. And searching for "VMware discredits XenSource" was also fruitless.

But Vmware's agitation is understandable. They're about to lose it all to an open source project. Where have I seen this before?
I'll let you in on a secret: if you consider all costs, and return on investment, using VMware is a competitive advantage over using Xen. But I don't care whether you believe me, because if you don't, you'll be at a competitive disadvantage, which is to my benefit.
Re:Sponsored by VMWare.. what do you expect? by andreyw · 2006-08-12 23:44 · Score: 2, Insightful

If VMWare's solution still needs a host OS (I remember them using stripped-down Linux for their server offering), then no... they might use use a subset of VT, but its not a true hypervisor.

And by the way... yes... device virtualization is still not there, but your page tables claim is bullshit. If you read the VT (and the SVM) docs, you would realize that you can implement shadow page tables RIGHT NOW. The hardware assists are there.

Hybrid? Good + Bad = Better? by MrFlannel · 2006-08-12 20:31 · Score: 2, Insightful

Software-assisted virtualization: 393 seconds. Hardware-assisted virtualization: 484 seconds. Ouch. It sounds to me like a hybrid approach may be the best answer to the virtualization problem.

So, um, a hybrid approach is better because it will take 439* seconds? Why?

* - I imagine in real life it's not a 1:1 ratio, but for the sake of argument, work with me.

--
Clones are people two.

The correct conclusion is more limited by njdj · 2006-08-12 20:37 · Score: 5, Insightful

The correct conclusion is not that virtualization is better done entirely in software, but that current hardware assists to virtualization are badly designed. As the complete article points out, the hardware features need to be designed to support the software - not in isolation.

It reminds me of an influential paper in the RISC/CISC debate, about 20 years ago. Somebody wrote a C compiler for the VAX that output only a RISC-like subset of the VAX instruction set. The generated code ran faster than the output of the standard VAX compiler, which used the whole (CISC) VAX instruction set. The naive conclusion was that complex instructions are useless. The correct conclusion was that the original VAX compiler was a pile of manure.

The similarity of the two situations is that it's a mistake to draw a general conclusion about the relative merits of two technologies, based on just one example of each. You have to consider the quality of the implementations - how the technology has been used.

Re:The correct conclusion is more limited by TheRaven64 · 2006-08-12 23:09 · Score: 4, Interesting

The easiest architecture to virtualise is the Alpha. It had a single privileged instruction, and all that did was shift to a higher privilege mode (which had a few shadow registers available) and then jump to an address in firmware. The firmware could be replaced by using one of these calls. If you wanted to virtualise it then you could do so trivially be replacing the firmware with something that would check or permute the arguments and then vector off into the original firmware.
It also had a few other advantages. Since you were adding virtual instructions, they all completed atomically (you can't pre-empt a process in the middle of an instruction). This meant you could put things like thread locking instructions in the PALCode and not require any intervention from the OS to run them. The VMS PALCode, for example, had a series of instructions for appending numbers to queues. These could be used to implement very fast message passing between threads (process some data, store it somewhere, then atomically write the address to the end of a queue) with no need to perform a system call (which meant no saving and loading of the CPU state, just jumping cheaply into a mode that could access a few more registers).

--
I am TheRaven on Soylent News
Re:The correct conclusion is more limited by lukas84 · 2006-08-12 23:15 · Score: 2, Funny

The IBM iSeries (identical to the pSeries hardware) also have a hardware HyperVisor.

Their entry models (10k US$) are slow as shit though. Can't say anything about the more expensive machine, but anything that requires around 12 hours to upgrade it's operating system can't be trusted.
Re:The correct conclusion is more limited by renoX · 2006-08-12 23:44 · Score: 2, Interesting

>The naive conclusion was that complex instructions are useless. The correct conclusion was that the original VAX compiler was a pile of manure.

Note that the 'naive conclusion' and the 'correct conclusion' are not contradictory: I remember an article recently where it was shown that the Alpha had three times the power of a correspondig VAX, which made nicely the point that CISC is shit.

Now as Intel has shown, given enough efforts and money even x86 the poorest CISC ISA ever (VAX ISA was much nicer than x86 ISA: more registers, orthogonal design) can be competitive and sofware compatibility makes the rest..
Re:The correct conclusion is more limited by Hal_Porter · 2006-08-13 03:16 · Score: 2, Insightful

> I'm curious why making a VAX fast is such a problem?

I read that the calling convention specified a general call instruction which was architected to do a lot of stuff - build a stack frame, push registers and so on, so even an efficient implementation will be slow. Much of the time, you could get away with something much simpler.

>I would have thought the 16(if memory serves) orthogonal registers would have made a nice
>target for compilers, contrary to the ridiculous number of (non-orthogonal) registers on x86..

x86 is orthogonal in protected mode, and register renaming helps with the low number of architectural registers. And if you're doing something intensive, you have SSE registers to use too. And x86-64 has more architectural registers anyway. So most of the architectural problems with the 8086 have been solved or mitigated.

I guess if the VAX had been as popular, something similar would have happened of course.

--
echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;

Re:Hybrid? Good + Bad = Better? by cp.tar · 2006-08-12 20:43 · Score: 2, Insightful

I suppose there are certain things hardware virtualisation does better.

The trick is, I'd guess, to find out which works better in which circumstances.

You see that people suspect this white paper because of its origin; they are right in doing so at least because only one type of test has been performed; surely not all computing tasks perform the same way as a kernel compile.
This suggests that VMWare have found the example which supports their claims the best; the question is, of course, whether this is the only such example.

So if we suppose that there are certain types of problems where hardware virtualisation outperforms software virtualisation, hybrid solutions seem to be the right way to go.

P.S. I don't really know what I'm talking about...

--
Ignore this signature. By order.

hardware v/s software by toolz · 2006-08-12 20:44 · Score: 2, Insightful

When are people going to figure out that "hardware solutions" are really software running on hardware, just like any other solution?

Sure, the instructions may be hardcoded, coming out of ROM, or whatever, but in the end its instrructions that tell the hardware what to do. And those instructions are called "software", no matter how the vendor tries to spin it. And if the solutions performs badly, it is because the software is designed badly. Period.

--
You aren't remembered for doing what is expected of you

Re:Bias? by RegularFry · 2006-08-12 20:56 · Score: 4, Insightful

Insisting on third-party verification of results is hardly damning either of them... It's just scientific. You (and everyone else) are absolutely right to be sceptical, and not just because VMware have a vested interest in this case. They might just be wrong. Or not.

--
Reality is the ultimate Rorschach.

wrong by m874t232 · 2006-08-12 21:04 · Score: 3, Insightful

Hardware virtualization may be slower right now, but both the hardware and the software supporting it are new. Give it a few iterations and it will be equal to software virtualization.

It may or may not be faster eventually, but that doesn't matter. What matters is that small changes in the hardware make it possible to stop having to depend on costly, proprietary, and complex software--like that sold by VMware.

Re:This is why hypervisors rule by rwhiffen · 2006-08-12 21:25 · Score: 2, Insightful

I don't see how that tracks. How is the %2 impact going to save me a bundle? Moving to linux suposedly will save me money if I virtualize or not, don't see how it being virtualization friendly improves things. Are you saying I'll spend less in hardware by switching to linux? Migrating to linux isn't free (man-hours wise), so the hardware savings better be pretty damn substantial to offset it.

I should be sleeping.

Rich

Use Paravirtualization by graf0z · 2006-08-12 21:28 · Score: 3, Insightful

Paravirtualization (running hypervisor-aware guest kernels, eg patched linux on xen) is faster than both, binary translation and "full" virtualization. And you don't need CPUs with VT extension.

g

Re: Use Paravirtualization by interiot · 2006-08-12 23:03 · Score: 2, Insightful

It just seems like many people who try to move away from Windows seem to want to at least have the option to use Windows once in a while.... The Mac-moving-to-Intel thing was met with a lot of excitement because of this, a lot of linux people seem to say this, and it seems like in a lot of companies employees must be productive with specific document formats. Certainly Windows isn't the only point of virtualization, but it seems like it's a really big one, especially for desktop users.

Not just the CPU by kripkenstein · 2006-08-12 21:55 · Score: 4, Interesting

What matters is that small changes in the hardware make it possible to stop having to depend on costly, proprietary, and complex software--like that sold by VMware.

I am 100% in favor of cheap and open solutions. But I don't agree that this will soon be the case for virtualization. VMWare and the few other major vendors do a lot more than software virtualization of a CPU (which is all TFA was talking about). To have a complete virtualization solution, you need to also virtualize the rest of the hardware: storage, graphics, input/output, etc. In particular graphics is a serious issue (attaining hardware acceleration in a virtual environment safely), which from last I heard VMWare were working hard on.

Furthermore, Virtualization complements well with software that can migrate VMs (based on load or failure), and so forth. So, even if hardware CPU virtualization is to be desired - I agree with you on that - that won't suddenly make virtualization as a whole a simple task.

No not really by Sycraft-fu · 2006-08-12 22:11 · Score: 2, Informative

In the end, the software instructions are actually executed on hardware, and that hardware imposes limits on what they do. In the case of virtualization the problem comes with privlidge levels. Intel processors see 4 levels of privlidge called Ring 0-3, of which two are used by nearly all OSes, 0 and 3. The kernel and associated code runs in Ring 0, everything else in Ring 3. Now the effect of what ring you are in controls what instructions the processor will allow you to execute, and what memory you can access. So if software in Ring 3 tries to execute a certian instruction, the processor will just not do it, it'll generate a fault.

Virtulization software has to deal with this, when the computer it's virtualizing wants to execute such an instruction, it can't just hand it off to the processor, it has to deal with it itself, it has to translate it to instrucitons that can be executed and virtualize what happens, hence the name vitrualization.

The idea with hardware support like VT is that the processor itself will take a more active hand. Virtual machines will actually be able to execute Ring 0 instructions on the processor, because they won't really be running in the main Ring 0, it'll create a seperate isolated privlidge space for it.

A more simple analogy would be to think of basic math. Suppose you want to multiple two numbers and now suppose again that you have a processor that only has an add instruction. Well, you'd have to do the multiplication in software, as in you'd have to do an add loop. Now suppose that a new version of that processor adds a multiplication instruction, that actually commands a multiplication unit. Now you are doing it in hardware. It is not only less code, but faster because there's a dedicated unit for it.

It's not like companies just whack instruction on their CPUs for the fun of it, they command different parts of the hardware to do different things. SSE, 3DNow, etc don't just have the processor run little add or multiply loops, they actually kick on seperate sections of hardware, designed for SIMD. Hence why they get the results they do.

Re:No not really by Sycraft-fu · 2006-08-13 01:01 · Score: 3, Interesting

I haven't read the results, and I doubt I have the technical knowledge to properly analyze them properly. However if I were to guess as to why this might be the case I'd say it's because they didn't do it right. This is a new and fairly complex technology, I somehow doubt it's easy to get right on the first try.

I am not willing, based on a single datapoint, to make any conclusions. That's tanget to my point anyhow, my point was that doing something in hardware and software are quite different.
Re:No not really by chris_eineke · 2006-08-13 07:47 · Score: 2, Funny

However if I were to guess as to why this might be the case I'd say it's because they didn't do it right.
Holy crap, you just bloated "They're wrong." into 26 words. Do you work as a government advisor in your free time?

--
"All you have to do is be fragile and grateful. So stay the underdog." Chuck Palahniuk, Choke

Look to IBM by dpilot · 2006-08-12 22:40 · Score: 2, Informative

IBM has been shipping virtualization since before many of these newcomers were even born. What do you think the 'V' in MVS or VM stands for? I wonder how well IBM's expired patents compare to modern virtualization. Of course in this case it helps to own the hardware, instruction set, and operating system.

--
The living have better things to do than to continue hating the dead.

Re:Look to IBM by pe1chl · 2006-08-12 23:02 · Score: 3, Interesting

IBM's VM also started as a software product that had to cope with virtualisation problems in the hardware.
Just like what is happening now, they added specific support to the hardware to make VM perform better.
This all happened before the development of today's architectures, but in the early days of microcomputing, IBM had the position that Microsoft has today: they were the big company that had 90% of the market, and in the eyes of the newcomers all they did was by definition the wrong thing. So nobody would bother to look at 360 mainframes, VM and how it was done before designing their own processor.
(this would be similar to telling a Linux geek to look at how certain problems are solved in Windows... it is Windows, it is Microsoft, so it has to be the wrong solution)
Re:Look to IBM by TheRaven64 · 2006-08-12 23:31 · Score: 2, Interesting

IBM contribute to Xen. I was at a talk last year by one of the IBM Xen guys. He made the point that IBM has a real advantage in virtualisation because, when they get stuck, they can pop along the hall to the grey-bearded mainframe guys and say 'hey, you remember this problem you had twenty years ago? How did you solve it?'

--
I am TheRaven on Soylent News

Re:Hybrid? Good + Bad = Better? by julesh · 2006-08-12 22:58 · Score: 2, Insightful

Because if you actually RTFA it shows that the hardware virtualization is faster for some benchmarks (e.g. processing system calls) and slower for others (e.g. performing I/O requests or page-table modifications); if you combine the best features of each you should be able to get a virtual machine that is faster than both.

I smell a straw man... by itsdapead · 2006-08-13 00:01 · Score: 2, Interesting

The naive conclusion was that complex instructions are useless. The correct conclusion was that the original VAX compiler was a pile of manure.

Perhaps the intended conclusion was that it was feasible to write an efficient compiler using only a small, intelligently chosen with compiler optimization in mind, subset of the instruction set. Perhaps the fact that the original compiler was (as you assert) "a pile of manure" was not unconnected to the fact that it tried to achieve speed by exploiting the entire, eclectic, VAX instruction set (wonder how they worked the famous polynomial instruction in?) instead of sticking to a subset and applying generalised optimization techniques.

PS: If you think RISC lost the war, then remember that modern x86 processors consist of a RISC core with a translator stage to handle all those pesky, legacy CISC instructions.

--
In a survey of 100 programmers, 111111 thought that duck-typing was a good idea.

And then there's paravirtualization by Anonymous Coward · 2006-08-13 01:04 · Score: 2, Interesting

I don't doubt their numbers, they've been creating virtualized systems very effectively for years.
I think that any kind of "full virtualization" is going to be subject to these issues. If you want to see performance improvements then you should modify the guest os.

VMware's BT approach is very effective and their emulated hardware and bios are efficient, but that won't match the performance of a modified OS that KNOWS it's virtualized and cooperates with the hypervisor rather than getting 'faked out' by some emulation.

I think that's a little innacurate by Sycraft-fu · 2006-08-13 01:16 · Score: 3, Insightful

It's not that people don't look to old mainframe solutions for things, they do, it's that often what was feasable on those wasn't on normal hardware, until receantly. There was no reason for chip makers to waste silicon on virtualization hardware on desktops until fairly receantly, there just wasn't a big desktop virtualization market. Computers are finally powerful to the point that it's worth doing.

It's no supprise that large, extremely expensive computers get technology before home computers do. You give me $20 million to build something with, I can make it do a lot. You give me $2000, it's going to have to be scaled way back, even with economies of scale.

You see the same thing with 3D graphics. Most, perhaps even all, the features that come to 3D cards were done on high end visualizaiton systems first. It's not that the 3D companies didn't think of them, it's that they couldn't do it. The orignal Voodoo card wasn't amazing in that it did 3D, it was much more limited than other thigns on the market. It was amazing in that it did it at a price you could afford for a home system. 3dfx would have loved to have a hardware T&L engine, AA features, procedural textures, etc, there just wasn't the silicon budget for it. It's only with more developments that this kind of thing has become feasable.

So I really doubt Intel didn't do something like VT because they thought IBM was wrong on the 360, I think rather they didn't do it because it wasn't feasable or marketable on desktop chips.

This HAS happened before - with Stacker by tomhudson · 2006-08-13 01:57 · Score: 2, Informative

This won't be the first time software beats hardware.

The original Stacker product was a combination of a hardware card and software. Think of the hardware card as an accelerator for doing the comression/decompression.

The hardware was faster on the oldest machines, but on anything above a 286/12 (I had a 286/20 at the time), or almost any 386, it ran faster without the hardware card. And on every 486, the card was useless.

So, while you may want to "consider the source" of this news, this is only one factor to weigh. As time goes on, I'm sure we'll see more studies, benchmarks, etc.

Remember, there are 3 things that are inevitable in a programmers' life - death, taxes, and benchmarks.

Yes, AMD Pacifica seems to be far better by Morgaine · 2006-08-13 02:27 · Score: 3, Interesting

Is AMD's Pacifica virtualisation system any better?

Apparently, yes, and by a good margin.

There are several documents and articles out there which point out VT's problems and how Pacifica is quite dramatically better. Here's an excerpt from "AMD Pacifica turns the nested tables", part 3 of an informative series of articles:

The basic architecture of the K8 gives AMD more toys to play with, the memory controller and directly connected devices. AMD can virtualise both of these items directly while Intel has to do so indirectly if it can do so at all.

This should allow an otherwise identical VMM to do more things in hardware and have lower overhead than VT. AMD appears to have used the added capability wisely, giving them a faster and as far as memory goes, more secure virtualisation platform."

So, it looks like AMD are ahead on hardware virtualization at the moment.

If I read it correctly, this is because Intel's VT actually requires a lot of software intervention, so it's not actually a very strong hardware solution at all.

--
"The question of whether machines can think is no more interesting than [] whether submarines can swim" - Dijkstra

Parallels on Mac OS? by akac · 2006-08-13 02:28 · Score: 2, Interesting

Well OK. But it could also mean that VMWare doesn't know yet how to properly create a hardware virtualized vm.

Parallels on OS X switches between software and hardware virtualization and using hardware virtualization its about 97% the speed all around of native hardware (consider that virtualization on current Yonah CPUs is equal to one core only). Software virt on Parallels is much slower - on par with running Windows Virtual PC on the same box using Windows XP (not Mac Virtual PC).

I designed h/w virtualization by Anonymous Coward · 2006-08-13 04:55 · Score: 3, Interesting

I designed one of the x86 h/w virtualization offerings. It's obvious that outside of device emulation, the biggest overhead of virtualization is the s/w emulation of what amounts to two levels of address translation (especially hairy in multiprocesor systems due to the brain-dead x86 page table semantics that do not require explicit invalidation). So clearly you want nested-paging support in h/w. However, that support is a little more complex than a few microcode changes to trap selected privileged instructions --- and due to schedule pressures, it didn't make it into the current release. Once that's in, expect h/w virtualization to speed up significantly.

Note that this doesn't make all the other stuff in VT/SVM useless; there are lots of places on the x86 where pure s/w virtualization has to go to great lengths of complexity just to get things correct. As a simple example: there's no way on "old" x86 h/w to save & restore segment descriptors (which you need to do on world switch) --- all you get is the selector, and if the guest O/S has overwritten the in-memory copy, you're out of luck. "Fixable" in s/w (obviously; VMWare does it), but just plain grody. So a major advantage of SVM/VT is that it becomes a lot *easier* to write a VMM (opening up the market to more players; this is starting to show in the Macintosh market) --- eventually, it should become faster, too.

On a separate note, over the next years, expect h/w assistance for dealing the device emulation (and not just from the CPU vendors).

Re:Say whaa??? by Anonymous Coward · 2006-08-13 10:07 · Score: 2, Insightful

So in what way is it different for VMWare? It also is free! And in addition lets you run unmodified kernels.

Slashdot Mirror

Hardware Virtualization Slower Than Software?

39 of 197 comments (clear)