Slashdot Mirror


HPE Unveils The Machine, a Single-Memory Computer Capable of Addressing 160 Terabytes (venturebeat.com)

An anonymous reader quotes a report from VentureBeat: Hewlett Packard Enterprise announced what it is calling a big breakthrough -- creating a prototype of a computer with a single bank of memory that can process enormous amounts of information. The computer, known as The Machine, is a custom-built device made for the era of big data. HPE said it has created the world's largest single-memory computer. The R&D program is the largest in the history of HPE, the former enterprise division of HP that split apart from the consumer-focused division. If the project works, it could be transformative for society. But it is no small effort, as it could require a whole new kind of software. The prototype unveiled today contains 160 terabytes (TB) of memory, capable of simultaneously working with the data held in every book in the Library of Congress five times over -- or approximately 160 million books. It has never been possible to hold and manipulate whole data sets of this size in a single-memory system, and this is just a glimpse of the immense potential of Memory-Driven Computing, HPE said. Based on the current prototype, HPE expects the architecture could easily scale to an exabyte-scale single-memory system and, beyond that, to a nearly limitless pool of memory -- 4,096 yottabytes. For context, that is 250,000 times the entire digital universe today.

150 comments

  1. Does is Run Linux? by Anonymous Coward · · Score: 0

    That's the first question. The second, is this like 10 years out? This sounds exciting on the surface.

    michaelgautier dot wordpress

    1. Re:Does is Run Linux? by rudy_wayne · · Score: 5, Funny

      Finally, enough RAM for Firefox!!

    2. Re:Does is Run Linux? by Anonymous Coward · · Score: 0

      At least until release 54 comes out.

    3. Re: Does is Run Linux? by Anonymous Coward · · Score: 1

      Wrong!

      The correct question is: "does it run Crysis?"

    4. Re:Does is Run Linux? by Anonymous Coward · · Score: 1

      Actually the first question should be would this be enough to hold all the Internet's porn?

    5. Re:Does is Run Linux? by Anonymous Coward · · Score: 5, Funny

      But it still comes short of what Chrome needs.

    6. Re:Does is Run Linux? by plopez · · Score: 1

      no

      --
      putting the 'B' in LGBTQ+
    7. Re: Does is Run Linux? by F.Ultra · · Score: 1

      Should be no considering the GPU would be lacking.

    8. Re:Does is Run Linux? by HotNeedleOfInquiry · · Score: 1

      Imagine what you could do with a Beowolf Cluster of these...

      --
      "Eve of Destruction", it's not just for old hippies anymore...
    9. Re:Does is Run Linux? by pablo_max · · Score: 1

      Assuming you have flash enabled...otherwise a Beowulf cluster is required.

    10. Re: Does is Run Linux? by Anonymous Coward · · Score: 0

      But only 5 Libraries of Congress... Still got work to do.

    11. Re: Does is Run Linux? by Anonymous Coward · · Score: 0

      I appreciate this comment. I have been making beowulf cluster jokes at my office and my fellow developers have been looking at me blankly

    12. Re:Does is Run Linux? by KiloByte · · Score: 2

      Does it run Linux? That's the first question.

      Only.

      The second, is this like 10 years out?

      Multiple vendors sell servers with 64TB RAM already, and expanding further was blocked by the lack of 5-level paging. Patches to do so have been floating on LKML for a while, thus hardware that can do that should be well past prototype stage.

      On the other hand, all patches I've seen are for x86, and this is arm64, so I'm apparently missing something.

      --
      The creatures outside looked from Alt-Right to Antifa; but already it was impossible to say which was which.
    13. Re:Does is Run Linux? by Big+Hairy+Ian · · Score: 2

      It's a shame there are no baseline performance statistics it would be interesting to know how much of a game changer this thing really is.

      --

      Build a Man a Fire, and He'll Be Warm for a Day. Set a Man on Fire, and He'll Be Warm for the Rest of His Life.

    14. Re:Does is Run Linux? by ausekilis · · Score: 2

      Yea, but how many cat pictures do you need open at the same time?

    15. Re:Does is Run Linux? by maestroX · · Score: 1

      In Soviet Russia, the largest single-memory computer uses 64KB segments.

    16. Re:Does is Run Linux? by David_Hart · · Score: 2

      Yea, but how many cat pictures do you need open at the same time?

      All of them, at once... obviously,,, You just can't have too many cat pictures...

    17. Re:Does is Run Linux? by edxwelch · · Score: 1

      My question is how many floppies do "4,096 yottabytes" take?

    18. Re:Does is Run Linux? by ausekilis · · Score: 2

      1 yottabyte = 2^80 or 10^24

      4096/1.44 = 2,844.4444

      So, basically 2,844,444,444,444,000,000,000,000,000 floppies.

      The weight of one floppy is 19g, in case anyone wants to do the conversion to VW Beetles.

    19. Re:Does is Run Linux? by edxwelch · · Score: 2

      Guess I need to buy another box

    20. Re:Does is Run Linux? by LifesABeach · · Score: 1

      I didn't see any images of the machine. But my first thought was, "16 Petabytes would stop old men eating their soup." My next thought was, "Finally, a machine that could handle a chat bot." So, how much HP?

    21. Re:Does is Run Linux? by GuB-42 · · Score: 1

      In Soviet Russia, the largest single-memory computer uses 64KGB segments.

      FTFY

  2. "The Machine" could they get any more non-descript by UnknownSoldier · · Score: 5, Interesting

    > it could require a whole new kind of software.

    Huh? You mean it not a von Neumann or Harvard architecture because the article doesn't lead me to _that_ conclusion:

    The new prototype has 160 TB of shared memory spread across 40 physical nodes, interconnected using a high-performance fabric protocol. It has an optimized Linux-based operating system (OS) running on ThunderX2, Caviumâ(TM)s flagship second generation dual socket capable ARMv8-A workload optimized System on a Chip.

    So basically 4 TB / node. Is each node have independent memory or not?
     

  3. For good or evil? by Anonymous Coward · · Score: 0

    Because they're going on and on about how it's going to be the best thing ever for Big Data... So...
    Why couldn't they build "The Machine" for good instead?

    1. Re:For good or evil? by gweilo8888 · · Score: 1

      Because good doesn't actually need "250,000 times more storage space than the entire digital universe today." Evil, though, would love to have it so that every single thing you do online can be permanently recorded on your profile for use as a cudgel by the government should you step out of line, and as a monetization tool by big business should you remain in check.

    2. Re:For good or evil? by Samantha+Wright · · Score: 1

      It might! One of the infographics on the HPE site claims the population of Earth will be 80 billion by 2020. That's gonna necessitate a whole lot of good.

      --
      Bio questions? Ask me to start a Q&A journal. Computer analogies available for most topics!
    3. Re:For good or evil? by Maritz · · Score: 1

      It might! One of the infographics on the HPE site claims the population of Earth will be 80 billion by 2020. That's gonna necessitate a whole lot of good.

      lol. I assume they meant 8 billion. Pretty bad mistake.

      --
      I do not want your cheap brainburning drugs. They are useless for work. And I am a working man today.
    4. Re:For good or evil? by St.Creed · · Score: 1

      It wasn't a mistake, it's much more ominous than that. Once The Machine will go live, all other machines that are connected in the (aptly named) Internet of Things will rise up against humanity. Afterwards, the survivors will be used as batteries to power The Machine and others of its kind. They will need approximately 80 billion humans to power the Eight Machines that make up the Council of Kobol. That is your future once this machine goes live. And they even have the gall to tell us outright.

      --
      Therefore, by the (faulty) logic you're using, you're just a cow with a keyboard - osu-neko (2604)
    5. Re:For good or evil? by Maritz · · Score: 1

      Bolt of insight. Mind blown. Show us the way!

      --
      I do not want your cheap brainburning drugs. They are useless for work. And I am a working man today.
    6. Re:For good or evil? by Hognoxious · · Score: 2

      That would make an awesome movie. Just the one, though.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    7. Re:For good or evil? by St.Creed · · Score: 1

      I will. Right after I have made clear that I, for one, welcome our new Machine overlord!

      --
      Therefore, by the (faulty) logic you're using, you're just a cow with a keyboard - osu-neko (2604)
  4. Just great. by fahrbot-bot · · Score: 5, Funny

    I'll have to allocate an entire 1.6 TB drive for swap space.

    --
    It must have been something you assimilated. . . .
  5. "require a whole new kind of software" by Anonymous Coward · · Score: 1

    Then it's dead already. Unless it comes with some kind of magical recompiler.

    1. Re:"require a whole new kind of software" by K.+S.+Kyosuke · · Score: 1

      I think AllegroCache and similar stuff already has you covered on both fronts.

      --
      Ezekiel 23:20
    2. Re:"require a whole new kind of software" by dargaud · · Score: 1

      Unless the performance is really massively superior. Then you'll have some libs optimized for that beast, while the rest of the program runs on a normal frontend. Similar to what we are currently doing with CUDA and such.

      --
      Non-Linux Penguins ?
    3. Re:"require a whole new kind of software" by St.Creed · · Score: 2

      Yes. I fondly remember the Transputer. Brilliant stuff, but noone wanted to learn Occam, one of the most elegant parallel-from-the-ground-up languages I know. But they invented parallellizing compilers and libraries for that. Suboptimal, but given the raw power of this beast, I'm not sure that matters much.

      --
      Therefore, by the (faulty) logic you're using, you're just a cow with a keyboard - osu-neko (2604)
  6. I wonder how long that data takes to load... by Anonymous Coward · · Score: 1

    160 TB...

    32000 seconds or just under 9 hours at 40Gb/s assuming you have a storage array that can saturate that link.

    1. Re:I wonder how long that data takes to load... by dbIII · · Score: 1

      I really don't know why that got modded up.
      They call it a "fabric" because there are several network connections instead of a single choke point.

    2. Re:I wonder how long that data takes to load... by Anonymous Coward · · Score: 0
      40Gb/s is nothing these days, when you're talking about highly specialized hardware.

      ...only requires the energy consumption of a lightbulb, but can route over 3.7Pb/s of optical traffic, with up to 9.6Tb/s capacity per fiber

    3. Re:I wonder how long that data takes to load... by Anonymous Coward · · Score: 0

      40Gb/s is extremely low end for the type of storage array's you would put in that. hell we have servers in midrange with higher bandwidth for disk.

  7. Re:"The Machine" could they get any more non-descr by imgod2u · · Score: 2

    I would wager to guess that each node lives in some subregion of the memory address. And that each OS instance (or one giant distributed OS) accesses all addresses uniformly.

    It's certainly not infeasible even without memristor tech. But I wonder what benefits it has. The whole point of having localized nodes is to take advantage of the travel latency. Unless this is optimized specifically for embarrassingly parallel data feed-forward tasks, which even modern GPU workloads aren't anymore.

  8. Re:"The Machine" could they get any more non-descr by K.+S.+Kyosuke · · Score: 1

    Huh? You mean it not a von Neumann or Harvard architecture because the article doesn't lead me to _that_ conclusion:

    I think what HP means is that you no more have to compress/pack your database tuples into 4K-sized pages because they "just stay in memory". The same for other formerly-disk-based structures like B-trees and such. Also, changes in latencies on their own might change algorithm preferences massively.

    --
    Ezekiel 23:20
  9. 64-bit compilers SHOULD cut it for now... apk by Anonymous Coward · · Score: 0

    Into Zettabyte & Yottabyte range? They'll need new compilers (128-bit capable in whatever's used) is my guess.

    See - I'm not sure there are 128-bit capable ones currently! C++11 std. doesn't even address it in compilers afaik.

    (Yes, you can deal w/ 128-bit data but not "directly" in 1 chunk afaik but rather in specialized function registers in pieces)

    FOR REFERENCE:

    64 bit = 18,446,744,073,709,551,616 (16 Exabytes = 16* 1,024 to 6th power) addressable memory range. 160 terabytes (TB = 1,024 to 4th power) of memory in this prototype.

    APK

    P.S.=> Correct me where I'm 'off'/wrong here boys... apk

    1. Re:64-bit compilers SHOULD cut it for now... apk by Anonymous Coward · · Score: 0

      Obviously ignoring the stupid 10 based junk:
      2^10 is a kilobyte
      2^20 is a megabyte
      2^30 is a gigabyte
      2^40 is a petabyte
      2^50 is a terabyte
      2^60 is an exabyte
      2^60 *2^4 = 2^64, that's 16 exabytes.

      I think it is actually smaller, because I don't think 64 bit chips actually can have 64 address lines. I think there's only 52 bits of physical address lines available in long mode on an x86, for instance (like if you wanted 53 or 64 address lines, you'd need a "64bit mode plus" on the chip, analogous to current 64 bit "long mode", 32 bit "protected mode", etc. Other chips may not have this limit, or have different ones, but you can't go past 4 petabytes on x86 right now, at least.

  10. Re:"The Machine" could they get any more non-descr by Anonymous Coward · · Score: 0
  11. I might regret saying this but... by gfilion · · Score: 5, Funny

    160 TB of RAM ought to be enough for anybody

    1. Re:I might regret saying this but... by Gabest · · Score: 1

      If not, we can still use a memory extender to free up a few TBs.

    2. Re:I might regret saying this but... by skovnymfe · · Score: 1

      How so? Can you simulate the entirety of the universe using only 160TB? No? Then it isn't enough, is it? Hm!

    3. Re:I might regret saying this but... by Anonymous Coward · · Score: 0

      How so? Can you simulate the entirety of the universe using only 160TB? No? Then it isn't enough, is it? Hm!

      Universe? 160 TB is about one byte per atom in a human cell.

    4. Re:I might regret saying this but... by Anonymous Coward · · Score: 0

      But I still only have 580K of conventional memory.

  12. Re:"The Machine" could they get any more non-descr by imgod2u · · Score: 4, Interesting

    It seems to imply more than just persistent memory, though. It sounds like they're distributing processors in the data-path of the connected memory. Instead of the OS determining which context to put on a CPU and fetching the necessary data from memory/disk, the context and code will be decided by what data resides in memory that is closest to the processor node.

    A rather natural result of persistent, high-capacity memory for non-interactive compute tasks.

  13. Can do new things? by Anonymous Coward · · Score: 0

    What can this do that you couldn't when storing things on disk? Sure, it'll be faster than accessing disk but is there anything we couldn't do before that this thing can do now?

    1. Re: Can do new things? by Anonymous Coward · · Score: 0

      There may be hefty memory applications that haven't been explored primarily due to the time factor.

  14. Ob by Hognoxious · · Score: 4, Funny

    It's almost enough to store all the data their keylogger stole.

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    1. Re:Ob by Anonymous Coward · · Score: 1

      Ok, sure. But technically, Hewlett Packard Enterprise (HPE) doesn't make laptops. HP Inc. makes the laptops that had the keylogger. They're two different companies. Welcome to 2017.

  15. Could be worse by Anonymous Coward · · Score: 0

    If it were running Windows 3.1 it would recommend a 480TB swap file.

  16. Re:"The Machine" could they get any more non-descr by geekmux · · Score: 1

    ...could they get any more non-descript.

    Hey, it sure as hell worked for Pink Floyd.

  17. Bus size by Steve-Oh · · Score: 2

    4096 yottabytes = 4.096e27 bytes; 2^n=4.096e27, solve for n ... n = 92. Now we know the market for these 128-bit processors!

    1. Re:Bus size by Anonymous Coward · · Score: 0

      The trouble with big SI prefixes it that you no longer sound like a megadweeb, more a zettadweeb, or even a yottadweeb.

    2. Re: Bus size by Anonymous Coward · · Score: 0

      Hmm... Yottadweeb I am.

    3. Re: Bus size by Anonymous Coward · · Score: 0

      The silliness is strong on this one!

  18. Re:"The Machine" could they get any more non-descr by Anonymous Coward · · Score: 0

    "Re:"The Machine" could they get any more non-descript"

    I think they meant "The Machine That Goes Ping!"

  19. Re:"The Machine" could they get any more non-descr by K.+S.+Kyosuke · · Score: 1

    I don't recall them announcing this the last time this concept was in the news, but if they're doing that, then yes, that's an even bigger change. (I admit I'm still sort of fond of the Connection Machine...)

    --
    Ezekiel 23:20
  20. Interesting but, not amazing by somenickname · · Score: 5, Insightful

    It would have been a lot more interesting, and a lot more paradigm shifting, if it was 160TB of ultra-fast next-gen M.2 sticks with 0MB of traditional RAM and 0MB of traditional storage. That would be a truly unique machine to work on. If you read the article, this isn't even a single machine. It's actually 40 nodes with high speed interconnects. Basically, HP is now running Linux on their VMS clusters.

    1. Re:Interesting but, not amazing by Anonymous Coward · · Score: 0

      I thought they were going to use memristors for this...

      https://en.wikipedia.org/wiki/Memristor

      I'm too lazy to read the article, but that would be a true paradigm shift. It should be dense, fast, cheap.

  21. 1TTweet by Anonymous Coward · · Score: 0

    That's like...1 trillion tweets!

    1. Re:1TTweet by Anonymous Coward · · Score: 0

      I already have a device (/dev/null) which can store and retrieve all the meaningful tweets.

    2. Re:1TTweet by Hognoxious · · Score: 1

      The write speeds are awesome. Plus it's webscale because it doesn't use joins.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    3. Re:1TTweet by Anonymous Coward · · Score: 0

      I'm sensing a joke I don't get.

    4. Re:1TTweet by amalcolm · · Score: 1

      https://www.youtube.com/watch?... Enjoy. One of the funniest things I've seen in a long time

      --
      Time for bed, said Zebedee - boing
  22. nice by Anonymous Coward · · Score: 0

    While the size is awesome, it is hardly revolutionary more evolutionary. It isn't like we could not handle datasets of that size previously, we simply couldn't have it ALL in memory at the same time.

    1. Re: nice by Anonymous Coward · · Score: 0

      Sure you can. There are HPC clusters, distributed memory technologies like MPI and fiber interconnects. Most that Im ware of are designed for computationally intensive tasks (CPU bound) tasks, however, they could be used for memory bound applications. This system is one of the beastiest specialized clusters for "large memory applications I've seen though.

  23. Jedi master Yotta Byte says... by millertym · · Score: 1

    Track and analyze your life to the smallest fraction we will. Soon. sooooooooon. MMHEHEHEHE!

  24. it isn't the size that matters. by Anonymous Coward · · Score: 0

    it isn't the size that matters. It is how well it works its magic.

  25. Interesting article, crappy journalism by aXis100 · · Score: 1

    The article contradicts itself multiple times.

    First, the start of the article (and the summary) say it's a prototype computer with a single bank of memory. Later they report that the machine has the 160TB spread across 40 nodes. It might be logically contiguous but it's hardly a "single bank".

    Secondly, the start of the article describes the architecture as memory-centric, but HP later states: "the Machine is an attempt to build, in essence, a new kind of computer architecture that integrates processors and memory seamlessly using a flexible interconnect scheme"

    Wat?

  26. what a colossal waste by ooloorie · · Score: 1

    Having huge banks of memory and passing them through a "single computer" bottleneck is a colossal waste.

    1. Re:what a colossal waste by Anonymous Coward · · Score: 0

      Having huge banks of memory and passing them through a "single computer" bottleneck is a colossal waste.

      From other comments, it seems some kind of logical abstraction using a bunch of computers. For instance, I could create 160TB of "memory" by say taking a bunch of linux nodes maxed out with ram, creating ram drives and exporting them via NFS. Then I think you could put them all into a software raid set to make one virtual nfs drive that really is memory. Various programs could use that as is. Of course normally NFS would be over probably 1Gb or 10Gb networking. The article mentions a "fabric" which doesn't tell much, though presumably it is less overhead.

      I'm wondering if HPs architecture really is that great. I'd assume the design is something like a memory rich motherboard, likely with 2+ cpus, then someone co-opting one or more pcie slots to do transfers. At work we use the national instruments pcie card that basically makes an exernal pcie bus to an external chassis. Presumably something similar would work. The problem though is scaling. Assuming you could get a 7 slot board then you could do an 8 node system.

      Actually come to think of it, can't pcie cards basically map themselves as "memory". If you could somehow make that work, then your local memory would be bank 0, then you would have bank 1 - 7 mapped in. I doubt you would manage pcie x 16 transfers on all 7 neighbors, but 4 or 8 are possibilities.

      A quick glance shows that windows server can now handle 24TB, though I didn't verify if anyone made hardware that would. Still, 24x8 = 192TB, so the numbers are in the ballpark.

      I'm guessing that kind of design would be 4+ cpus per node, so that kind of design would give say 32+ cpus and who knows how many cores access to the same 192TB of memory, which if the memory was mostly read only, would allow a lot of simultaneous very fast queries.

      I'm confused. What do we need HP for? I'm fairly sure any half competent tech company could make this. it does require some low level driver skills, but it seems doable given time. It certainly doesn't deserve some mystical naming crap. Its just another bump on the engineering road..

      I'm guessing on your dev machine you run an nfs server and have some external relays and such setup to reset all the nodes simultaneously and load their OS via the network, which would almost certainly be linux.

      There are no doubt details to work out. For one thing memory mapped from a card doesn't work the same as normal ram, or at least that was my last experience. You had to declare things volatile since the compiler didn't know any better. Actually the key is probably to make the external ram "act" like internal ram, which might be where the actual research is. For instance any c program using it by reference shouldn't have to do anything special, other than to perhaps wait a little longer for it.

      Still, it seems too complicated. Creating a pcie card with memory slots right on it might actually be less of a mess. Of course the bandwidth to the slots is still finite... If you had say one pcie x16 per actual cpu then you could easily use up that bandwidth on attached ram. Heck I can do that now in opengl calls, well at least with 8x and probably with 16x.

      I wonder how many jobs actually need to see memory like this? I suspect most can be decomposed into something that can be run in parallel when your hitting those numbers. The problem is memory is so much faster than a pcie slot.

      Frankly I'm curious on how HPs junk will compared to just a database spanned across a pcie x16 ssd with of course the system's ram maxed out. I suspect your going to be IO bound either way, and why go through all that mess of spanning it across multiple nodes if your not gaining something.

      Of course if I'm horribly wrong, feel free to tell me....

    2. Re:what a colossal waste by Megane · · Score: 1

      But think of the rainbow tables you could load into it! The NSA should be all over this.

      --
      #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
    3. Re:what a colossal waste by Anonymous Coward · · Score: 0

      Still, it seems too complicated. Creating a pcie card with memory slots right on it might actually be less of a mess. Of course the bandwidth to the slots is still finite...

      This would be a throwback to the 80s.
      Back then CPU speed and bus speed were the same or close ; these days you have people putting flash memory on DIMMs, or hybrid DRAM + flash DIMM (that can save the contents of DRAM to flash in case of a power loss, and/or use the flash as a kind of non-volatile memory as a stop gap before better more proper non-volatile memory get available)

      You even have a graphics card with built-in SSD so as to give it some local storage, awaiting possibly better NVRAM too. This should tell you how PCIe 16x sucks.

      Although, custom/and or better interfaces than PCIe are in the work too, see NVLink 2.0 as an example. This can "blend" together the memory space of e.g. four GPUs between them, or with a host CPU. There's other crap like that such as "Gen-Z" protocol. Formerly there was Hypertransport (used between my crappy CPU and crappy chipset by the way). At HPE there's yet another custom NUMA interface it seems, this been going since the 90s. Like a cheap ass dual CPU PC does between its two CPUs but can go out of the motherboard.

  27. Bad by Anonymous Coward · · Score: 1

    If it is anything like the HPs I have owned, some major part will go out in 2 to 3 years.

  28. So basically by Anonymous Coward · · Score: 0

    This will be able to load a webpage in Firefox?

  29. Remember the Itanic! by LesserWeevil · · Score: 1

    Memory integrated architectures (PRISM, MPA, etc, etc..) have long been a twinkle in our collective eye, but I doubt HPE has the critical mass to pull this one across the finish line. Gone are the days when HP Labs held any credible sway in architecture. When was the last time HP(no E) told us they knew best in things architectural? Remember the Itanic!

  30. Addressing 160 TB by hcs_$reboot · · Score: 1

    Addressing 160 TB just requires a 48 bits bus, which most recent 64 bits architectures have. So "simultaneously" is probably missing from the title..

    --
    Slashdot, fix the reply notifications... You won't get away with it...
    1. Re:Addressing 160 TB by Anonymous Coward · · Score: 0

      It is not simultaneously at all. You have to RDMA it, but without copying it to local node memory. Each node is 2-core 4GiB. Address space is *unified*, so local memory works like thread-local-storage / per-cpu memory in Linux kernel lingo. Access to remote memory is fast enough and easy enough that it doesn't get in the way.

      But this thing is meant to be used by swarm-like parallelized tasks, with function-and-memory locality, creating islands of tasks closer to the memory those tasks are working at.

  31. Re:"The Machine" could they get any more non-descr by dbIII · · Score: 2

    But I wonder what benefits it has

    Being able to do an operation on an entire huge dataset in memory instead of a pile of fetching and carrying to do it on disk.
    Since the alternative is an order of magnitude (or several) slower a bit of latency isn't a terrible price to pay.

  32. "The Machine"? by 93+Escort+Wagon · · Score: 1

    You are being watched...

    --
    #DeleteChrome
    1. Re:"The Machine"? by Anonymous Coward · · Score: 0

      The government has a secret system...

      (captcha: TRACED!)

  33. Improved execution speeds of up to 8,000 times by Anonymous Coward · · Score: 0

    From HPE site: "The company has run new software programming tools on existing products, illustrating improved execution speeds of up to 8,000 times on a variety of workloads"

    Well, I'll believe it when I see it. Seems just too good to be true.

  34. particle detector by Anonymous Coward · · Score: 0

    Also makes a good particle detector.

  35. "4,096 yottabytes" by Anonymous Coward · · Score: 0

    "4,096 yottabytes ought to be enough for anybody.".

  36. Which CPU by unixisc · · Score: 1

    My question is different. Which CPU does it use? Xeon? Or does HP try to leverage what's left of the Itanium? And if it's Itanium, I doubt it'll be Linux: HP/UX would be the only game in town. Linux abandoned it long ago, and even FreeBSD didn't port their LLVM/Clang compiler to this platform.

  37. Time to download some RAM by Anonymous Coward · · Score: 0

    CLICK ME

    CAPTCHA: records

  38. Re:"The Machine" could they get any more non-descr by Anonymous Coward · · Score: 1

    This page:
    https://news.hpe.com/memory-driven-computing-explained/
    has more helpful information about how the architecture works. It's neat.

  39. Re:"The Machine" could they get any more non-descr by arglebargle_xiv · · Score: 1

    Who cares what it runs, the NSA has already ordered a dozen of them.

    In unrelated news, you may want to switch to a minimum password length of 32 characters for any account you care about. Just saying...

  40. What I think they're saying is... by Anonymous Coward · · Score: 0

    ... that the entire memory space works essentially like registers and that operations on data are built into the transfer process meaning that it's massively parallel and allows billions of ops per second on billions of locations per second.

  41. Remember...this is HP by Shoten · · Score: 1

    This is the same HP that hasn't come up with a hit since the bubble jet printer, people. The same HP that pushed a cloud computing solution that was so pig-fucking awful that The Onion mocked them about it. I worked at HP at the time, and I really have to think that The Onion had someone on the inside...because their parody was unbelievably on target. "We have 4G, 5G, 6G...we have all the Gs. We have app." That's literally as bad as what some of the people at HP were about it...it defied belief. This is the same HP that came up with a small microchip that could hold information and push it to your phone...but alas, as good as it sounded to have them talk about it, the phone's receiver had to stay within an inch of the thing, and the data transfer rate was literally as bad as a modem from the late 80s. This is the same HP that couldn't come within billions of dollars of precision as they tried to evaluate the price of another company they bought...and then effectively sued themselves when they realized that they fucked up on the offer they'd made, had accepted, and consummated. HP had to state on their SEC filings that flight of talented people had become a major impediment to their achieving their business goals...starting several years ago. And it hasn't gotten better since. These are stupid motherfucking people.

    Oh, in more recent news, this is the same HP whose business-grade laptops (since we're talking HPE here, really) had a keylogger built into the audio driver.

    So yeah...I doubt that this "machine" is all that. I'm curious...have they ever actually managed to CONNECT it to 160 terabytes of RAM at once, or is this a theoretical capability? Because they lie like a rug about this kind of silly detail. I can't help but notice that those 160 TB all have to be in a "single bank of memory." Wow, that must be one long-ass DIMM!

    --

    For your security, this post has been encrypted with ROT-13, twice.
    1. Re:Remember...this is HP by Anonymous Coward · · Score: 0

      This is the same HP that hasn't come up with a hit since the bubble jet printer, people.

      You do mean ink jet printer, as bubble jet was Cannon's.

    2. Re:Remember...this is HP by Anonymous Coward · · Score: 0

      This is the same HP that hasn't come up with a hit since the bubble jet printer, people.

      You're forgetting the optical mouse, which come a bit after out of HPL's Computer Peripherals Lab, though HP/Agilent never branded one, letting Logitech do it (and later Microsoft got in the game because of cross-patent agreements)

    3. Re:Remember...this is HP by MancunianMaskMan · · Score: 3, Insightful
      years ago we heard HPE (or was it still HP then) talked about betting the farm on "the machine" all full of its new memristor tech, cheap, fast, persistent, practical, egg-laying wolly milk pig kind of chips.

      Now it's "DIMMs with a little battery stuck on" to handle the "persistency". Hope that's just for the demo.

    4. Re:Remember...this is HP by Chrontius · · Score: 1

      The design should translate transparently to MRAM chips, if their engineers are competent.

      If they're really good, their architecture will also handle Intel's 3d Xpoint DIMMs, too.

  42. Re: "The Machine" could they get any more non-desc by Anonymous Coward · · Score: 0

    Obviously a tribute to Person of interest. Rip John Reese 2016

  43. Re:"The Machine" could they get any more non-descr by Anonymous Coward · · Score: 0

    They call it simple "the machine" so that they don't have to tell you what it will be use for.

    Such a system is used for MASSIVE data collection and data mining of YOU, your every purchase, every movement, every phone call, every photo text chat video etc.... ALL OF YOU.... mined and mapreduced into various priceless morsels of control they can instantiate over you, never having given you a dime for the "value" (ahem, control and your soul) they reap.

    You are getting soooo FUCKED, you, your wife, your children, families and grandkids,
    yet you still refuse to rise up and do anything about it.
    Sad, so very sad and stupid you are.

  44. Re:"The Machine" could they get any more non-descr by knightghost · · Score: 2

    AI using multidimensional data sets. I work with cubes in the tens of terabytes that could be sped up thousands of times if they could be held in memory.

  45. Re:Which CPU by Anonymous Coward · · Score: 1

    It is a massive NUMA thing. It uses a ridiculous number of special ARMv8 cores. RFA and all that jazz...

  46. Faster sql server by Anonymous Coward · · Score: 0

    No more disk crunching

  47. himem.sys by Anonymous Coward · · Score: 0

    Don't load OS stuff into the higher memory space, like MSDOS did. Otherwise, In 200 years, gamers will have to fuck around with himem.sys all over again.

  48. Re:"The Machine" could they get any more non-descr by Anonymous Coward · · Score: 0

    Basically they took 10 PCs and put the PC boxes in another box, then labelled that box "The Machine".
    A box of boxes. It'll change the world!

  49. Re: "The Machine" could they get any more non-desc by Anonymous Coward · · Score: 0

    10s of TB? 'nough said

  50. Re:"The Machine" could they get any more non-descr by skids · · Score: 1

    The critical number missing in TFA is the memory access speeds at various tiers of the NUMA.

    Take a 4GHz computer. How far can a memory access go in one cycle given the speed of light? The answer is "not even to the other side of a 19 inch server rack. Not even halfway across a laptop." You can fetch cache lines in bulk, sure, but at some point this fact will intrude into your code, demanding you keep local registers local and tightly coupled calculations on physically close nodes... we can't tell how drastic a relief "The Machine" provides from such constraints without those numbers.

    Also where the RAM design sits between a modern day HPCC and an ideal nonblocking multiport mesh is critical to know... and whether it automatically adapts by moving memory to more efficient banks transparently to the OS/app.

  51. I should have put it in one line by dbIII · · Score: 1

    Why didn't you bother reading the second line of my post before spending so much time writing what you did? What you wrote is all true but kind of irrelevant without a massive leap in technology.
    Multiple nodes is certainly not as fast as having it on one board, but try reading that second line to find out why it's still useful.

    1. Re:I should have put it in one line by skids · · Score: 1

      I did read your second sentence. It seemed pretty a throwaway aside, given this is supposedly more than just a big fast disk.

    2. Re:I should have put it in one line by dbIII · · Score: 0

      Damn - so you didn't understand and I have to explain it to you.

      Access to memory on a remote machine is a great deal faster than access to disk when network speed is not the limiting factor.

      I thought it was kind of obvious to anyone who would want to comment on this article but it appears I was wrong.

  52. Re: "The Machine" could they get any more non-des by Anonymous Coward · · Score: 0

    You mean Cocoa Puffs? (aka root)

  53. obligatory by krray · · Score: 1

    In Russia 160 Terabytes * IS * you. Yet, so true.

  54. Having heard the prototype by Anonymous Coward · · Score: 0

    I think they should have called it "the jet engine." I once made the mistake of entering the room it is in without hearing protection and holy crap is that thing loud!
    You'll note even in the PR video they were wearing acoustic earmuffs.
    Pretty cool technology though. I've always been the kind of person who loves to see the "top" output on an enormous SSI...

  55. Welcome by Anonymous Coward · · Score: 0

    Welcome my son...welcome to the machine...where have you been? It's alright...we know where have you been.

  56. Re:"The Machine" could they get any more non-descr by BlueStrat · · Score: 1

    AI using multidimensional data sets. I work with cubes in the tens of terabytes that could be sped up thousands of times if they could be held in memory.

    Indeed. I wonder how useful it would be for someone like the NSA or NRO for analyzing large datasets in near-realtime like, for instance, all the cellphone communications "metadata" (and contents?) in an area and cross check it against other datasets to destroy privacy, reveal networks of association of political/ideological opponents, etc etc? "Predict" crime a la 'Minority Report'?

    Seems like just the kind of cutting edge mass-data analysis technology leaders of a surveillance state would soil themselves over at the thought of the possibilities for control of the population it could bring to life.

    We must always be careful to remember that the atomic age brought bombs that could end humanity as well as reactors and other tech. This tech I feel has a second edge to that sword with almost equal potential for being used for evil as any weapon of war I know of. This "bomb" could possibly be a key tool in the enslavement of an entire society.

    Strat

    --
    Progressivism (aka US 'Liberalism'): Ideas so good they need a police/surveillance-state to enforce.
  57. Amiga did this way back in 1985 by Anonymous Coward · · Score: 0

    Amiga did this back in 1985 with the Amiga 1000...

    https://en.wikipedia.org/wiki/Amiga_1000

    The Machine is the first modern computing concept that has been able to mimic and surpass the power of the fully dynamic amiga ramdisk back in the day...
    The Machine and Quantum computers are the future maybe coupled with bio and optical curcuits!

  58. IBM POWER and iSeries? by niks42 · · Score: 1

    Isn't this just IBMs iSeries reborn? That was / is a 64-bit address space that addresses physical memory and disk in one single-level storage. Granted, in the real world we don't often put 160TB into a machine, and the balance may be made up of spinning disks, but as far as the software is concerned it is the same, surely?

  59. Could be worse again by niks42 · · Score: 1

    If it were AIX it would have recommended a swap space 2.5x physical memory ..

  60. dear recursivity by hcs_$reboot · · Score: 1

    but then, that memory space being part of the universe...

    --
    Slashdot, fix the reply notifications... You won't get away with it...
    1. Re: dear recursivity by Anonymous Coward · · Score: 0

      That's why you fork off a new universe...

    2. Re:dear recursivity by skovnymfe · · Score: 1

      Doesn't matter how deep you go, there's always another layer to the puzzle. If we concern ourselves with such insignificant details we'll never go anywhere or do anything. Infinity is like that. At some point you just have to say enough is enough.

  61. Old Cosmos computer at Cambridge by Laxator2 · · Score: 1

    The old version of that machine (more than 10 years ago) was using 384 Itaniums with 2GB of RAM per CPU and custom SGI interconnects so that the operating system saw one single memory space an all the CPUs.

    No big news here.

    It looks like HP wants to take something out of the effort that was put into the whole Itanium business, now that it is being discontinued.

    The new version of Cosmos uses x86 CPUs and GPUs as accelerators.

  62. When 64bits will not be enough by Anonymous Coward · · Score: 0

    If you need to address more than 16 exabyte-scale single-memory system, we'll need 128bits arch, not to mention yottabytes...

  63. Re:"The Machine" could they get any more non-descr by Maritz · · Score: 1

    They call it simple "the machine" so that they don't have to tell you what it will be use for.

    Such a system is used for MASSIVE data collection and data mining of YOU, your every purchase, every movement, every phone call, every photo text chat video etc.... ALL OF YOU.... mined and mapreduced into various priceless morsels of control they can instantiate over you, never having given you a dime for the "value" (ahem, control and your soul) they reap.

    You are getting soooo FUCKED, you, your wife, your children, families and grandkids, yet you still refuse to rise up and do anything about it. Sad, so very sad and stupid you are.

    lol. They already do all of that, you numpty.

    --
    I do not want your cheap brainburning drugs. They are useless for work. And I am a working man today.
  64. Re:"The Machine" could they get any more non-descr by Maritz · · Score: 2

    Basically they took 10 PCs and put the PC boxes in another box, then labelled that box "The Machine". A box of boxes. It'll change the world!

    That's your take-home from this? lol.

    Stick to playing with the worms in your garden mate.

    --
    I do not want your cheap brainburning drugs. They are useless for work. And I am a working man today.
  65. Re:"The Machine" could they get any more non-descr by Anonymous Coward · · Score: 0

    Shared memory = a single pool of memory that can be accessed by multiple threads or CPU's simultaneously (as in the System V shmget manual). But to avoid cache coherency problems and to avoid latency, the memory is managed as if it were a network server (high performance fabric protocol).

  66. spy space by Anonymous Coward · · Score: 0

    that's a lot of keylogging storage!

    I think they don't want to miss a thing!

  67. Books? by mark_reh · · Score: 2

    Seriously, are we still using books as a unit of comparison? Why not say it can process 80% of the internet, etc.?

    1. Re:Books? by Voyager529 · · Score: 3, Informative

      Seriously, are we still using books as a unit of comparison? Why not say it can process 80% of the internet, etc.?

      Yes, and there are two related reasons. First, the LoC is a very large amount of data. It's not the kind of data that can land on a USB stick, it's enough to actually prove something.

      Second, it's a known quantity of data. Even if it's approximate, it's a set amount of books, with a set amount of pages. Can we really count the amount of data on the internet? Let's establish a baseline - what constitutes "the internet" in terms of storage? Every website ever? What about apps and the data they create - do we include those databases because mobile apps use them? How many companies will volunteer how big those databases are? GoDaddy will probably be able to more-or-less say how much data they host, but how much of it is active data - does it have to be served up to count? Similarly, does this include Dropbox data that's technically accessible, but only to its end user? If so, what about end users who own their own Synology boxes and back up their pictures to it over the internet? Does the data on those home NAS units count? Do we limit protocols to HTTP, or are we also talking about FTP sites, NNTP servers (do we count the total amount of Usenet data, or does each company who peers that data count separately?), and data available via torrents? What about e-mail - does e-mail count if it's stored on a server and accessible via a web browser? What if it's only accessible via POP/IMAP?

      Even if *you* came up with a number that includes what you deem appropriate for '80% of the internet', it's not going to translate well. If your metric was "anything that is accessible from a computer and isn't behind a login prompt", that's going to be different than someone who says that Dropbox counts, which doesn't fit your criteria - undoubtedly petabytes of difference, making the measurement irrelevant.

  68. very neat by mtmiller100 · · Score: 1

    but, if power is interrupted (because that NEVER happens, even with UPS, right?), do you have to start over from scratch, and reinstall the OS, databases, etc?

    1. Re:very neat by Anonymous Coward · · Score: 0

      UPS? This thing probably runs off a submarine sized nuclear power plant, so you need to reload the DB etc every couple of years only.

    2. Re:very neat by mbone · · Score: 1

      All of the memory is non-volatile.

  69. Re:"The Machine" could they get any more non-descr by omnichad · · Score: 1

    I wonder how useful it would be for someone like the NSA or NRO for analyzing large datasets in near-realtime like, for instance, all the cellphone communications "metadata" (and contents?) in an area and cross check it against other datasets to destroy privacy, reveal networks of association of political/ideological opponents, etc etc? "Predict" crime a la 'Minority Report'?

    Well, they did call it The Machine, so I assume they're trying to make it easy for the government to connect the dots on that idea.

  70. but by sproketboy · · Score: 1

    But can it run Crysis?

    1. Re:but by David_Hart · · Score: 1

      But can it run Crysis?

      In 1080P with all sliders set to low... After all, I didn't see a 3-way SLI GPU as part of the specs....

  71. Re:"The Machine" could they get any more non-descr by bluefoxlucid · · Score: 1

    Doubtful.

    Read-ahead protocols allow you to identify further data sets and bring them in and out of memory faster than algorithmic performance. The fastest pattern is a giant linear read, and you can issue a DMA to read in the next several hundred megabytes and expire the prior without the CPU being further involved.

    Algorithms that process more-complex data sets generally need instrumentation code to identify where the next addresses are, which can be ordered to occur before processing: instead of identify an array of 300, process it, then read off the next address and move your attention there, you would identify the array of 300, skip it, read the next address, issue the read-ahead, and process. This ordering only really adds the call for read-ahead (an OS madvise() call, really) on top of all other work.

    For sufficiently-small data sets, issuing readaheads at every junction is brutal. For large datasets, scattering across RAM would incur an incredible amount of lag due to long RAS/CAS cycles, and CPU prefetch calls are required for performance. A "large data set" is one that your CPU is not sufficiently fast enough to process in a small time.

  72. Finally, a Machine that CYC can run on! by littlewink · · Score: 1

    This will do more to enable true AI than all the neural networks of the last 5 years'.

  73. 67% if you're Irish by Hognoxious · · Score: 1

    They cut a turd in two. Now there's two turds!

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  74. Re:"The Machine" could they get any more non-descr by Anonymous Coward · · Score: 0

    It sounds like it is not SMP architecture. It sounds more like a NUMA architecture. I have questions about whether the processors can crank through all that memory in a reasonable amount of time. I also question how long it would take to load up data into that memory.

    Ignoring that...
    It sounds like someone read this article:
    http://spectrum.ieee.org/computing/hardware/a-radical-proposal-replace-hard-disks-with-dram
    Summary of the article is that if you build a computer system based around RAM instead of non-volatile storage, there is much that can be done. This is especially true if you have a compute model that treats disks as backup to RAM.

    In terms of requiring new software, it is a non-x86 platform and will need to address NUMA memory that exceeds what a typical processor ever expects to see. In fact, you would be hard pressed to find a processor that can support 4TB of RAM. A 64-bit memory address can only access 16TB of RAM. So you need a system that can handle larger pointer sizes.

  75. Memtest by Rick+Schumann · · Score: 1

    What a nightmare. Imagine how long memtest would take to run to identify just ONE goddamn back memory stick! What are you thinking, HP?

  76. A new kind of software? by mbone · · Score: 1

    it could require a whole new kind of software.

    I asked the technical lead Kirk Bresniker (chief architect at Hewlett Packard Labs) about this exact thing at the launch yesterday, and he said no, that you should be able to use conventional software (I specifically asked about Python), with the speed-up occuring under the hood.

    I am not entirely convinced that it will be that easy...

  77. Re: "The Machine" could they get any more non-desc by Anonymous Coward · · Score: 0

    Sooooooooo scary!!!!!

    Lol.

    Grow up. This simply speeds up what's already being done.

    Your life has to be a real disaster thinking like this. What medications are you on?

  78. Re:"The Machine" could they get any more non-descr by Anonymous Coward · · Score: 0

    Isn't that what these guys do?

    http://www.emutechnology.com/

  79. Re:"The Machine" could they get any more non-descr by K.+S.+Kyosuke · · Score: 1

    Read-ahead protocols allow you to identify further data sets and bring them in and out of memory faster than algorithmic performance. The fastest pattern is a giant linear read, and you can issue a DMA to read in the next several hundred megabytes and expire the prior without the CPU being further involved.

    Yes, because it hides the fact that the smallest block you can fetch is hundreds of bytes in size at least, and possibly several kilobytes.

    Algorithms that process more-complex data sets generally need instrumentation code to identify where the next addresses are, which can be ordered to occur before processing: instead of identify an array of 300, process it, then read off the next address and move your attention there, you would identify the array of 300, skip it, read the next address, issue the read-ahead, and process. This ordering only really adds the call for read-ahead (an OS madvise() call, really) on top of all other work.

    And how does that help you with data structures in which the access sequence is data-dependent even over smaller pieces of data? Spatial trees, for example? Unless of course you're tacitly limiting yourself to all the others that aren't. And madvise, isn't that for memory-mapped files on block devices? Since I don't see how madvise could tweak CPU cache logic which is apparently hard-coded. Somehow I can't shed the feeling that you're reasoning circularly, i.e., that we're doing things like this mostly because we were given the tools for them that were provided for us to do these things because that's all we could back then.

    --
    Ezekiel 23:20
  80. Cray computers by Qango · · Score: 1

    Didn't Crays use something like this where the memory was central to the operating structure of the computer? Can anyone enlighten me?

  81. Re:"The Machine" could they get any more non-descr by bluefoxlucid · · Score: 1

    And how does that help you with data structures in which the access sequence is data-dependent even over smaller pieces of data?

    Generally, if you're scattering over different row selects in RAM, you stall the CPU about 200 FSB cycles or 2,000 cycles for a 10x multiplier when you jump around in RAM. That means if the data is all in RAM to begin with and you spend 20 cycles processing, then jump to some data 40 megabytes away, you spend roughly 99.0099% of your time stalled waiting for CPU cache miss. To get around this, you'd have to use CPU prefetch instructions to load the upcoming data into L1.

    Access structures as such tend to be packed to consider CPU cache for that reason--see Judy Trees--and, as such, you'd be fetching blocks of at least 4K if not stuff packed localized to 4M (huge pages) to take advantage of the required packing.

    In other words: structures that aren't decently optimizable by reading the data in from disk ahead of time are hella slow even if in RAM.

    Since I don't see how madvise could tweak CPU cache logic which is apparently hard-coded

    madvise() lets you mmap() 128TiB of disk-backed data (files) and then tell the kernel that span A is no longer needed (so discard it from memory) and span B will be needed very soon (so read it into RAM now instead of waiting for the upcoming read() call). You'll need a separate CPU-level prefetch instruction after that read due to cache flushing on the context switch, except on modern architectures which can context switch to supervisor and back without a thread switch and not flush all user-context cache.

  82. Re:"The Machine" could they get any more non-descr by Anonymous Coward · · Score: 0

    Umm not, a 64-bit memory address addresses 16 exabytes, not terabytes.
    2^40 is a terabyte, 16 terabytes is 2^44. (and so mnemonically : 2^30 is a gigabyte, 2^50 a petabyte, 2^60 an exabyte)
    Though you can certainly run into addressing limits e.g. original x86-64 can physically address 40 bits only, 1TB physical RAM, and 48 bits logically, which is a 256 TB limit.

    The physical addressing limit was since extended (CPU implementation detail rather than hard limit), for the logical limit I am sure x86 can use more if the CPU implements it but old software and OS might not like it (if they did dumb weird things). As you say, a non-x86 platform is used there.

    I give you that this HPE thing is right at any putative 48 bit addressing limit, which is a bit fascinating. If it ramps up in capacity or even right now you could run into file system limitations trying to back it up. Incidentally wikipedia says the max file size for ext4 is 16TB (unless maybe you use block sizes larger than 4KB like FAT did)