Researchers Claim 1,000 Core Chip Created
eldavojohn writes "Remember a few months ago when the feasibility was discussed of a thousand core processor? By using FPGAs, Glasgow University researchers have claimed a proof of concept 1,000 core chip that they demonstrated running an MPEG algorithm at a speed of 5Gbps. From one of the researchers: 'This is very early proof-of-concept work where we're trying to demonstrate a convenient way to program FPGAs so that their potential to provide very fast processing power could be used much more widely in future computing and electronics. While many existing technologies currently make use of FPGAs, including plasma and LCD televisions and computer network routers, their use in standard desktop computers is limited. However, we are already seeing some microchips which combine traditional CPUs with FPGA chips being announced by developers, including Intel and ARM. I believe these kinds of processors will only become more common and help to speed up computers even further over the next few years.'"
How long will it be before we will see the first motherboards with FPGA emerge?
Then you can download the CPU type of your choice:
-- naah, I don't like this new Intel core, I will try the latest AMD instead...
or 1,000 logic blocks? Are they equivalent? Aren't FPGAs common and generally contain multiple logic blocks?
I still cannot find the droids I am looking for...
It may be too late, but perhaps someone could talk with Viva Computing, LLC who now owns the assets of Star Bridge Systems. It was not specified in the news release if they also own the intellectual property.
"Giving money and power to governments is like giving whiskey and car keys to teenage boys." - P.J. O'Rourke
Oooo I bet they run HOT!
Remember the "10 Deca chip?"
This story was already submitted two times before eldavojohn managed to get it to the front page in a little over an hour...
http://tech.slashdot.org/submission/1432844/University-of-Glasgow-pioneers-1000-core-processor
http://tech.slashdot.org/submission/1432512/1000-core-processors-
"Lame" - Galaxar
...to a paper that assumes that the reader already knows what a cpu is? This article is content-free.
Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
I think this is a great development. I've been using FPGAs in medical imaging for about 15 years. The groups that use the GPUs are getting great performance--definitely--but seeing as how MRI and CT machines are placed and need to run for 10, 15, 20 years, I don't see how the GPUs will survive that time. One large OEM was pushing the GPUs for their architecture and I can't believe it will be successful if success is measured on the longevity scale. I'm sure the service sales guy will clean up.
Why do GPUs fail? I'm not sure of the exact modes of failure but the amount of heat has got to have something to do with it. FPGAs will run much cooler and in the FLOPS/Watt game, will win.
What is particularly new about FPGAs w/ dedicated circuits being faster than general-purpose circuits? This just in - ASIC circuit implementing FPGA circuit is 1000s of times faster and more energy efficient.
It seems like the particular news of this story is that each "core" (this article is using lay-notation, so it's hard to pinpoint what exactly they mean by the term processing core) was given some memory. So this seems to be more an article examining distributed computing & how that can be done within an FPGA.
What are the practical differences between targeting an FPGA on a computing platform and targeting more ubiquitous massively-parallel programmable pipelines in modern GPUs? Also, what are the fundamental differences? Could my GPU already contain FPGAs?
The story's been up for 20 minutes and no one's tried to imagine a Beowulf cluster of them yet? This is a great sadness.
The researchers then used the chip to process an algorithm which is central to the MPEG movie format – used in YouTube videos – at a speed of five gigabytes per second: around 20 times faster than current top-end desktop computers.
20x speed is getting closer to what I need before I can even ATTEMPT to build my very own Holodeck.
http://en.wikipedia.org/wiki/Holodeck
He who knows best knows how little he knows. - Thomas Jefferson
Don't you mean you are dissapoint?
Yawn. Seriously.
(says the guy who does FPGA design for a living.)
I would have gotten first post, but I was still waiting for my 'cat /proc/cpuinfo' to finish.
We first need to break the lock of the x86 instruction set and the operating systems that require it. CPUs already try to execute multiple x86 instructions in parallel, but this is severely limited by the sequential design of the instruction set. There needs to be a way to express computations A and B using different sets of virtual registers and let the hardware execute them sequentially or in parallel depending on its capabilities, or vectorize/parallelize multiple iterations of a loop. If software, including operating systems, is coded in a higher-level virtual machine bytecode interpreted by a hypervisor, a lot of parallelism can be expressed for future use while still permitting efficient execution on current hardware. LLVM is a good start, although it needs a lot more concurrency/vectorization information to take advantage of coprocessors, GPUs and massively parallel architectures.
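To make the "different sets of virtual registers" idea concrete, here is a minimal Python sketch. It is not how LLVM or any real CPU does it; the `Op` type, the register names and the `schedule` helper are all made up for illustration. It only shows that once each operation names the registers it reads and writes, a scheduler can work out which operations are independent and may share a time step.

```python
from typing import List, NamedTuple, Tuple


class Op(NamedTuple):
    """One hypothetical virtual-register operation: dest = f(srcs)."""
    dest: str
    srcs: Tuple[str, ...]


def conflicts(earlier: Op, later: Op) -> bool:
    """True if `later` must not share a step with, or precede, `earlier`."""
    return (
        earlier.dest in later.srcs      # read after write
        or later.dest in earlier.srcs   # write after read (treated conservatively)
        or later.dest == earlier.dest   # write after write
    )


def schedule(ops: List[Op]) -> List[int]:
    """Give each op the earliest step that comes after every conflicting earlier op."""
    steps: List[int] = []
    for i, op in enumerate(ops):
        step = 0
        for j in range(i):
            if conflicts(ops[j], op):
                step = max(step, steps[j] + 1)
        steps.append(step)
    return steps


# Computation A uses v0..v3, computation B uses v4..v7 (assumed names).
program = [
    Op("v0", ("v1", "v2")),  # A: v0 = v1 + v2
    Op("v3", ("v0", "v0")),  # A: v3 = v0 * v0
    Op("v4", ("v5", "v6")),  # B: v4 = v5 + v6
    Op("v7", ("v4", "v4")),  # B: v7 = v4 * v4
]

if __name__ == "__main__":
    print(schedule(program))
```

Because A and B touch disjoint registers, the two chains land in the same two steps. Hardware with a single execution unit could still run the four operations in program order, which is the "sequentially or in parallel depending on its capabilities" part.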
As this is made of an FPGA, I'm assuming they may be employing a rather broad definition of the term CPU. I'd imagine it's probably more like 1000 DSPs, and their choice of an MPEG algorithm as a demonstration piece would seem to reinforce this assumption.
I'm not saying I wouldn't bite their arm off for a chance to play with it, but this is probably more akin to shifting non-branching code off onto the GPU than it is to general purpose computing.
"However, we are already seeing some microchips which combine traditional CPUs with FPGA chips being announced by developers, including Intel and ARM."
welcome to 2004.
Xilinx Virtex II, includes internal PPC 405GP
A programmable hardware platform would provide amazing computing power because of hardware specialization: rather than emulating a proper CPU, you would download core architecture into the FPGA to accelerate tasks such as REGEX processing or H.264 decoding. You could compile the entire logic of a program into a gate array with various logical operators and flip-flop circuits for unlimited (albeit slow) registers (L2 registers) as well as including standard registers and SRAM cache (L1).
Although the FPGA runs slower than a regular CPU, direct programming rather than instructional programming (that is logic blocks that perform programmatic functions, rather than logic blocks that interpret discrete instructions to follow programmatic functions) would shorten the overall hardware logic path. In short, the chip would follow fewer clock cycles and instead just "do things." The CPU would be slow, but optimized for your workload. The main performance bottleneck would be the context switch: replacing the logic gate configuration with a new program every time you switch. Other than that, dynamic program expansion could be utilized: inlining operations like multiplication, addition, etc, or breaking them out if space constraints make it hard to load the whole program onto the FPGA that way.
The obvious, major issue we see is, of course, a security issue. You can now reprogram the CPU. This makes it difficult to prevent a program from bypassing any and all hardware security measures. This is solved by implementing a completely new security design on the chip, by which the CPU itself (the FPGA) is under control of external security mechanisms (paging etc handled in the MMU, outside the FPGA space, would largely mitigate most of this); it's not impossible to deal with, it's just an issue that needs to be raised.
In short, this sucks for "download the new Intel CPU into your BIOS/bootloader." This sucks for whatever general purpose CPU you can think of. For an entirely new programmatic platform, however, this would provide some interesting performance possibilities, and some interesting challenges.
Support my political activism on Patreon.
It's almost enough to run Vista!
Does it run Linux?
Now we need a chip that can take any given problem and divide it into one thousand parts so we can feed it into these processors. -Gives me a headache!
This will be much more of a pain on the developer side. Can you imagine writing a full raytracer in VHDL, not to mention having to design and implement your own clocking scheme? It will be interesting to see just how many developers really take advantage of it...at least besides the hardcore HDL devs....
Software developers have barely figured out how to write single threaded algorithms without crashing. Now we are seeing more multithreaded algorithms with race conditions, deadlocks and other data-sharing bugs.
Can you imagine what will happen if every desktop machine has one or two FPGAs available for programs to use as needed?
PHB says "Hey, I've heard that you can make the program faster if you program custom hardware on the motherboard's FPGA. Get the new intern to write some FPGA code for our algorithms, and then re-write the module to use it. We'll ship it next month!"
Multicore processors have made software development much more difficult, and putting an FPGA there will make it another two orders of magnitude more difficult. And programmers aren't getting smarter nearly as fast as the hardware is getting more complicated to program.
I hate it when I make a joke and I get modded "+5 insightful". Mod the stupid comments "funny", not "insightful", please
Ten years ago some young 6-digit ID Slashdotter was getting modded down for suggesting a Beowulf cluster of cores. Who's laughing now, mods?!?!?
I swear to God...I swear to God! That is NOT how you treat your human!
Without digging for any additional information, it bugs me that this chip has 1000 cores and not 1024.
The ultimate end to this trend is to build a system that is just core processing logic, with logic and memory all fused as closely as possible. I call it the BitGrid... it consists of 4-bit lookup tables hooked into an orthogonal grid. Because every single table can be used simultaneously, there is no von Neumann bottleneck to worry about.
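For anyone wondering what a 4-input lookup table actually does, here is a small Python sketch. The comment above does not spell out the cell design, so this assumes one output bit per table, driven by four input bits (say, one from each neighbour in the grid); `make_table` and `lut4` are hypothetical names, not part of any BitGrid code.

```python
def make_table(fn) -> int:
    """Pack a Python function of four bits into a 16-bit truth table."""
    table = 0
    for index in range(16):
        a, b, c, d = ((index >> k) & 1 for k in range(4))
        if fn(a, b, c, d):
            table |= 1 << index
    return table


def lut4(table: int, a: int, b: int, c: int, d: int) -> int:
    """Evaluate the table: the four input bits select one of its 16 entries."""
    index = a | (b << 1) | (c << 2) | (d << 3)
    return (table >> index) & 1


# "Programming" a cell means nothing more than choosing its 16 table bits.
PARITY = make_table(lambda a, b, c, d: a ^ b ^ c ^ d)
ALL_ONES = make_table(lambda a, b, c, d: a & b & c & d)

if __name__ == "__main__":
    print(hex(PARITY), hex(ALL_ONES))
    print(lut4(PARITY, 1, 0, 1, 1))
```

There is no instruction fetch anywhere in this model: the table is the program, which is why every cell in such a grid could in principle evaluate on every step.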
Petaflops... here we come.... !
Someone had to say it, be kind.
Mine goes to 1011.
Proverbs 21:19
I think this is fantastic that a 1000-core processor is in development.
I hate to be the devil's advocate but at what point will Amdahl's Law take hold fully and adding more cores to a processor will prove to be a fruitless endeavor?
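For reference, Amdahl's Law says that if a fraction p of the work can be parallelized, the best possible speedup on n cores is 1 / ((1 - p) + p / n). A quick Python sketch to put numbers on the question; the function name and the sample values of p are illustrative assumptions, not measurements of the Glasgow chip.

```python
def amdahl_speedup(parallel_fraction: float, cores: int) -> float:
    """Upper bound on speedup from Amdahl's Law: 1 / ((1 - p) + p / n)."""
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if cores < 1:
        raise ValueError("cores must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / cores)


if __name__ == "__main__":
    # Illustrative parallel fractions, not taken from the article.
    for p in (0.50, 0.90, 0.99):
        for n in (4, 64, 1000):
            print(f"p={p:.2f} n={n:5d} speedup={amdahl_speedup(p, n):7.2f}")
```

With p = 0.90, a thousand cores give a bit under 10x, and the bound can never exceed 1 / (1 - p) no matter how many cores are added. So the answer to "at what point" depends almost entirely on how small the serial part of the workload is, which is why demos like this pick something as parallel-friendly as video encoding.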